Papers
Topics
Authors
Recent
Search
2000 character limit reached

Omission-to-Modification Error Ratio

Updated 20 April 2026
  • Omission-to-Modification Error Ratio is a quantitative measure comparing omitted true ties (false negatives) with incorrectly reported ties (false positives) in network data.
  • It is employed in ROC-based aggregation methods where adjusting thresholds and weighting factors balances omission and modification errors for optimal network recovery.
  • Empirical studies in CSS networks show O:M ratios ranging from 4 to 20, highlighting the trade-off between liberal and conservative reporting in sparse social structures.

The Omission-to-Modification error ratio (O:M ratio) arises in the context of cognitive social structure (CSS) network studies, where individuals report not only their own direct social ties but also their perceptions of ties among all others in a bounded network. The O:M ratio serves as a quantitative metric that captures the trade-off between omission errors (false negatives, where actual ties are missed) and modification errors, also called commission errors (false positives, where non-ties are incorrectly reported as present). Used in conjunction with ROC-curve based aggregation methods, the O:M ratio enables explicit calibration of network estimates by controlling the relative emphasis placed on the minimization of each error type (Yenigun et al., 2016).

1. Formal Definition

The true relational structure is represented by a directed adjacency matrix A=(Aij)A=(A_{ij}) with size N×NN \times N, where Aij=1A_{ij}=1 if a true tie exists from ii to jj and $0$ otherwise. The estimated or perceived structure, denoted A^=(A^ij)\hat{A}=(\hat{A}_{ij}), may be inferred via aggregation, or may refer to a single respondent’s report. Using the indicator I{}I\{\cdot\} that is $1$ if its argument is true and 0 otherwise, the error rates are defined as:

  • Omission error rate EOE_O (false negatives):

N×NN \times N0

This quantifies the proportion of true ties missed in estimation.

  • Modification (commission) error rate N×NN \times N1 (false positives):

N×NN \times N2

This reflects the proportion of non-ties incorrectly categorized as ties.

  • O:M error ratio:

N×NN \times N3

This ratio encapsulates the relative prevalence of omission versus modification errors in any given network estimation scenario.

2. Role in ROC–Based Aggregation and Weighting

In ROC-curve based CSS aggregation, a threshold parameter N×NN \times N4 represents the minimum number of respondents required to agree on a tie for it to be considered present. For each N×NN \times N5, N×NN \times N6 and N×NN \times N7 are calculated, generating an empirical ROC curve plotting true positive rate (TPRN×NN \times N8) against false positive rate (FPRN×NN \times N9). The classical ROC criterion minimizes the Euclidean distance to the ideal Aij=1A_{ij}=10 point, corresponding to

Aij=1A_{ij}=11

which treats omission and commission errors equally (Aij=1A_{ij}=12).

The introduction of an explicit weighting factor Aij=1A_{ij}=13 generalizes this to

Aij=1A_{ij}=14

where Aij=1A_{ij}=15 corresponds to desired O:M emphasis (Aij=1A_{ij}=16 can also be denoted Aij=1A_{ij}=17). Selecting Aij=1A_{ij}=18 up-weights the cost of modification/commission errors, leading the optimization to favor thresholds that reduce false positives, often at the expense of increased omissions. The procedure thus allows the analyst to choose or justify a particular O:M trade-off directly in accordance with substantive considerations or network sparsity (Yenigun et al., 2016).

3. Conceptual and Empirical Patterns in CSS Work

CSS studies characteristically involve networks with low density (Aij=1A_{ij}=19) and a large imbalance between the number of possible ties and non-ties. Empirically, omission error rates (ii0) are observed between 0.54 and 0.72, while modification error rates (ii1) range from 0.03 to 0.14, yielding raw O:M ratios in the 4 to 20 range across studied datasets.

There is a strong negative correlation between ii2 and ii3 at the respondent level, indicating a trade-off: respondents who are "liberal" in their perceptions tend toward low omission but high commission errors; "conservative" respondents exhibit the reverse pattern. This supports the methodological need for explicit O:M balancing in aggregation (Yenigun et al., 2016).

4. O:M Ratio in Network Estimation Algorithm

The ROC-based threshold method for network aggregation (RTM) uses the O:M ratio via the weighting parameter ii4 to make the error trade-off both explicit and data-driven:

  1. Randomly sample ii5 CSS slices; compute their average density ii6.
  2. Set ii7 (or another value according to analytic priorities).
  3. For each candidate threshold ii8, calculate ii9, jj0, and jj1.
  4. Select jj2, and aggregate the network using this threshold.

In sparse networks, the recommended weight jj3 amplifies the importance of reducing false positives, which are disproportionately likely amidst a majority of non-ties.

5. Numerical and Simulation Results

Across five canonical CSS datasets (network size jj4–jj5, density jj6–jj7), observed mean jj8 values are jj9–$0$0 and $0$1 values are $0$2–$0$3, with O:M ratios generally between 4 and 20. An illustrative example ("High Tech Managers", $0$4, $0$5) demonstrates how varying $0$6 affects the O:M ratio:

Threshold $0$7 $0$8 $0$9 O:M
1 0.295 0.083 0.28
4 0.034 0.667 19.6

Correlation with the true network structure increases under the A^=(A^ij)\hat{A}=(\hat{A}_{ij})0-weighted ROC choice (e.g., A^=(A^ij)\hat{A}=(\hat{A}_{ij})1) versus an unweighted choice (A^=(A^ij)\hat{A}=(\hat{A}_{ij})2). Large-scale simulations confirm that using A^=(A^ij)\hat{A}=(\hat{A}_{ij})3 in the ROC minimization yields robust recovery, performing comparably or better than adaptive thresholding which constrains only commission errors.

6. Guidelines and Applied Selection

Choice of A^=(A^ij)\hat{A}=(\hat{A}_{ij})4 should be context-driven:

  • If false positives (commission errors) are much costlier, set A^=(A^ij)\hat{A}=(\hat{A}_{ij})5 to minimize A^=(A^ij)\hat{A}=(\hat{A}_{ij})6 further.
  • If false negatives (omission errors) are to be avoided, set A^=(A^ij)\hat{A}=(\hat{A}_{ij})7.
  • If no clear preference exists, use A^=(A^ij)\hat{A}=(\hat{A}_{ij})8 (classical ROC methodology).

The recommended approach is A^=(A^ij)\hat{A}=(\hat{A}_{ij})9 in sparse organizational networks, balancing the error rates for superior network recovery (Yenigun et al., 2016).

7. Worked Example

For a toy network with I{}I\{\cdot\}0, I{}I\{\cdot\}1 sampled slices, and observed counts as follows:

I{}I\{\cdot\}2 (threshold) False Positives False Negatives I{}I\{\cdot\}3 I{}I\{\cdot\}4 I{}I\{\cdot\}5 I{}I\{\cdot\}6 (for I{}I\{\cdot\}7=5)
1 2 1 0.333 0.25 0.417 1.667
2 1 2 0.167 0.5 0.528 1.305
3 0 3 0 0.75 0.75 0.75

With equal weighting (I{}I\{\cdot\}8), I{}I\{\cdot\}9 minimizes $1$0. With $1$1, corresponding to high stringency, $1$2 minimizes $1$3. This formalism directly operationalizes the trade-off between omission and commission errors within ROC-guided network estimation (Yenigun et al., 2016).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Omission-to-Modification Error Ratio (O:M).