Concentric Network Symmetry
- Concentric network symmetry is a measure that quantifies local structural regularity by assessing how network connectivity fans out from a node in concentric layers.
- It employs constrained random walks with entropy-based probabilities and applies backbone and merged transformations to capture multiscale node-centric balance.
- The framework is used to distinguish network models and enhance applications like authorship attribution by extracting distinctive stylistic patterns from word adjacency networks.
Concentric network symmetry quantifies the local structural regularity around nodes in complex networks by evaluating how uniformly network connectivity “fans out” from a focal node across concentric shells of increasing path length. Unlike global automorphism-based symmetry metrics, this framework captures multiscale, node-centric balance using entropy-based measures derived from constrained random walks, combined with specific topological transformations of the induced subgraph formed by progressively larger neighborhoods. Concentric network symmetry has demonstrated utility in distinguishing network models, characterizing real-world networks, and extracting distinctive stylistic signals from word adjacency networks for tasks such as authorship attribution.
1. Mathematical Framework and Definitions
Let be an undirected, unweighted network, with a reference node . The -th concentric level consists of nodes at shortest-path distance exactly from . The -pattern of node is the induced subgraph on nodes .
A concentric random walk is defined such that, starting at , the walker always moves outward; edges that revisit inner circles are forbidden. Transition probabilities 0 denote the probability of a walker reaching 1 in 2 steps via allowed paths.
Intra-level edges (within the same 3) are handled by two complementary transformations:
- Backbone pattern: All intra-level edges are removed (“infinite-cost”), emphasizing tree-like expansion.
- Merged pattern: All connected components of each level are fused into a “supernode”; parallel cross-level edges are recorded as weights (“zero-cost” intra-level moves).
The node-level symmetry at layer 4 is calculated as: 5 where 6 is the set of level-7 nodes actually reachable under the walk and 8 is the entropy of the conditional arrival probabilities: 9 The backbone symmetry 0 and merged symmetry 1 are computed using the backbone and merged patterns, respectively (Silva et al., 2014, Amancio et al., 2015).
2. Algorithmic Procedure and Implementation
The algorithm proceeds as follows:
- For a given reference node 2 and radius 3, perform BFS to record distances and retrieve all 4.
- For each 5, extract the 6-pattern subgraph.
- Construct backbone and merged auxiliary graphs by enforcing the respective intra-level edge transformations.
- Compute outward-only random-walk transition probabilities to 7 using dynamic programming, accounting only for allowed edge traversals.
- Compute the entropy 8 and determine the size of the reachable set 9.
- Assign 0 and 1 using the normalized formula above.
Computational complexity is 2 per node, where 3 is the radius of exploration (Silva et al., 2014).
3. Empirical Properties Across Network Types
Concentric symmetry metrics have been analyzed on various synthetic and real networks (Silva et al., 2014):
- Erdős–Rényi (ER): At 4, both measures cluster near 1 due to tree-like local structure. By 5, shortcut links reduce symmetry.
- Barabási–Albert (BA): Peripheral low-degree nodes retain high symmetry; symmetry is broken at hubs due to irregular branching.
- Random Geometric (GEO) & Voronoi (VOR): High local clustering leads to greater dispersion and rapid symmetry decay at larger 6.
- Real-world networks (Wikipedia citation, airport, street): Exhibit broad scatter in 7; tree-like and highly clustered fringes produce locally high symmetry, while heterogeneous cores exhibit lower values.
Low Pearson correlations (typically 8) with classical metrics (degree, betweenness, clustering) demonstrate that concentric symmetry quantifies structural features overlooked by conventional statistics.
4. Applications in Word Adjacency Networks and Authorship Attribution
Amancio et al. (Amancio et al., 2015) applied concentric symmetry to word adjacency networks derived from pre-processed texts. Nodes represent lemmatized, content-bearing words; edges encode consecutive word occurrences. For each word and each layer 9, backbone and merged symmetries are computed following the network-centric procedures above.
Notable findings include:
- The empirical distribution of merged symmetry 0 in novels resembles a shifted logistic or power-law, with high symmetry being rare.
- Backbone symmetry distributions are bimodal, reflecting regularity in peripheral words versus core hubs.
- The symmetry metrics are essentially uncorrelated with degree, betweenness, or clustering, confirming their orthogonality.
In authorship attribution, books by 8 authors were classified based on 1 and 2 for 3 computed on 229 common content words. Using a Support Vector Machine (SVM) or Multilayer Perceptron (MLP) with merged symmetry at 4, classification accuracy reached 82.5%—substantially higher than other topological or stylometric baselines. Notably, some authors systematically favor high- or low-symmetry patterns for particular words, visible as distinct clusters in scatterplots of 5 (Amancio et al., 2015).
| Layer 6 | SVM 7 | SVM 8 |
|---|---|---|
| 2 | 75.0% | 32.5% |
| 3 | 70.0% | 70.0% |
| 4 | 82.5% | 70.0% |
5. Statistical Interpretation and Principal Component Analysis
Principal component analysis (PCA) on the feature vector 9 demonstrates that the dominant components separate classes of synthetic versus real networks and discriminate between spatial, random, and scale-free models. The first two PCs typically capture about 86% of the total variance, with ER and street networks exhibiting highest overall symmetry, while scale-free and highly heterogeneous networks (BA, Wikipedia, airport) are lowest (Silva et al., 2014).
6. Implications and Applications
Concentric symmetry forms a bounded, local, multiscale family of topological features quantifying the radial regularity of network expansion from a node. The dual treatment of intra-level connectivity (backbone and merged) provides complementary perspectives:
- In text, these metrics enrich representations for authorship, style, topical-word identification, and retrieval, providing features that are functionally distinct from degree and centrality (Amancio et al., 2015).
- In general network science, they enable fine-grained discrimination among structural motifs and augment classical summary measures in network classification, clustering, and exploration of local versus global organization (Silva et al., 2014).
A plausible implication is that concentric symmetry can be generalized to semantic or higher-order concept networks, extending its reach in computational linguistics and complex systems analysis. Its complementarity with classical metrics suggests value in fusion-based graph analysis pipelines, especially where multiscale local regularity plays a functional or interpretive role.