Papers
Topics
Authors
Recent
Search
2000 character limit reached

3D LPA: Connectivity-Aware Label Propagation

Updated 25 January 2026
  • The paper introduces a connectivity-aware label propagation mechanism by integrating a neighborhood-strength metric to improve label selection and community detection quality.
  • It employs asynchronous updates and an active-node management strategy, drastically reducing redundant computations and iteration counts compared to standard LPA.
  • Empirical evaluations on synthetic and real networks show enhanced modularity and scalability, particularly in settings with high local clustering.

Connectivity-Aware Label Propagation Algorithm (3D LPA) delivers a community detection architecture that incorporates local neighborhood connectivity into the label update criterion, improving both computational efficiency and partition quality compared to the standard Label Propagation Algorithm (LPA). It achieves nearly-linear runtime via asynchronous updates and active-node management, and bolsters accuracy—particularly in networks characterized by high local clustering—by privileging neighbors tightly integrated within a node’s immediate environment (Xie et al., 2011).

1. Neighborhood-Strength Measure

In the standard LPA, each node ii updates its label by selecting the most frequently occurring label among its neighbors N(i)N(i). The 3D LPA introduces a neighborhood-strength metric to quantify, for each neighbor jN(i)j\in N(i), the extent to which jj connects to the remainder of ii's local environment. This is formalized by:

hj(i)={kN(i){i}:(j,k)E},h_j(i) = |\{ k\in N(i)\setminus\{i\} : (j,k)\in E \}|,

where EE is the edge set. Thus, hj(i)h_j(i) counts the number of edges from jj to other neighbors of ii (excluding ii itself). For each candidate label kk present in N(i)N(i), the subset CkN(i)C_k \subseteq N(i) of neighbors carrying label kk receives a neighborhood-strength-driven score:

S(Ck)=jCk[1+chj(i)],S(C_k) = \sum_{j\in C_k}\bigl[1 + c\,h_j(i)\bigr],

where c[0,1]c \in [0,1] is a tunable parameter. At c=0c=0, 3D LPA reverts to standard LPA; at c=1c=1, each intra-neighborhood link receives equal weighting. The S(Ck)S(C_k) score thus fuses label frequency with intra-neighborhood cohesiveness.

2. Generalized Label Update Rule

Instead of the majority-vote mechanism, 3D LPA enacts the following asynchronous update for each selected node ii:

  1. For each candidate label kk in N(i)N(i), identify the set Ck={jN(i):L(j)=k}C_k = \{j \in N(i) : L(j) = k\}.
  2. Compute S(Ck)S(C_k) as above for each candidate kk.
  3. Set

L(i)argmaxkS(Ck),L(i) \leftarrow \underset{k}{\arg\max} S(C_k),

breaking ties randomly.

This refinement weights influence so that neighbors more densely connected within ii’s immediate vicinity exert proportionally greater effect. As cc increases, structurally central neighbors are systematically preferred over sparsely connected ones for label propagation.

3. Algorithmic Workflow with Speed-Up Optimizations

3D LPA integrates two principal enhancements:

  • The connectivity-weighted label selection as described.
  • An active-node list that manages which nodes need possible label reevaluation, yielding substantial efficiency gains.

Initialization:

  • Assign each node a unique label: L(i)L(i) \gets unique identifier.
  • ActiveList \gets all nodes.

Iterative Update Loop:

  1. Randomly select and remove node ii from ActiveList.
  2. For all jN(i)j \in N(i), compute hj(i)h_j(i).
  3. Partition N(i)N(i) into CkC_k for each observed label kk.
  4. Compute S(Ck)=jCk[1+chj(i)]S(C_k) = \sum_{j\in C_k}[1 + c \cdot h_j(i)] for all kk.
  5. Set L(i)L(i) \gets label kk^* with maximal S(Ck)S(C_k) (breaking ties randomly).
  6. If L(i)L(i) changes, propagate status updates:
    • For ii and each neighbor N(i)\ell \in N(i), determine node status:
      • Interior: all neighbors share its label.
      • Passive: node would not change label upon update.
      • Active: otherwise.
    • Update ActiveList to reflect status changes; add newly active, remove newly passive.

Optimization remarks:

  • Naïve hj(i)h_j(i) computation scales as O(di2)O(d_i^2), but adjacency bitsets and incremental histograms reduce this to O(dj)O(d_j) (or amortized O(1)O(1) per link).
  • The active-node bookkeeping ensures each update is effective (i.e., results in a label change), minimizing redundant operations.

4. Computational Complexity Analysis

Let nn denote the number of nodes, mm the number of edges in a sparse graph (m=O(n)m = O(n)).

Original LPA:

  • Each node scan: O(di)O(d_i).
  • Empirically, the number of iterations per node, Torig/nT_{\text{orig}}/n, ranges $1$–$20$.
  • Total complexity: O(mTorig/n)O(m \cdot T_{\text{orig}} / n).

3D LPA:

  • Effective update cost: O(di+jN(i)dj)O(d_i + \sum_{j \in N(i)} d_j) using direct computation, reduced to O(di)O(d_i) by histogram/bitset optimizations.
  • Number of iterations matches the count of label changes (passive/interior nodes excluded), with T3D/n<3T_{3D}/n < 3 observed empirically—representing $6$–20×20\times fewer iterations than original LPA.
  • Overall runtime per pass: O(m)O(m), but with significantly diminished constant and pass count.

Both algorithms maintain nominal linear scaling, but 3D LPA’s strictly “effective” updates deliver reduced running time and improved scalability for larger networks.

5. Empirical Validation on Synthetic and Real Networks

3D LPA was evaluated on:

  • Synthetic benchmarks: LFR benchmarks (N=1000N=1000, k=5\langle k \rangle=5, variable mixing μ\mu).
    • Metrics: Normalized Mutual Information (NMI), Adjusted Rand Index (ARI) against ground truth.
    • For μ<0.3\mu < 0.3, all cc values yield similar results.
    • For μ>0.35\mu > 0.35 up to $0.65$, c>0c>0 variants outperform c=0c=0 by $5$–20%20\% NMI/ARI, and maintain partition integrity where standard LPA fails.
  • Real-world networks: Nine social graphs (size range $34$–$10,680$ nodes; datasets include karate, football, netscience, email, PGP).
    • Metric: modularity QQ, with $100$ runs per dataset.
    • Highest QQ: c>0c>0 (often c=1c=1) surpasses c=0c=0 by up to $0.04$ absolute.
    • Mean QQ: c>0c>0 is consistently higher and more stable, fewer all-in-one collapses (c=1c=1).
    • Iteration reduction: 1.5×1.5\times21×21\times across diverse networks.

Improvements are most pronounced when clustering coefficients are high, as the algorithm’s weighting rewards neighbors that form cohesive local structures.

6. Context and Interpretation

3D LPA augments the parameter-free, asynchronous update framework of standard label propagation by embedding the neighborhood-strength score S(Ck)S(C_k) for each candidate label. The integration of connectivity-aware scoring and efficient bookkeeping produces accelerated convergence and reinforced community boundaries within topologies abundant in local triangles. This suggests that the method is particularly suited for network domains—social, biological, information—where clustering and intra-community density are signature features. A plausible implication is that the approach is robust against “collapse” into trivial partitions in the presence of strong local structure, supporting its utility in large-scale, real-world network analysis (Xie et al., 2011).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Connectivity-Aware Label Propagation Algorithm (3D LPA).