HCTCD: Temporal & Collaboration Decay Metric
- The paper introduces HCTCD, which integrates temporal and collaboration decay into harmonic closeness to predict citation outcomes.
- The methodology applies exponential decay functions on collaboration recency and frequency, computed via BFS-derived shortest paths in co-authorship graphs.
- Empirical findings demonstrate HCTCD outperforms standard centrality measures, achieving up to a 4% improvement in citation prediction accuracy.
Harmonic Closeness with Temporal and Collaboration Count Decay (HCTCD) is a centrality metric developed to assess the structural position of authors in scientific collaboration networks while explicitly accounting for both the recency and the frequency of collaborations. By extending the classical harmonic closeness framework with decay mechanisms for temporal distance and collaboration intensity, HCTCD quantifies an author's global network reach, emphasizing the dynamic, evolving topology of co-authorship graphs. The metric has demonstrated superior predictive power for citation outcomes compared to standard centrality measures, indicating its utility in explaining disparities in scientific recognition within top-tier academic venues (Jie et al., 26 Dec 2025).
1. Formal Mathematical Definition
Let denote the undirected co-authorship graph at a given publication year . For every ordered author pair , the components are defined as:
- : the shortest-path distance in (unweighted)
- : “age” since the last collaboration (where is the year of the most recent prior joint publication)
- : total number of publications between and before
Two exponential decay terms are introduced:
- Temporal decay: ,
- Collaboration count decay: ,
The effective pairwise weight is .
HCTCD for node is:
where is the total number of authors.
2. Algorithmic Computation and Aggregation
For a given publication at :
Inputs:
- : Author list of paper
- The full co-authorship graph constructed from all publications before
The computation proceeds as follows:
- Precompute all-pairs shortest paths via BFS for the unweighted graph.
- For each node :
- For every :
- Compute and .
- Calculate .
- Accumulate .
- Normalize: .
- For every :
For team-level aggregation on an -author paper:
| Aggregation Type | Formula |
|---|---|
| Unweighted sum | |
| Unweighted average | |
| Weighted sum | |
| Weighted average |
Here, modulates emphasis on lead authors ( implies second author weight of the first).
3. Parameterization and Default Values
Empirically optimized parameters are:
- Temporal decay rate (negative allows older collaborations to retain partial influence)
- Collaboration decay rate (diminishing marginal returns for repeat co-authorship)
- Author-order weight
- Time windows for dynamic centrality: 1, 2, 4, 8, 16 years pre-; a fixed 8-year window is standard for regression analysis
A grid search over 2015–2016 data identified robust optimality ranges: (best at ), (best at $0.45$), , after which performance plateaus.
4. Comparative Perspective
Standard harmonic closeness centrality is defined as , neglecting both tie recency and collaboration frequency. HCTCD down-weights obsolete links and adjusts for repeated collaborations, thus capturing:
- The reduced importance of aged collaborations (via )
- Saturation effects from prolific dyads (via )
Unlike metrics focusing solely on time or frequency, HCTCD integrates both axes, reflecting that aged collaborations diminish in importance while repeated collaboration with the same partner yields diminishing additional network reach. A plausible implication is that HCTCD better models real-world scholarly influence dynamics in evolving scientific communities than single-factor extensions.
5. Empirical Findings and Predictive Utility
Empirical evaluation against citation percentile statistics for 17,942 papers yields:
- Bivariate Pearson correlation values (16-year window): degree centrality ($0.275$), closeness ($0.389$), HCTCD unweighted sum ($0.382$), HCTCD weighted sum ($0.397$, highest observed)
- Beta regression (citation percentile as outcome, controlling for content and covariates): including HCTCD increases from $0.6045$ to $0.6083$ (AIC), with estimated coefficient ,
- Predictive models: XGBoost MSE decreases from $0.03645$ to $0.03502$ (), correlation rises from $0.7025$ to $0.7165$; Random Forest correlation $0.717$ (with centralities) versus $0.700$ (without)
The combination of recency and tie-strength decay, along with harmonic closeness, affords HCTCD consistent improvements in both explanatory and predictive contexts for citation outcomes.
6. Implementation Complexity and Workflow
- All-pairs shortest paths: via BFS per node on an unweighted co-authorship graph
- Weight matrices (last collaboration time, collaboration count) are pre-aggregated by a single scan of the publication history
- Centrality scores are computed dynamically for each focal window; team-level HCTCD aggregation per publication uses the above author-weighted sums or averages
- Parameter optimization for is performed by grid search maximizing Pearson correlation to citation percentiles in a held-out subset
- Beta regression uses statsmodels.betareg (Python); model evaluation utilizes both regression and tree-based ensemble methods
7. Interpretation and Significance
HCTCD unifies global network centrality, recency weighting, and saturation-adaptive tie strength to model how changing collaboration patterns influence future scholarly impact. Temporal decay () ensures that recent collaborations exert stronger influence, while collaboration-count decay () prevents overemphasis on highly prolific dyads. Weighted author-order aggregation () captures the hierarchical prominence of lead authors. Across all tested scenarios, HCTCD outperforms classical centralities in accounting for citation disparities, offering up to a 4% reduction in predictive mean squared error and revealing the centrality-driven inequities embedded in citation dynamics. This metric substantiates arguments for network-aware evaluation frameworks in the assessment of scientific recognition (Jie et al., 26 Dec 2025).