Information Diffusion and External Influence in Networks
This paper addresses the dynamics of information diffusion within social networks, with a specific focus on distinguishing between information transmission via network connections and external sources. The authors develop a model that incorporates both internal diffusion mechanisms and external influences, such as mass media, to paper how information spreads, particularly within the Twitter network.
Model and Methodology
The authors present a probabilistic generative model that allows information to reach nodes through two channels: internal network links and external influences. The model acknowledges that traditional analyses often overlook the impact of external sources. This is critical, as approximately 29% of Twitter information volume is found to be influenced by external events, challenging the predominant node-to-node diffusion assumption.
Key components of the model include:
- Internal Hazard Function: Governs the time interval for information to pass from an infected node to its neighbors.
- Event Profile : Represents the time-varying probability of external exposures, capturing the presence of unobservable external influences.
- Exposure Curve : Translates the number of exposures into the likelihood of infection, thereby mapping exposure dynamics and probability of dissemination across the network.
The inference process involves iteratively estimating the exposure curve and event profile using a combination of predefined infection times and network structure, moving towards an optimal parameterization through convergence.
Empirical Analysis
Extensive experiments with synthetic and Twitter data validate the model. Synthetic data experiments demonstrate the model's robustness in accurately inferring parameters compared to simpler baseline methods.
In the real data analysis, the researchers apply the model to Twitter, specifically analyzing the spread of URLs. The inferred event profiles align well with known external events, demonstrating the model's capability to detect exogenous influences effectively. For instance, the model accurately identifies spikes in event profiles corresponding to specific external triggers in the Tucson, Arizona shooting case paper.
Key Findings and Implications
Several insights emerge from the analysis:
- Extent of External Influence: On average, 29% of URL exposures on Twitter are attributed to external sources. This finding underscores the significance of considering non-network influences in information diffusion models.
- Topic-Specific Insights: The model reveals differences in external influence across categories. Politics and Sports are notably influenced by external sources, whereas Technology and Entertainment are more internally driven.
- Exposure Dynamics: The paper provides empirical evidence suggesting a high selectivity in idea adoption, as inferred from the consistently low values, the peak probability of infection.
The proposed model not only distinguishes internal and external influences but also enhances the understanding of how information propagates within and outside the network structure. This has broader implications for developing strategies in network-based marketing, information control, and policy-making regarding the spread of information within digital environments.
Future Directions
The findings pose interesting questions for future exploration. Consideration of how individual node behaviors contribute to broader diffusion patterns could yield deeper insights into information dynamics. Furthermore, extending the model to address the heterogeneous nature of external influences across nodes could refine predictions and improve applications in identifying network influencers.
Overall, this paper contributes a nuanced understanding of the mechanisms driving information diffusion, revealing the critical role of external influences in network-based communication models.