Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Structure and Dynamics of Information Pathways in Online Media (1212.1464v1)

Published 6 Dec 2012 in cs.SI, cs.DS, cs.IR, and physics.soc-ph

Abstract: Diffusion of information, spread of rumors and infectious diseases are all instances of stochastic processes that occur over the edges of an underlying network. Many times networks over which contagions spread are unobserved, and such networks are often dynamic and change over time. In this paper, we investigate the problem of inferring dynamic networks based on information diffusion data. We assume there is an unobserved dynamic network that changes over time, while we observe the results of a dynamic process spreading over the edges of the network. The task then is to infer the edges and the dynamics of the underlying network. We develop an on-line algorithm that relies on stochastic convex optimization to efficiently solve the dynamic network inference problem. We apply our algorithm to information diffusion among 3.3 million mainstream media and blog sites and experiment with more than 179 million different pieces of information spreading over the network in a one year period. We study the evolution of information pathways in the online media space and find interesting insights. Information pathways for general recurrent topics are more stable across time than for on-going news events. Clusters of news media sites and blogs often emerge and vanish in matter of days for on-going news events. Major social movements and events involving civil population, such as the Libyan's civil war or Syria's uprise, lead to an increased amount of information pathways among blogs as well as in the overall increase in the network centrality of blogs and social media sites.

Insights into Dynamic Network Inference Using Information Diffusion Data

The paper "Structure and Dynamics of Information Pathways in Online Media" addresses the complex problem of inferring dynamic networks, a topic of significant interest due to its practical and theoretical implications for understanding the dissemination of information, behaviors, and even diseases over networks. The authors focus on scenarios where the networks over which these contagions or information spread are not directly observable and exhibit dynamic changes over time.

The key contribution of the paper lies in the novel approach to inferring such networks using data on information diffusion. At its core, the paper proposes an innovative on-line algorithm built on stochastic convex optimization techniques, designed to efficiently tackle the problem of dynamic network inference. This algorithm allows for the inference of daily networks of information diffusion across a vast dataset of over 3.3 million media sites and billions of information pieces tracked over a year-long period.

Algorithm and Methodology

The authors model information propagation as discrete networks of continuous temporal processes, leveraging prior work on static network models. The methodology is extended to account for time-varying transmission rates across network edges, utilizing stochastic gradient descent to iteratively refine the network models based on sampled diffusion cascades. This approach allows for efficient on-line updates to the network structure and parameters, capturing the evolution of pathways in response to the dynamics of ongoing events.

Empirical Evaluation

The algorithm is tested on both synthetic data and real-world datasets. The use of synthetic data confirms the algorithm's capability to accurately track dynamic changes in network topology and edge transmission rates across various network topologies and temporal patterns. When applied to real-world data, the algorithm provides valuable insights into information pathways across mainstream media and blogs in response to significant global events and topics, such as the Arab Spring movements or the Fukushima nuclear disaster. Notably, the paper highlights the observed increase in the centrality of blogs during civil movements, signifying their growing role in information dissemination.

Implications and Future Directions

Practically, the ability to dynamically infer evolving networks can greatly enhance monitoring and strategic interventions in information dissemination, public relations, and digital marketing. Theoretically, this work lays a foundation for further exploration into the temporal dynamics of networks, opening pathways for developing more sophisticated models adept at handling the complexities inherent in real-world data. Potential future developments may involve refining the algorithm's scalability, improving the precision of temporal inference, and extending the framework to account for additional external factors influencing network dynamics.

The paper situates itself as an important step towards understanding and modeling the flux in information pathways catering to dynamic environments, marking its relevance across domains like computer science, statistical modeling, and network analysis. Through meticulous experimentation and insightful analysis, it provides an advanced framework for researchers and practitioners interested in the complexities of dynamic information spread over large and evolving networks.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Manuel Gomez Rodriguez (30 papers)
  2. Jure Leskovec (233 papers)
  3. Bernhard Schölkopf (412 papers)
Citations (279)