Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Information Contagion: an Empirical Study of the Spread of News on Digg and Twitter Social Networks (1003.2664v1)

Published 13 Mar 2010 in cs.CY and physics.soc-ph

Abstract: Social networks have emerged as a critical factor in information dissemination, search, marketing, expertise and influence discovery, and potentially an important tool for mobilizing people. Social media has made social networks ubiquitous, and also given researchers access to massive quantities of data for empirical analysis. These data sets offer a rich source of evidence for studying dynamics of individual and group behavior, the structure of networks and global patterns of the flow of information on them. However, in most previous studies, the structure of the underlying networks was not directly visible but had to be inferred from the flow of information from one individual to another. As a result, we do not yet understand dynamics of information spread on networks or how the structure of the network affects it. We address this gap by analyzing data from two popular social news sites. Specifically, we extract social networks of active users on Digg and Twitter, and track how interest in news stories spreads among them. We show that social networks play a crucial role in the spread of information on these sites, and that network structure affects dynamics of information flow.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (2)
  1. Kristina Lerman (197 papers)
  2. Rumi Ghosh (24 papers)
Citations (895)

Summary

Information Contagion: An Empirical Study of the Spread of News on Digg and Twitter Social Networks

The paper "Information Contagion: an Empirical Study of the Spread of News on Digg and Twitter Social Networks" by Kristina Lerman and Rumi Ghosh provides a detailed empirical analysis of how information spreads through social networks, specifically focusing on Digg and Twitter. The paper examines the role social networks play in disseminating news stories and investigates how network structure impacts the dynamics of information flow.

Introduction

The significance of social networks in the dissemination of information is well established. With the advent of social media, the scope and scale of accessible data have dramatically increased, offering rich empirical foundations for understanding individual and group behavior in these networks. Previous research often lacked visibility into the underlying network structure and typically inferred it from observed information flow. By analyzing actual social network data from Digg and Twitter, this paper attempts to close the gap in understanding how network structure affects information dissemination dynamics.

Data Collection and Network Structure

Data was meticulously collected from Digg and Twitter, focusing on the activity of active users and their network connections. For Digg, 3,553 stories promoted to the front page in June 2009 were analyzed, whereas for Twitter, 398 stories from the same period were studied. The Digg dataset comprised interactions among 139,409 active users, with 71,834 of these having defined friendship links, forming a network of 258,220 friend links. Twitter's dataset involved 137,582 active users with a total of 6,200,051 follow relationships.

The network structure varied considerably between the two platforms. Digg's social network showed higher density and interconnectivity than Twitter's, evidenced by measures like the fraction of mutual links and clustering coefficients. This network characteristic heavily influences how information spreads across the platforms.

User Activity and Voting Dynamics

The analysis revealed that user activity on both platforms follows a long-tail distribution, with a small percentage of users contributing the majority of votes or retweets. However, despite both the platforms being used for similar purposes — to share news and information — their user interfaces and network dynamics resulted in different patterns of information spread.

On Digg, stories initially accumulate votes slowly until they are promoted to the front page. This event significantly accelerates vote accumulation. Conversely, on Twitter, the spread of a story via retweets is more consistent over time, suggesting a continuous and steady dissemination process through its social network.

Distribution of Story Popularity

Popularity distributions on both platforms adhered to a log-normal distribution, indicating that while most stories receive moderate attention, a few become exceedingly popular. However, the pattern differed for the distribution of network cascade sizes (i.e., the number of fan votes), which were normally distributed, suggesting different dynamics governing in-network activity.

Dynamics of Voting in Networks

A significant finding is the contrasting impact of network structure on information spread between Digg and Twitter. On Digg, due to denser interconnections, stories quickly propagate through user networks before promotion but slow down post-promotion due to exposure to unconnected users. On Twitter, the initial spread is slower but more far-reaching over time due to its less dense network.

Conclusions and Implications

The paper's findings emphasize the essential role network structure plays in the dynamics of information spread. Digg's dense network facilitates rapid initial spread within the network, while Twitter's broader, less interconnected network supports a wider but slower dissemination process.

Understanding these patterns holds significant implications for optimizing information dissemination strategies in social networks and improving peer production systems. Distinguishing in-network and out-of-network activities not only aids in identifying high-quality contributions but also in better predicting the spread and impact of information.

Future Directions

Future research could explore more nuanced models of user behavior and network dynamics to predict information spread more accurately. By incorporating factors such as user influence and topic-specific dissemination patterns, the predictive power of these models could be enhanced. Additionally, experimenting with different network structures could yield insights into optimizing social platforms for more effective information dissemination.

In summary, this paper provides pivotal insights into the empirical dynamics of information spread on social media platforms, underlining the critical influence of network structure on these processes. The results form a foundation for further explorations into optimizing digital information dissemination.