Homophily-adjusted social influence estimation (2405.18413v1)
Abstract: Homophily and social influence are two key concepts of social network analysis. Distinguishing between these phenomena is difficult, and approaches to disambiguate the two have been primarily limited to longitudinal data analyses. In this study, we provide sufficient conditions for valid estimation of social influence through cross-sectional data, leading to a novel homophily-adjusted social influence model which addresses the backdoor pathway of latent homophilic features. The oft-used network autocorrelation model (NAM) is the special case of our proposed model with no latent homophily, suggesting that the NAM is only valid when all homophilic attributes are observed. We conducted an extensive simulation study to evaluate the performance of our proposed homophily-adjusted model, comparing its results with those from the conventional NAM. Our findings shed light on the nuanced dynamics of social networks, presenting a valuable tool for researchers seeking to estimate the effects of social influence while accounting for homophily. Code to implement our approach is available at https://github.com/hanhtdpham/hanam.
- Social fmri: Investigating and shaping social mechanisms in the real world. Pervasive and mobile computing, 7(6):643–659, 2011.
- Z. W. Almquist. networkdata: Lin Freeman’s Network Data Collection, 2014. URL https://github.com/Z-co/networkdata. R package version 0.01.
- C. T. Butts. sna: Tools for Social Network Analysis, 2020. URL https://CRAN.R-project.org/package=sna. R package version 2.6.
- C. T. Butts. A perturbative solution to the linear influence/network autocorrelation model under network dynamics. arXiv preprint arXiv:2310.20163, 2023.
- A limited memory algorithm for bound constrained optimization. SIAM Journal on Scientific Computing, 16(5):1190–1208, 1995. 10.1137/0916069. URL https://doi.org/10.1137/0916069.
- C.-F. Chen. On asymptotic normality of limiting density functions with bayesian implications. Journal of the Royal Statistical Society: Series B, 47(3):540–546, 1985. URL https://rss.onlinelibrary.wiley.com/doi/abs/10.1111/j.2517-6161.1985.tb01384.x.
- Bayesian estimation of the network autocorrelation model. Social Networks, 48:213 – 236, 2017. ISSN 0378-8733. https://doi.org/10.1016/j.socnet.2016.09.002. URL http://www.sciencedirect.com/science/article/pii/S0378873316300600.
- P. Doreian. Linear models with spatially distributed data: Spatial disturbances or spatial effects? Sociological Methods & Research, 9(1):29–60, 1980.
- P. Doreian. Network autocorrelation models: Problems and prospects. 1989. Prepared for the 1989 Symposium “Spatial Statistics: Past, present, future,” Department of Geography, Syracuse University.
- H. Glanz and L. Carvalho. An expectation–maximization algorithm for the matrix normal distribution with an application in remote sensing. Journal of Multivariate Analysis, 167:31–48, 2018. ISSN 0047-259X. https://doi.org/10.1016/j.jmva.2018.03.010. URL https://www.sciencedirect.com/science/article/pii/S0047259X17300477.
- Model-based clustering for social networks. Journal of the Royal Statistical Society: Series A (Statistics in Society), 170(2):301–354, 2007.
- K. M. Harris. The national longitudinal study of adolescent health: Research design. http://www. cpc. unc. edu/projects/addhealth/design, 2011.
- Latent space approaches to social network analysis. Journal of the american Statistical association, 97(460):1090–1098, 2002.
- Approximate riemannian conjugate gradient learning for fixed-form variational bayes. Journal of Machine Learning Research, 11:3235–3268, 2010. ISSN 1532-4435.
- A generalized spatial two-stage least squares procedure for estimating a spatial autoregressive model with autoregressive disturbances. The journal of real estate finance and economics, 17:99–121, 1998.
- Fitting position latent cluster models for social networks with latentnet. Journal of Statistical Software, 24(5):1–23, 2008.
- latentnet: Latent Position and Cluster Models for Statistical Networks. The Statnet Project (https://statnet.org), 2020. URL https://CRAN.R-project.org/package=latentnet. R package version 2.10.5.
- Representing degree distributions, clustering, and homophily in social networks with latent cluster random effects models. Social Networks, 31(3):204–213, 2009.
- H. Li and D. K. Sewell. A comparison of estimators for the network autocorrelation model based on observed social networks. Social Networks, 66:202–210, 2021.
- Network cross-validation by edge sampling. Biometrika, 107(2):257–276, 04 2020. ISSN 0006-3444. 10.1093/biomet/asaa006. URL https://doi.org/10.1093/biomet/asaa006.
- Identifying the latent space geometry of network models through analysis of curvature. Journal of the Royal Statistical Society Series B: Statistical Methodology, 85(2):240–292, 03 2023. ISSN 1369-7412. 10.1093/jrsssb/qkad002. URL https://doi.org/10.1093/jrsssb/qkad002.
- E. McFowland and C. R. Shalizi. Estimating causal peer influence in homophilous social networks by inferring latent locations. Journal of the American Statistical Association, 118(541):707–718, 2023. 10.1080/01621459.2021.1953506. URL https://doi.org/10.1080/01621459.2021.1953506.
- K. Nowicki and T. A. B. Snijders. Estimation and prediction for stochastic blockstructures. Journal of the American Statistical Association, 96(455):1077–1087, 2001.
- K. Ord. Estimation methods for models of spatial interaction. Journal of the American Statistical Association, 70(349):120–126, 1975.
- Maximum likelihood embedding of logistic random dot product graphs. Proceedings of the AAAI Conference on Artificial Intelligence, 34(04):5289–5297, Apr. 2020. 10.1609/aaai.v34i04.5975. URL https://ojs.aaai.org/index.php/AAAI/article/view/5975.
- Physical activity and mental health: current concepts. Sports medicine, 29:167–180, 2000.
- Properties of latent variable network models. Network Science, 4(4):407–432, 2016. 10.1017/nws.2016.23.
- M. Salter-Townshend and T. B. Murphy. Variational bayesian inference for the latent position cluster model for network data. Computational Statistics & Data Analysis, 57(1):661–671, 2013.
- D. K. Sewell and Y. Chen. Latent space models for dynamic networks with weighted edges. Social Networks, 44:105–116, 2016.
- Homophily and contagion are generically confounded in observational social network studies. Sociological methods & research, 40(2):211–239, 2011.
- Siena - simulation investigation for empirical network analysis, 2023. URL https://www.stats.ox.ac.uk/~snijders/siena/.
- Dynamic networks and behavior: Separating selection from influence. Sociological methodology, 40(1):329–393, 2010.
- Health benefits of physical activity: the evidence. CMAJ, 174(6):801–809, 2006. ISSN 0820-3946. 10.1503/cmaj.051351. URL https://www.cmaj.ca/content/174/6/801.
- R. Xu. Alternative estimation methods for identifying contagion effects in dynamic social networks: A latent-space adjusted approach. Social Networks, 54:101–117, 2018.