Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
175 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Twitter User Representation Using Weakly Supervised Graph Embedding (2108.08988v3)

Published 20 Aug 2021 in cs.CL, cs.AI, cs.CY, cs.LG, and cs.SI

Abstract: Social media platforms provide convenient means for users to participate in multiple online activities on various contents and create fast widespread interactions. However, this rapidly growing access has also increased the diverse information, and characterizing user types to understand people's lifestyle decisions shared in social media is challenging. In this paper, we propose a weakly supervised graph embedding based framework for understanding user types. We evaluate the user embedding learned using weak supervision over well-being related tweets from Twitter, focusing on 'Yoga', 'Keto diet'. Experiments on real-world datasets demonstrate that the proposed framework outperforms the baselines for detecting user types. Finally, we illustrate data analysis on different types of users (e.g., practitioner vs. promotional) from our dataset. While we focus on lifestyle-related tweets (i.e., yoga, keto), our method for constructing user representation readily generalizes to other domains.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (60)
  1. Quantifying mental health from social media with neural user embeddings. arXiv preprint arXiv:1705.00335.
  2. Manifold regularization: A geometric framework for learning from labeled and unlabeled examples. Journal of machine learning research, 7(11).
  3. Building a sentiment summarizer for local service reviews.
  4. Latent dirichlet allocation. the Journal of machine Learning research, 3: 993–1022.
  5. An unsupervised aspect-sentiment model for online reviews. In Human language technologies: The 2010 annual conference of the North American chapter of the association for computational linguistics, 804–812.
  6. Cohen, J. 1960. A coefficient of agreement for nominal scales. Educational and psychological measurement, 20(1): 37–46.
  7. Predicting depression via social media. In Seventh international AAAI conference on weblogs and social media.
  8. You Shall Know a User by the Company It Keeps: Dynamic Representations for Social Media Users in NLP. arXiv preprint arXiv:1909.00412.
  9. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), 4171–4186.
  10. Dredze, M. 2012. How social media will change public health. IEEE Intelligent Systems, 27(4): 81–84.
  11. A unified neural network model for geolocating twitter users. In Proceedings of the 22nd Conference on Computational Natural Language Learning, 42–53.
  12. Predicting personality from twitter. In 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, 149–156. IEEE.
  13. Investigating the relationship between the content of online word of mouth, advertising, and brand performance. Marketing Science, 33(2): 241–258.
  14. Goyeche, J. R. 1979. Yoga as therapy in psychosomatic medicine. Psychotherapy and Psychosomatics, 31(1-4): 373–381.
  15. node2vec: Scalable feature learning for networks. In Proceedings of the 22nd ACM SIGKDD international conference on Knowledge discovery and data mining, 855–864.
  16. Long short-term memory. Neural computation, 9(8): 1735–1780.
  17. A Hierarchical Location Prediction Neural Network for Twitter User Geolocation. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 4734–4744.
  18. Vader: A parsimonious rule-based model for sentiment analysis of social media text. In Proceedings of the International AAAI Conference on Web and Social Media, volume 8.
  19. Islam, T. 2019. Yoga-Veganism: Correlation Mining of Twitter Health Data. arXiv preprint arXiv:1906.07668.
  20. Does yoga make you happy? analyzing twitter user happiness using textual and temporal information. In 2020 IEEE International Conference on Big Data (Big Data), 4241–4249. IEEE.
  21. Analysis of Twitter Users’ Lifestyle Choices using Joint Embedding Model. In Proceedings of the International AAAI Conference on Web and Social Media, volume 15, 242–253.
  22. Do You Do Yoga? Understanding Twitter Users’ Types and Motivations using Social and Textual Information. In 2021 IEEE 15th International Conference on Semantic Computing (ICSC), 362–365. IEEE.
  23. Effects of a high-protein ketogenic diet on hunger, appetite, and weight loss in obese men feeding ad libitum. The American journal of clinical nutrition, 87(1): 44–55.
  24. Community detection in social networks based on improved Label Propagation Algorithm and balanced link density. Physics Letters A, 383(8): 718–727.
  25. Khalsa, S. B. S. 2004. Treatment of chronic insomnia with yoga: A preliminary study with sleep–wake diaries. Applied psychophysiology and biofeedback, 29(4): 269–278.
  26. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
  27. Private traits and attributes are predictable from digital records of human behavior. Proceedings of the national academy of sciences, 110(15): 5802–5805.
  28. Decoupled Weight Decay Regularization. In International Conference on Learning Representations.
  29. Hierarchical modeling for user personality prediction: The role of message-level attention. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 5306–5316.
  30. A novel intervention including individualized nutritional recommendations reduces hemoglobin A1c level, medication use, and weight in type 2 diabetes. JMIR diabetes, 2(1): e5.
  31. Birds of a feather: Homophily in social networks. Annual review of sociology, 27(1): 415–444.
  32. Abusive language detection with graph convolutional networks. arXiv preprint arXiv:1904.04073.
  33. Neural character-based composition models for abuse detection. arXiv preprint arXiv:1809.00378.
  34. Unifying text, metadata, and user network representations with a neural network for geolocation prediction. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 1260–1272.
  35. From tweets to polls: Linking text sentiment to public opinion time series. In Proceedings of the International AAAI Conference on Web and Social Media, volume 4.
  36. Pantic, I. 2014. Online social networking and mental health. Cyberpsychology, Behavior, and Social Networking, 17(10): 652–657.
  37. Beyond weight loss: a review of the therapeutic uses of very-low-carbohydrate (ketogenic) diets. European journal of clinical nutrition, 67(8): 789–796.
  38. Glove: Global vectors for word representation. In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), 1532–1543.
  39. Deepwalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, 701–710.
  40. Exploiting Text and Network Context for Geolocation of Social Media Users. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 1362–1367.
  41. Forecasting the onset and course of mental illness with Twitter data. Scientific reports, 7(1): 13006.
  42. Characterizing and detecting hateful users on twitter. arXiv preprint arXiv:1803.08977.
  43. The health benefits of yoga and exercise: a review of comparison studies. The journal of alternative and complementary medicine, 16(1): 3–12.
  44. Bidirectional recurrent neural networks. IEEE transactions on Signal Processing, 45(11): 2673–2681.
  45. Characterizing Geographic Variation in Well-Being Using Tweets. ICWSM, 13: 583–591.
  46. Personality, gender, and age in the language of social media: The open-vocabulary approach. PloS one, 8(9): e73791.
  47. Predicting individual well-being through the language of social media. In Biocomputing 2016: Proceedings of the Pacific Symposium, 516–527. World Scientific.
  48. Document-word co-regularization for semi-supervised sentiment analysis. In 2008 Eighth ieee international conference on data mining, 1025–1030. IEEE.
  49. An evidence-based review of yoga as a complementary intervention for patients with cancer. Psycho-Oncology: Journal of the Psychological, Social and Behavioral Dimensions of Cancer, 18(5): 465–475.
  50. Twitter polarity classification with label propagation over lexical links and the follower graph. In Proceedings of the First workshop on Unsupervised Learning in NLP, 53–63.
  51. Soft-supervised learning for text classification. In Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 1090–1099.
  52. Weakly-supervised acquisition of labeled class instances using graph random walks. In Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 582–590.
  53. New regularized algorithms for transductive learning. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases, 442–457. Springer.
  54. Line: Large-scale information network embedding. In Proceedings of the 24th international conference on world wide web, 1067–1077.
  55. Visualizing data using t-SNE. Journal of machine learning research, 9(11).
  56. Detecting emotions in social media: A constrained optimization approach. In Twenty-fourth international joint conference on artificial intelligence.
  57. Life satisfaction and the pursuit of happiness on Twitter. PloS one, 11(3): e0150881.
  58. Overcoming language variation in sentiment analysis with social attention. Transactions of the Association for Computational Linguistics, 5: 295–307.
  59. A modified yoga-based exercise program in hemodialysis patients: a randomized controlled study. Complementary therapies in medicine, 15(3): 164–171.
  60. Learning from Labeled and Unlabeled Data with Label Propagation. Technical report.
Citations (8)

Summary

We haven't generated a summary for this paper yet.