Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Can Large Language Models Transform Computational Social Science? (2305.03514v3)

Published 12 Apr 2023 in cs.CL and cs.LG

Abstract: LLMs are capable of successfully performing many language processing tasks zero-shot (without training data). If zero-shot LLMs can also reliably classify and explain social phenomena like persuasiveness and political ideology, then LLMs could augment the Computational Social Science (CSS) pipeline in important ways. This work provides a road map for using LLMs as CSS tools. Towards this end, we contribute a set of prompting best practices and an extensive evaluation pipeline to measure the zero-shot performance of 13 LLMs on 25 representative English CSS benchmarks. On taxonomic labeling tasks (classification), LLMs fail to outperform the best fine-tuned models but still achieve fair levels of agreement with humans. On free-form coding tasks (generation), LLMs produce explanations that often exceed the quality of crowdworkers' gold references. We conclude that the performance of today's LLMs can augment the CSS research pipeline in two ways: (1) serving as zero-shot data annotators on human annotation teams, and (2) bootstrapping challenging creative generation tasks (e.g., explaining the underlying attributes of a text). In summary, LLMs are posed to meaningfully participate in social science analysis in partnership with humans.

Evaluating the Potential of LLMs in Computational Social Science

The paper "Can LLMs Transform Computational Social Science?" by Ziems et al. explores the capacity of LLMs to supplement and enhance computational social science (CSS) research methodologies. The authors assess whether LLMs can reliably classify and elucidate social phenomena, such as persuasiveness and political ideology, without the need for extensive supervised training data. This document provides a comprehensive evaluation framework, addressing the adoption of LLMs in CSS and systematically examining their performance across a curated selection of representative tasks.

Methodological Framework

The authors first delineate the role of CSS, highlighting the potential for LLMs to obviate the constraints of resource-intensive data labeling processes. Subsequently, they devise a thorough evaluation schema, involving an extensive set of prompts and an array of both proprietary and open-source LLMs. Thirteen models are assessed across 25 CSS benchmarks, encapsulating a diverse spectrum of tasks typical to computational social science.

Evaluation Pipeline

The evaluation pipeline focuses on achieving two main outcomes: benchmarking zero-shot LLM performance against fine-tuned models and identifying instances where LLMs might serve as effective augmentative tools within human annotation workflows. Tasks are divided into utterance-level, conversation-level, and document-level analyses, reflective of the stratification inherent in CSS methodologies.

  1. Utterance-Level Tasks: These include detecting dialect features, emotions, figurative language, hate speech, humor, and political ideologies, among others. The tasks are regarded as foundational, allowing for robust classification and coding necessary for later inferential analyses.
  2. Conversation-Level Tasks: Analysis here targets the finer granularity of dialog acts and inter-personal dynamics, including empathy detection in mental health support contexts and power dynamics in social interactions.
  3. Document-Level Tasks: The focus extends towards coding longer narrative constructs within media articles, emphasizing tasks like event argument extraction and ideological stance classification.

Model Performance

The paper yields several insights into the performance paradigms of LLMs:

  • Classification Tasks: LLMs failed to outperform well-tuned classifiers but nonetheless demonstrated significant levels of agreement with human annotations. These findings underscore the potential role of LLMs as supplementary annotators that can hasten the annotation process.
  • Generative Tasks: On tasks requiring text generation, such as explaining social biases or figurative language, some LLMs managed to produce outputs exceeding the quality of human reference in certain contexts. This accentuates their utility in generating explanatory content, which typically demands nuanced detailing and domain understanding.

Implications and Prospective Developments

The paper posits that while LLMs are not ready to completely replace human annotation, they can play an instrumental role in forming the backbone of a blended supervised-unsupervised labeling scheme. This implies a collaborative dynamic where LLM-generated annotations are curated and refined by human experts, potentially leading to substantial savings in annotation effort.

Future trajectories for LLM application in CSS could involve their integration into thematic analysis systems and the continuous refinement of model schemas through exposure to diverse datasets. The authors advocate for careful experimentation with LLMs as an evolving facet of CSS toolkits, driven by the understanding that comprehensive content analysis is paramount for yielding interpretable, socially valuable insights.

Moreover, this paper paves the way for further inquiries into the ethical considerations and constraints inherent in relying on models trained on previously existing data and underlines the exigency for regular updates and ethical model iterations. In sum, the research by Ziems et al. delineates a rich avenue for LLMs to augment the depth and reach of computational social science, pushing the boundaries of current analytical methods while addressing pertinent practical and theoretical implications.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (306)
  1. Abbott, Andrew. 1990. Conceptions of time and events in social science methods: Causal and narrative approaches. Historical Methods: A Journal of Quantitative and Interdisciplinary History, 23(4):140–150.
  2. Persistent anti-muslim bias in large language models. In Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society, pages 298–306.
  3. Abul-Fottouh, Deena and Tina Fetner. 2018. Solidarity or schism: ideological congruence and the twitter networks of egyptian activists. Mobilization: An International Quarterly, 23(1):23–44.
  4. Ahmed, Amr and Eric Xing. 2010. Staying informed: Supervised and semi-supervised multi-view topical analysis of ideological perspective. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pages 1140–1150, Association for Computational Linguistics, Cambridge, MA.
  5. Aspectnews: Aspect-oriented summarization of news documents. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 6494–6506.
  6. How to ask for a favor: A case study on the success of altruistic requests. In Proceedings of the International AAAI Conference on Web and Social Media, volume 8, pages 12–21.
  7. Altikriti, Sahar. 2016. Persuasive speech acts in barack obama’s inaugural speeches (2009, 2013) and the last state of the union address (2016). International Journal of Linguistics, 8(2):47–66.
  8. Identifying media frames and frame dynamics within and across policy issues. In New directions in analyzing text as Data workshop.
  9. Who attains social status? effects of personality and physical attractiveness in social groups. Journal of personality and social psychology, 81(1):116.
  10. Andersson, Lynne M and Christine M Pearson. 1999. Tit for tat? the spiraling effect of incivility in the workplace. Academy of management review, 24(3):452–471.
  11. Out of one, many: Using language models to simulate human samples. Political Analysis, pages 1–15.
  12. Bail, Chris. 2014. Terrified: How anti-Muslim fringe organizations became mainstream. Princeton University Press.
  13. Bail, Christopher A. 2016. Emotional feedback and the viral spread of social media messages about autism spectrum disorders. American journal of public health, 106(7):1173–1180.
  14. Bail, Christopher A. 2023. Can generative ai improve social science?
  15. Bakhtin, Mikhail Mikhaĭlovich. 2010. Speech genres and other late essays. University of Texas press.
  16. We can detect your bias: Predicting the political ideology of news articles. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 4982–4991, Association for Computational Linguistics, Online.
  17. Predicting factuality of reporting and bias of news media sources. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 3528–3539, Association for Computational Linguistics, Brussels, Belgium.
  18. Gender identity and lexical variation in social media. Journal of Sociolinguistics, 18(2):135–160.
  19. Learning latent personas of film characters. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 352–361, Association for Computational Linguistics, Sofia, Bulgaria.
  20. Bamman, David and Noah A. Smith. 2015. Open extraction of fine-grained political statements. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pages 76–85, Association for Computational Linguistics, Lisbon, Portugal.
  21. A multitask, multilingual, multimodal evaluation of chatgpt on reasoning, hallucination, and interactivity. ArXiv preprint, abs/2302.04023.
  22. The critical periphery in the growth of social protests. PloS one, 10(11):e0143611.
  23. Longformer: The long-document transformer. ArXiv preprint, abs/2004.05150.
  24. Belz, Anja and Eric Kow. 2010. Comparing rating scales and preference judgements in language evaluation. In Proceedings of the 6th International Natural Language Generation Conference.
  25. On the dangers of stochastic parrots: Can language models be too big? In Proceedings of the 2021 ACM conference on fairness, accountability, and transparency, pages 610–623.
  26. Berelson, Bernard. 1952. Content analysis in communication research.
  27. Doctor who? framing through names and titles in German. In Proceedings of the 12th Language Resources and Evaluation Conference, pages 4924–4932, European Language Resources Association, Marseille, France.
  28. Bhatia, Sudeep. 2017. Associative judgment and vector space semantics. Psychological review, 124(1):1.
  29. Bicchieri, Cristina. 2005. The grammar of society: The nature and dynamics of social norms. Cambridge University Press.
  30. Emotions, oral arguments, and supreme court decision making. The Journal of Politics, 73(2):572–581.
  31. Language (technology) is power: A critical survey of “bias” in NLP. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 5454–5476, Association for Computational Linguistics, Online.
  32. Twitter mood predicts the stock market. Journal of computational science, 2(1):1–8.
  33. Picking on the same person: Does algorithmic monoculture lead to outcome homogenization? Advances in Neural Information Processing Systems, 35:3663–3678.
  34. On the opportunities and risks of foundation models. ArXiv preprint, abs/2108.07258.
  35. Looking for a handsome carpenter! debiasing GPT-3 job advertisements. In Proceedings of the 4th Workshop on Gender Bias in Natural Language Processing (GeBNLP), pages 212–224, Association for Computational Linguistics, Seattle, Washington.
  36. Box-Steffensmeier, Janet M and Bradford S Jones. 2004. Event history modeling: A guide for social scientists. Cambridge University Press.
  37. The narrative arc: Revealing core narrative structures through text analysis. Science advances, 6(32):eaba2196.
  38. Brahman, Faeze and Snigdha Chaturvedi. 2020. Modeling protagonist emotions for emotion-aware storytelling. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 5277–5294, Association for Computational Linguistics, Online.
  39. Extracting social power relationships from natural language. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pages 773–782, Association for Computational Linguistics, Portland, Oregon, USA.
  40. Bootstrapped text-level named entity recognition for literature. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 344–350, Association for Computational Linguistics, Berlin, Germany.
  41. Brown, Penelope and Stephen C Levinson. 1987. Politeness: Some universals in language usage, volume 4. Cambridge university press.
  42. Language models are few-shot learners. In Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual.
  43. Bucholtz, Mary and Kira Hall. 2005. Identity and interaction: A sociocultural linguistic approach. Discourse studies, 7(4-5):585–614.
  44. Modeling empathy and distress in reaction to news stories. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 4758–4765, Association for Computational Linguistics, Brussels, Belgium.
  45. Towards an ISO standard for dialogue act annotation. In Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC’10), European Language Resources Association (ELRA), Valletta, Malta.
  46. Burke, Moira and Robert Kraut. 2008. Mind your ps and qs: the impact of politeness and rudeness in online communities. In Proceedings of the 2008 ACM conference on Computer supported cooperative work, pages 281–284.
  47. Cdlm: Cross-document language modeling. In Findings of the Association for Computational Linguistics: EMNLP 2021, pages 2648–2662.
  48. Extracting training data from large language models. In 30th USENIX Security Symposium (USENIX Security 21), pages 2633–2650.
  49. Training temporal word embeddings with a compass. In The Thirty-Third AAAI Conference on Artificial Intelligence, AAAI 2019, The Thirty-First Innovative Applications of Artificial Intelligence Conference, IAAI 2019, The Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019, Honolulu, Hawaii, USA, January 27 - February 1, 2019, pages 6326–6334, AAAI Press.
  50. Experimental evidence for tipping points in social convention. Science, 360(6393):1116–1119.
  51. The price of debiasing automatic metrics in natural language evalaution. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 643–653, Association for Computational Linguistics, Melbourne, Australia.
  52. Figurative language in recognizing textual entailment. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, pages 3354–3361, Association for Computational Linguistics, Online.
  53. Flute: Figurative language understanding through textual explanations. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 7139–7159.
  54. Chambers, Nathanael and Dan Jurafsky. 2008. Unsupervised learning of narrative event chains. In Proceedings of ACL-08: HLT, pages 789–797, Association for Computational Linguistics, Columbus, Ohio.
  55. Reading tea leaves: How humans interpret topic models. In Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, Vancouver, British Columbia, Canada, pages 288–296, Curran Associates, Inc.
  56. ConvoKit: A toolkit for the analysis of conversations. In Proceedings of the 21th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pages 57–60, Association for Computational Linguistics, 1st virtual meeting.
  57. Clean or annotate: How to spend a limited data collection budget. In Proceedings of the Third Workshop on Deep Learning for Low-Resource Natural Language Processing, pages 152–168.
  58. Chen, Peng-Yu and Von-Wun Soo. 2018. Humor recognition using deep learning. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers), pages 113–117, Association for Computational Linguistics, New Orleans, Louisiana.
  59. Do cascades recur? In Proceedings of the 25th International Conference on World Wide Web, WWW 2016, Montreal, Canada, April 11 - 15, 2016, pages 671–681, ACM.
  60. Anyone can become a troll: Causes of trolling behavior in online discussions. In Proceedings of the 2017 ACM conference on computer supported cooperative work and social computing, pages 1217–1230.
  61. Antisocial behavior in online discussion communities. In Proceedings of the international aaai conference on web and social media, volume 9, pages 61–70.
  62. Generating long sequences with sparse transformers. ArXiv preprint, abs/1904.10509.
  63. Deep reinforcement learning from human preferences. In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, December 4-9, 2017, Long Beach, CA, USA, pages 4299–4307.
  64. Language models trained on media diets can predict public opinion. ArXiv preprint, abs/2303.16779.
  65. Learning personas from dialogue with attentive memory networks. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 2638–2646, Association for Computational Linguistics, Brussels, Belgium.
  66. Scaling instruction-finetuned language models. ArXiv preprint, abs/2210.11416.
  67. Cialdini, Robert B. 2003. Influence. Influence At Work.
  68. Living machines: A study of atypical animacy. In Proceedings of the 28th International Conference on Computational Linguistics, pages 4534–4545, International Committee on Computational Linguistics, Barcelona, Spain (Online).
  69. The longest month: analyzing covid-19 vaccination opinions dynamics from tweets in the month following the first vaccine announcement. Ieee Access, 9:33203–33223.
  70. Designing interactive systems to encourage empathy between users. In Proceedings of the 2014 companion publication on Designing interactive systems. pages 13–16.
  71. Craig, Holly K and Julie A Washington. 2002. Oral language expectations for african american preschoolers and kindergartners.
  72. Echoes of power: language effects and power differences in social interaction. In Proceedings of the 21st World Wide Web Conference 2012, WWW 2012, Lyon, France, April 16-20, 2012, pages 699–708, ACM.
  73. A computational approach to politeness with application to social factors. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 250–259, Association for Computational Linguistics, Sofia, Bulgaria.
  74. Dang, Hoa Trang. 2005. Overview of duc 2005. In Proceedings of the document understanding conference, volume 2005, pages 1–12, Citeseer.
  75. Davies, Catherine E. 2017. Sociolinguistic approaches to humor. In The Routledge handbook of language and humor. Routledge, pages 472–488.
  76. Del Tredici, Marco and Raquel Fernández. 2017. Semantic variation in online communities of practice. In IWCS 2017 - 12th International Conference on Computational Semantics - Long papers.
  77. Creating a novel geolocation corpus from historical texts. In Proceedings of the 10th Linguistic Annotation Workshop held in conjunction with ACL 2016 (LAW-X 2016), pages 188–198, Association for Computational Linguistics, Berlin, Germany.
  78. Analyzing polarization in social media: Method and application to tweets on 21 mass shootings. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 2970–3005, Association for Computational Linguistics, Minneapolis, Minnesota.
  79. Learning to recognize dialect features. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 2315–2338, Association for Computational Linguistics, Online.
  80. Bold: Dataset and metrics for measuring biases in open-ended language generation. In Proceedings of the 2021 ACM conference on fairness, accountability, and transparency, pages 862–872.
  81. DiMaggio, Paul. 2015. Adapting computational text analysis to social science (and vice versa). Big Data & Society, 2(2):2053951715602908.
  82. Exploiting affinities between topic modeling and the sociological perspective on culture: Application to newspaper coverage of us government arts funding. Poetics, 41(6):570–606.
  83. Dobson, James E. 2019. Critical digital humanities: the search for a methodology. University of Illinois Press.
  84. Easley, David and Jon Kleinberg. 2010. Networks, crowds, and markets: Reasoning about a highly connected world. Cambridge university press.
  85. Using large language model annotations for valid downstream statistical inference in social science: Design-based semi-supervised learning. ArXiv preprint, abs/2306.04746.
  86. Eisenstein, Jacob. 2012. Mapping the geographical diffusion of new words. ArXiv preprint, abs/1210.5268.
  87. Diffusion of lexical change in social media. PloS one, 9(11):e113114.
  88. Md3: The multi-dialect dataset of dialogues. ArXiv preprint, abs/2305.11355.
  89. Discovering sociolinguistic associations with structured sparsity. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pages 1365–1374, Association for Computational Linguistics, Portland, Oregon, USA.
  90. Ellsberg, Daniel. 1961. Risk, ambiguity, and the savage axioms. The quarterly journal of economics, 75(4):643–669.
  91. Latent hatred: A benchmark for understanding implicit hate speech. ArXiv preprint, abs/2109.05322.
  92. Entman, Robert M. 1993. Framing: Toward clarification of a fractured paradigm. Journal of communication, 43(4):51–58.
  93. Evans, James A and Pedro Aceves. 2016. Machine translation: Mining text for social theory. Annual review of sociology, 42:21–50.
  94. Summeval: Re-evaluating summarization evaluation. Transactions of the Association for Computational Linguistics, 9:391–409.
  95. Identification of leaders, lurkers, associates and spammers in a social network: context-dependent and context-independent approaches. Social Network Analysis and Mining, 1(3):241–254.
  96. Framing and agenda-setting in Russian news: a computational analysis of intricate political strategies. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 3570–3580, Association for Computational Linguistics, Brussels, Belgium.
  97. Flores, René D. 2017. Do anti-immigrant laws shape public sentiment? a study of arizona’s sb 1070 using twitter data. American Journal of Sociology, 123(2):333–384.
  98. Misinfo reaction frames: Reasoning about readers’ reactions to news headlines. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 3108–3127.
  99. Neural metaphor detection in context. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 607–613, Association for Computational Linguistics, Brussels, Belgium.
  100. Garfinkel, Harold. 2016. Studies in ethnomethodology. In Social Theory Re-Wired. Routledge, pages 85–95.
  101. Word embeddings quantify 100 years of gender and ethnic stereotypes. Proceedings of the National Academy of Sciences, 115(16):E3635–E3644.
  102. Personality and political attitudes: Relationships across issue domains and political contexts. American Political Science Review, 104(1):111–133.
  103. Chatgpt outperforms crowd-workers for text-annotation tasks. ArXiv preprint, abs/2303.15056.
  104. Golder, Scott A and Michael W Macy. 2014. Digital footprints: Opportunities and challenges for online social research. Annual Review of Sociology, 40:129–152.
  105. Who is the hero, the villain, and the victim? detection of roles in news articles using natural language techniques. In 23rd International Conference on Intelligent User Interfaces, pages 311–315.
  106. News summarization and evaluation in the era of gpt-3. ArXiv preprint, abs/2209.12356.
  107. Graham, Elizabeth E. 1995. The involvement of sense of humor in the development of social relationships. Communication Reports, 8(2):158–169.
  108. Grimmer, Justin. 2010. A bayesian hierarchical topic model for political texts: Measuring expressed agendas in senate press releases. Political Analysis, 18(1):1–35.
  109. Testing the etch-a-sketch hypothesis: a computational analysis of mitt romney’s ideological makeover during the 2012 primary vs. general elections. In APSA 2013 Annual Meeting Paper, American Political Science Association 2013 Annual Meeting.
  110. Grudin, Jonathan. 2006. Why personas work: The psychological evidence. The persona lifecycle, 12:642–664.
  111. Cultural shift or linguistic drift? comparing two computational measures of semantic change. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pages 2116–2121, Association for Computational Linguistics, Austin, Texas.
  112. Diachronic word embeddings reveal statistical laws of semantic change. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1489–1501, Association for Computational Linguistics, Berlin, Germany.
  113. Rating versus ranking: What is the best way to reduce response and language bias in cross-national research? International business review, 18(4):417–432.
  114. Hashim, Safair Safwat Mohammed and Suhair Safwat. 2015. Speech acts in political speeches. Journal of Modern Education Review, 5(7):699–706.
  115. Measuring massive multitask language understanding. In International Conference on Learning Representations.
  116. Herring, Susan C. 1994. Politeness in computer culture: Why women thank and men flame. In Cultural performances: Proceedings of the third Berkeley women and language conference, pages 278–294.
  117. Personalized persuasion: Tailoring persuasive appeals to recipients’ personality traits. Psychological science, 23(6):578–581.
  118. Training compute-optimal large language models. ArXiv preprint, abs/2203.15556.
  119. Prediction and explanation in social systems. Science, 355(6324):486–488.
  120. The Pragmatics behind Politics: Modelling Metaphor, Framing and Emotion in Political Discourse. In Findings of the Association for Computational Linguistics: EMNLP 2020, pages 4479–4488, Association for Computational Linguistics, Online.
  121. Challenges and applications of automated extraction of socio-political events from text (CASE 2021): Workshop and shared task report. In Proceedings of the 4th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2021), pages 1–9, Association for Computational Linguistics, Online.
  122. Hutchby, Ian and Robin Wooffitt. 2008. Conversation analysis. Polity.
  123. Ignatow, Gabe and Rada Mihalcea. 2016. Text mining: A guidebook for the social sciences. Sage Publications.
  124. Iyengar, Shanto. 1990. Framing responsibility for political issues: The case of poverty. Political behavior, 12(1):19–40.
  125. Figurative usage detection of symptom words to improve personal health mention detection. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 1142–1147, Association for Computational Linguistics, Florence, Italy.
  126. Political ideology detection using recursive neural networks. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1113–1122, Association for Computational Linguistics, Baltimore, Maryland.
  127. Feuding families and former Friends: Unsupervised learning for dynamic fictional relationships. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 1534–1544, Association for Computational Linguistics, San Diego, California.
  128. Jacobs, Arthur M and Annette Kinder. 2018. What makes a metaphor literary? answers from two computational studies. Metaphor and Symbol, 33(2):85–100.
  129. Inducing stereotypical character roles from plot structure. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pages 492–497.
  130. Detecting latent ideology in expert text: Evidence from academic papers in economics. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 1804–1809, Association for Computational Linguistics, Doha, Qatar.
  131. A hybrid approach for drug abuse events extraction from twitter. Procedia computer science, 96:1032–1040.
  132. SLIDE - a sentiment lexicon of common idioms. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), European Language Resources Association (ELRA), Miyazaki, Japan.
  133. Jockers, Matthew L and David Mimno. 2013. Significant themes in 19th-century literature. Poetics, 41(6):750–769.
  134. Ideological phrase indicators for classification of political discourse framing on Twitter. In Proceedings of the Second Workshop on NLP and Computational Social Science, pages 90–99, Association for Computational Linguistics, Vancouver, Canada.
  135. Jones, Erik and Jacob Steinhardt. 2022. Capturing failures of large language models via human cognitive biases. ArXiv preprint, abs/2202.12299.
  136. Automatic sarcasm detection: A survey. ACM Computing Surveys (CSUR), 50(5):1–22.
  137. Scaling laws for neural language models. ArXiv preprint, abs/2001.08361.
  138. The perils of using Mechanical Turk to evaluate open-ended text generation. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pages 1265–1285, Association for Computational Linguistics, Online and Punta Cana, Dominican Republic.
  139. Towards modelling language innovation acceptance in online social networks. In Proceedings of the Ninth ACM International Conference on Web Search and Data Mining, San Francisco, CA, USA, February 22-25, 2016, pages 553–562, ACM.
  140. Metaphor detection in a poetry corpus. In Proceedings of the Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, pages 1–9, Association for Computational Linguistics, Vancouver, Canada.
  141. Analytical sociology and computational social science. Journal of Computational Social Science, 1(1):3–14.
  142. Dynabench: Rethinking benchmarking in NLP. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 4110–4124, Association for Computational Linguistics, Online.
  143. Supervisor humor and employee outcomes: The role of social distance and affective trust in supervisor. Journal of Business and Psychology, 31:125–139.
  144. Temporal analysis of language through neural language models. In Proceedings of the ACL 2014 Workshop on Language Technologies and Computational Social Science, pages 61–65, Association for Computational Linguistics, Baltimore, MD, USA.
  145. King, Gary and Will Lowe. 2003. An automated information extraction tool for international conflict data with performance as good as human coders: A rare events evaluation design. International Organization, 57(3):617–642.
  146. Innateness and culture in the evolution of language. Proceedings of the National Academy of Sciences, 104(12):5241–5245.
  147. Human decisions and machine predictions. The quarterly journal of economics, 133(1):237–293.
  148. Kleinberg, Jon and Manish Raghavan. 2021. Algorithmic monoculture and social welfare. Proceedings of the National Academy of Sciences, 118(22):e2018340118.
  149. Koralus, Philipp and Vincent Wang-Maścianica. 2023. Humans in humans out: On gpt converging toward common sense in both success and failure. ArXiv preprint, abs/2303.17276.
  150. Private traits and attributes are predictable from digital records of human behavior. Proceedings of the national academy of sciences, 110(15):5802–5805.
  151. The geometry of culture: Analyzing the meanings of class through word embeddings. American Sociological Review, 84(5):905–949.
  152. Kuhn, Thomas S. 1962. The structure of Scientific Revolutions. The University of Chicago Press.
  153. Kuipers, Giselinde. 2009. Humor styles and symbolic boundaries.
  154. Statistically significant detection of linguistic change. In Proceedings of the 24th International Conference on World Wide Web, WWW 2015, Florence, Italy, May 18-22, 2015, pages 625–635, ACM.
  155. Exploring cyberbullying and other toxic behavior in team competition online games. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, CHI 2015, Seoul, Republic of Korea, April 18-23, 2015, pages 3739–3748, ACM.
  156. Labatut, Vincent and Xavier Bost. 2019. Extraction and analysis of fictional character networks: A survey. ACM Computing Surveys (CSUR), 52(5):1–40.
  157. Multilingual stance detection in social media political debates. Computer Speech & Language, 63:101075.
  158. Event extraction from historical texts: A new dataset for black rebellions. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, pages 2390–2400, Association for Computational Linguistics, Online.
  159. Computational social science. Science, 323(5915):721–723.
  160. The science of fake news. Science, 359(6380):1094–1096.
  161. Computational social science: Obstacles and opportunities. Science, 369(6507):1060–1062.
  162. Improving hate speech type and target detection with hateful metaphor features. In Proceedings of the Fourth Workshop on NLP for Internet Freedom: Censorship, Disinformation, and Propaganda, pages 7–16, Association for Computational Linguistics, Online.
  163. Meme-tracking and the dynamics of the news cycle. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 497–506.
  164. Document-level event argument extraction by conditional generation. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 894–908, Association for Computational Linguistics, Online.
  165. Pre-trained language models for interactive decision-making. Advances in Neural Information Processing Systems, 35:31199–31212.
  166. Holistic evaluation of language models. ArXiv preprint, abs/2211.09110.
  167. How NOT to evaluate your dialogue system: An empirical study of unsupervised evaluation metrics for dialogue response generation. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pages 2122–2132, Association for Computational Linguistics, Austin, Texas.
  168. Roberta: A robustly optimized bert pretraining approach.
  169. Lucy, Li and David Bamman. 2021. Gender and representation bias in GPT-3 generated stories. In Proceedings of the Third Workshop on Narrative Understanding, pages 48–55, Association for Computational Linguistics, Virtual.
  170. Desmog: Detecting stance in media on global warming. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings, pages 3296–3315.
  171. Detecting stance in media on global warming. In Findings of the Association for Computational Linguistics: EMNLP 2020, pages 3296–3315, Association for Computational Linguistics, Online.
  172. A survey on empathetic dialogue systems. Information Fusion, 64:50–70.
  173. Machina, Mark J. 1987. Choice under uncertainty: Problems solved and unsolved. Journal of Economic perspectives, 1(1):121–154.
  174. Roles and success in Wikipedia talk pages: Identifying latent patterns of behavior. In Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 1026–1035, Asian Federation of Natural Language Processing, Taipei, Taiwan.
  175. Markiewicz, Dorothy. 1974. Effects of humor on persuasion. Sociometry, pages 407–422.
  176. Martin, Rod A and Thomas Ford. 2018. The psychology of humor: An integrative approach. Academic press.
  177. A survey on computational propaganda detection. In Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI 2020, pages 4826–4832, ijcai.org.
  178. Hate begets hate: A temporal study of hate speech. Proceedings of the ACM on Human-Computer Interaction, 4(CSCW2):1–24.
  179. Modeling framing in immigration discourse on social media. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 2219–2263, Association for Computational Linguistics, Online.
  180. Mihalcea, Rada and Vivi Nastase. 2012. Word epoch disambiguation: Finding how words change over time. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 259–263, Association for Computational Linguistics, Jeju Island, Korea.
  181. Mihalcea, Rada and Carlo Strapparava. 2005. Making computers laugh: Investigations in automatic humor recognition. In Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, pages 531–538, Association for Computational Linguistics, Vancouver, British Columbia, Canada.
  182. Miller, Tim. 2019. Explanation in artificial intelligence: Insights from the social sciences. Artificial intelligence, 267:1–38.
  183. Min, Bonan and Xiaoxi Zhao. 2019. Measure country-level socio-economic indicators with streaming news: An empirical study. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pages 1249–1254, Association for Computational Linguistics, Hong Kong, China.
  184. A modular architecture for unsupervised sarcasm generation. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pages 6144–6154, Association for Computational Linguistics, Hong Kong, China.
  185. SemEval-2016 task 6: Detecting stance in tweets. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016), pages 31–41, Association for Computational Linguistics, San Diego, California.
  186. Murphy, Kevin M and Andrei Shleifer. 2004. Persuasion in politics. American Economic Review, 94(2):435–439.
  187. Murray, Michael et al. 2015. Narrative psychology. Qualitative psychology: A practical guide to research methods, pages 85–107.
  188. Beyond western, educated, industrial, rich, and democratic (weird) psychology: Measuring and mapping scales of cultural and psychological distance. Psychological science, 31(6):678–701.
  189. Nelson, Laura K. 2015. Political logics as cultural memory: cognitive structures, local continuities, and women’s organizations in chicago and new york city. Work. Pap. Kellogg School Manag., Northwestern Univ.
  190. Nelson, Laura K. 2021. Cycles of conflict, a century of continuity: the impact of persistent place-based political logics on social movement strategy. American Journal of Sociology, 127(1):1–59.
  191. Computational sociolinguistics: A survey. Computational linguistics, 42(3):537–593.
  192. Nguyen, Thien Hai and Kiyoaki Shirai. 2015. Topic modeling based sentiment analysis on social media for stock market prediction. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 1354–1364, Association for Computational Linguistics, Beijing, China.
  193. Improving the generalizability of depression detection by leveraging clinical questionnaires. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 8446–8459.
  194. Niculae, Vlad and Cristian Danescu-Niculescu-Mizil. 2014. Brighter than gold: Figurative language in user generated comparisons. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 2008–2018, Association for Computational Linguistics, Doha, Qatar.
  195. Why we need new evaluation metrics for NLG. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pages 2241–2252, Association for Computational Linguistics, Copenhagen, Denmark.
  196. Ofer, Dan and Dafna Shahaf. 2022. Cards against AI: Predicting humor in a fill-in-the-blank party game. In Findings of the Association for Computational Linguistics: EMNLP 2022, pages 5397–5403, Association for Computational Linguistics, Abu Dhabi, United Arab Emirates.
  197. Empathic conversations: A multi-level dataset of contextualized conversations. ArXiv preprint, abs/2205.12698.
  198. OpenAI. 2023. Gpt-4 technical report.
  199. The emotional side of software developers in jira. In Proceedings of the 13th international conference on mining software repositories, pages 480–483.
  200. Training language models to follow instructions with human feedback. Advances in Neural Information Processing Systems, 35:27730–27744.
  201. Paltridge, Brian and Jill Burton. 2000. Making sense of discourse analysis. Antipodean Educational Enterprises Gold Coast, Queensland.
  202. Generative agents: Interactive simulacra of human behavior.
  203. Social simulacra: Creating populated prototypes for social computing systems. In Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology, pages 1–18.
  204. Pennebaker, James W and Laura A King. 1999. Linguistic styles: language use as an individual difference. Journal of personality and social psychology, 77(6):1296.
  205. True few-shot learning with language models. Advances in neural information processing systems, 34:11054–11070.
  206. Pfeil, Ulrike and Panayiotis Zaphiris. 2007. Patterns of empathy in online communication. In Proceedings of the SIGCHI conference on Human factors in computing systems, pages 919–928.
  207. Picard, Rosalind W. 2000. Affective computing. MIT press.
  208. Pilehvar, Mohammad Taher and Jose Camacho-Collados. 2019. WiC: the word-in-context dataset for evaluating context-sensitive meaning representations. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 1267–1273, Association for Computational Linguistics, Minneapolis, Minnesota.
  209. Narrative theory for computational narrative understanding. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pages 298–311.
  210. Plutchik, Robert. 1980. A general psychoevolutionary theory of emotion. In Theories of emotion. Elsevier, pages 3–33.
  211. Post, Matt. 2018. A call for clarity in reporting BLEU scores. In Proceedings of the Third Conference on Machine Translation: Research Papers, pages 186–191, Association for Computational Linguistics, Brussels, Belgium.
  212. Predicting overt display of power in written dialogs. In Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 518–522, Association for Computational Linguistics, Montréal, Canada.
  213. Gender and power: How gender and gender environment affect manifestations of power. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 1965–1976, Association for Computational Linguistics, Doha, Qatar.
  214. Preece, Jenny. 1998. Empathic communities: Reaching out across the web. interactions, 5(2):32–43.
  215. An analysis of the user occupational class through Twitter content. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 1754–1764, Association for Computational Linguistics, Beijing, China.
  216. Beyond binary labels: Political ideology prediction of Twitter users. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 729–740, Association for Computational Linguistics, Vancouver, Canada.
  217. Purschke, Christoph and Dirk Hovy. 2019. Lörres, möppes, and the swiss.(re) discovering regional patterns in anonymous social media data. Journal of Linguistic Geography, 7(2):113–134.
  218. Is chatgpt a general-purpose natural language processing task solver? ArXiv preprint, abs/2302.06476.
  219. Exploring the limits of transfer learning with a unified text-to-text transformer. The Journal of Machine Learning Research, 21(1):5485–5551.
  220. Ai and the everything in the whole wide world benchmark. In Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks, volume 1, Curran.
  221. Snorkel: Rapid training data creation with weak supervision. In Proceedings of the VLDB Endowment. International Conference on Very Large Data Bases, volume 11, page 269, NIH Public Access.
  222. Ribeiro, Marco Tulio and Scott Lundberg. 2022. Adaptive testing and debugging of nlp models. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 3253–3267.
  223. Calls to action on social media: Potential for censorship and social impact. EMNLP-IJCNLP 2019, page 36.
  224. Cognitive-behavioral therapy.
  225. Roy, Shamik and Dan Goldwasser. 2020. Weakly supervised learning of nuanced frames for analyzing polarization in news media. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 7698–7716, Association for Computational Linguistics, Online.
  226. Rudolph, Maja R. and David M. Blei. 2018. Dynamic embeddings for language evolution. In Proceedings of the 2018 World Wide Web Conference on World Wide Web, WWW 2018, Lyon, France, April 23-27, 2018, pages 1003–1011, ACM.
  227. Where new words are born: Distributional semantic analysis of neologisms and their semantic neighborhoods. In Proceedings of the Society for Computation in Linguistics 2020, pages 367–376, Association for Computational Linguistics, New York, New York.
  228. Sacks, Harvey. 1992. Lectures on conversation: Volume i. Malden, Massachusetts: Blackwell.
  229. Saldias, Belen and Deb Roy. 2020. Exploring aspects of similarity between spoken personal narratives by disentangling them into narrative clause types. In Proceedings of the First Joint Workshop on Narrative Understanding, Storylines, and Events, pages 78–86, Association for Computational Linguistics, Online.
  230. Experimental study of inequality and unpredictability in an artificial cultural market. science, 311(5762):854–856.
  231. Multitask prompted training enables zero-shot task generalization. In International Conference on Learning Representations.
  232. Santhanam, Sashank and Samira Shaikh. 2019. Towards best experiment design for evaluating dialogue system output. In Proceedings of the 12th International Conference on Natural Language Generation, pages 88–94, Association for Computational Linguistics, Tokyo, Japan.
  233. Whose opinions do language models reflect?
  234. Whose opinions do language models reflect? ArXiv preprint, abs/2303.17548.
  235. The risk of racial bias in hate speech detection. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 1668–1678, Association for Computational Linguistics, Florence, Italy.
  236. Social bias frames: Reasoning about social and power implications of language. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 5477–5490, Association for Computational Linguistics, Online.
  237. Recollection versus imagination: Exploring human memory and cognition via neural language models. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 1970–1978, Association for Computational Linguistics, Online.
  238. Quantifying the narrative flow of imagined versus autobiographical stories. Proceedings of the National Academy of Sciences, 119(45):e2211715119.
  239. Connotation frames of power and agency in modern films. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pages 2329–2334, Association for Computational Linguistics, Copenhagen, Denmark.
  240. CARER: Contextualized affect representations for emotion recognition. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 3687–3697, Association for Computational Linguistics, Brussels, Belgium.
  241. Schelling, Thomas C. 1971. Dynamic models of segregation. Journal of mathematical sociology, 1(2):143–186.
  242. SemEval-2020 task 1: Unsupervised lexical semantic change detection. In Proceedings of the Fourteenth Workshop on Semantic Evaluation, pages 1–23, International Committee for Computational Linguistics, Barcelona (online).
  243. Civil unrest on Twitter (CUT): A dataset of tweets to support research on civil unrest. In Proceedings of the Sixth Workshop on Noisy User-generated Text (W-NUT 2020), pages 215–221, Association for Computational Linguistics, Online.
  244. BLEURT: Learning robust metrics for text generation. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 7881–7892, Association for Computational Linguistics, Online.
  245. Modeling motivational interviewing strategies on an online peer-to-peer counseling platform. Proceedings of the ACM on Human-Computer Interaction, 6(CSCW2):1–24.
  246. Examining the ordering of rhetorical strategies in persuasive requests. In Findings of the Association for Computational Linguistics: EMNLP 2020, pages 1299–1306, Association for Computational Linguistics, Online.
  247. On second thought, let’s not think step by step! bias and toxicity in zero-shot reasoning. ArXiv preprint, abs/2212.08061.
  248. Towards facilitating empathic conversations in online mental health support: A reinforcement learning approach. In Proceedings of the Web Conference 2021, pages 194–205.
  249. A computational approach to understanding empathy expressed in text-based mental health support. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 5263–5276, Association for Computational Linguistics, Online.
  250. Hierarchical Chinese legal event extraction via pedal attention mechanism. In Proceedings of the 28th International Conference on Computational Linguistics, pages 100–113, International Committee on Computational Linguistics, Barcelona, Spain (Online).
  251. Societal biases in language generation: Progress and challenges. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 4275–4293, Association for Computational Linguistics, Online.
  252. Sherman, Lawrence W. 1988. Humor and social distance in elementary school children.
  253. Shmueli, Galit et al. 2010. To explain or to predict? Statistical science, 25(3):289–310.
  254. Shubik, Martin. 1982. Game theory in the social sciences: concepts and solutions.
  255. Tweet stance detection using multi-kernel convolution and attentive lstm variants. IEICE TRANSACTIONS on Information and Systems, 102(12):2493–2503.
  256. Silverman, David. 1998. Harvey Sacks: Social science and conversation analysis. Oxford University Press on Demand.
  257. Literary event detection. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 3623–3634, Association for Computational Linguistics, Florence, Italy.
  258. Cheap and fast – but is it good? evaluating non-expert annotations for natural language tasks. In Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, pages 254–263, Association for Computational Linguistics, Honolulu, Hawaii.
  259. Solaiman, Irene and Christy Dennison. 2021. Process for adapting language models to society (palms) with values-targeted datasets. Advances in Neural Information Processing Systems, 34:5861–5873.
  260. Sprugnoli, Rachele and Sara Tonelli. 2019. Novel event detection and classification for historical texts. Computational Linguistics, 45(2):229–265.
  261. Beyond the imitation game: Quantifying and extrapolating the capabilities of language models. Transactions on Machine Learning Research.
  262. Inferring interpersonal relations in narrative summaries. In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, February 12-17, 2016, Phoenix, Arizona, USA, pages 2807–2813, AAAI Press.
  263. Predicting the topical stance and political leaning of media using tweets. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 527–537, Association for Computational Linguistics, Online.
  264. Persuasion and social movements. Waveland Press.
  265. Dialogue act modeling for automatic tagging and recognition of conversational speech. Computational Linguistics, 26(3):339–374.
  266. The general inquirer: A computer approach to content analysis.
  267. IMPLI: Investigating NLI models’ performance on figurative language. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 5375–5388, Association for Computational Linguistics, Dublin, Ireland.
  268. Suler, John. 2005. Contemporary media forum: The online disinhibition effect. International Journal of Applied Psychoanalytic Studies.
  269. The effect of wording on message propagation: Topic- and author-controlled natural experiments on Twitter. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 175–185, Association for Computational Linguistics, Baltimore, Maryland.
  270. Winning arguments: Interaction dynamics and persuasion strategies in good-faith online discussions. In Proceedings of the 25th International Conference on World Wide Web, WWW 2016, Montreal, Canada, April 11 - 15, 2016, pages 613–624, ACM.
  271. Accountability and empathy by design: Encouraging bystander intervention to cyberbullying on social media. Proceedings of the ACM on Human-Computer Interaction, 3(CSCW):1–26.
  272. Trope, Yaacov and Nira Liberman. 2010. Construal-level theory of psychological distance. Psychological review, 117(2):440.
  273. Tufekci, Zeynep and Christopher Wilson. 2012. Social media and the decision to participate in political protest: Observations from tahrir square. Journal of communication, 62(2):363–379.
  274. Mr. bennet, his coachman, and the archbishop walk into a bar but only one of them gets recognized: On the difficulty of detecting characters in literary texts. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pages 769–774, Association for Computational Linguistics, Lisbon, Portugal.
  275. The spread of true and false news online. science, 359(6380):1146–1151.
  276. Evaluation methods for topic models. In Proceedings of the 26th Annual International Conference on Machine Learning, ICML 2009, Montreal, Quebec, Canada, June 14-18, 2009, volume 382 of ACM International Conference Proceeding Series, pages 1105–1112, ACM.
  277. Super-naturalinstructions: Generalization via declarative instructions on 1600+ nlp tasks. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 5085–5109.
  278. Wardhaugh, Ronald and Janet M Fuller. 2021. An introduction to sociolinguistics. John Wiley & Sons.
  279. Weller, Orion and Kevin Seppi. 2019. Humor detection: A transformer gets the last laugh. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pages 3621–3625, Association for Computational Linguistics, Hong Kong, China.
  280. Finding social roles in wikipedia. In Proceedings of the 2011 iConference, iConference ’11, pages 122–129, ACM, New York, NY, USA.
  281. Reframing human-AI collaboration for generating free-text explanations. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 632–658, Association for Computational Linguistics, Seattle, United States.
  282. Wu, Minghao and Alham Fikri Aji. 2023. Style over substance: Evaluation biases for large language models. ArXiv preprint, abs/2307.03025.
  283. Xiang, Wei and Bang Wang. 2019. A survey of event extraction from text. IEEE Access, 7:173111–173137.
  284. Let’s make your request more persuasive: Modeling persuasive strategies via semi-supervised neural nets on crowdfunding platforms. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 3620–3630, Association for Computational Linguistics, Minneapolis, Minnesota.
  285. Seekers, providers, welcomers, and storytellers: Modeling social roles in online health communities. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, CHI 2019, Glasgow, Scotland, UK, May 04-09, 2019, page 344, ACM.
  286. Weakly supervised role identification in teamwork interactions. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 1671–1680, Association for Computational Linguistics, Beijing, China.
  287. Adapt-and-distill: Developing small, fast and effective pretrained language models for domains. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, pages 460–470, Association for Computational Linguistics, Online.
  288. Yarkoni, Tal and Jacob Westfall. 2017. Choosing prediction over explanation in psychology: Lessons from machine learning. Perspectives on Psychological Science, 12(6):1100–1122.
  289. Score: Pre-training for context representation in conversational semantic parsing. In International Conference on Learning Representations.
  290. Cold-start active learning through self-supervised language modeling. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 7935–7948, Association for Computational Linguistics, Online.
  291. Zarrella, Guido and Amy Marsh. 2016. MITRE at SemEval-2016 task 6: Transfer learning for stance detection. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016), pages 458–463, Association for Computational Linguistics, San Diego, California.
  292. Why do you feel this way? summarizing triggers of emotions in social media posts. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 9436–9453.
  293. Characterizing online discussion using coarse discourse sequences. In Proceedings of the International AAAI Conference on Web and Social Media, volume 11, pages 357–366.
  294. Expert finding in a social network. In Advances in Databases: Concepts, Systems and Applications. Springer, pages 1066–1069.
  295. Conversations gone awry: Detecting early signs of conversational failure. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1350–1361, Association for Computational Linguistics, Melbourne, Australia.
  296. Bertscore: Evaluating text generation with BERT. In 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26-30, 2020, OpenReview.net.
  297. Gender bias in coreference resolution: Evaluation and debiasing methods. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers), pages 15–20, Association for Computational Linguistics, New Orleans, Louisiana.
  298. Calibrate before use: Improving few-shot performance of language models. In International Conference on Machine Learning, pages 12697–12706, PMLR.
  299. Zhou, Naitian and David Jurgens. 2020. Condolence and empathy in online communities. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 609–626, Association for Computational Linguistics, Online.
  300. Zhu, Jian and David Jurgens. 2021a. Idiosyncratic but not arbitrary: Learning idiolects in online registers reveals distinctive yet consistent individual styles. ArXiv preprint, abs/2109.03158.
  301. Zhu, Jian and David Jurgens. 2021b. The structure of online social networks modulates the rate of lexical change. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 2201–2218, Association for Computational Linguistics, Online.
  302. Exploring ai ethics of chatgpt: A diagnostic analysis. ArXiv preprint, abs/2301.12867.
  303. Multi-VALUE: A framework for cross-dialectal English NLP. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 744–768, Association for Computational Linguistics, Toronto, Canada.
  304. Inducing positive perspectives with text reframing. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 3682–3700.
  305. Ziems, Caleb and Diyi Yang. 2021. To protect and to serve? analyzing entity-centric framing of police violence. In Findings of the Association for Computational Linguistics: EMNLP 2021, pages 957–976.
  306. Zong, Mingyu and Bhaskar Krishnamachari. 2022. a survey on gpt-3.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Caleb Ziems (22 papers)
  2. William Held (17 papers)
  3. Omar Shaikh (23 papers)
  4. Jiaao Chen (31 papers)
  5. Zhehao Zhang (18 papers)
  6. Diyi Yang (151 papers)
Citations (188)
Youtube Logo Streamline Icon: https://streamlinehq.com