ValueScope: Unveiling Implicit Norms and Values via Return Potential Model of Social Interactions (2407.02472v2)
Abstract: This study introduces ValueScope, a framework leveraging LLMs to quantify social norms and values within online communities, grounded in social science perspectives on normative structures. We employ ValueScope to dissect and analyze linguistic and stylistic expressions across 13 Reddit communities categorized under gender, politics, science, and finance. Our analysis provides a quantitative foundation showing that even closely related communities exhibit remarkably diverse norms. This diversity supports existing theories and adds a new dimension--community preference--to understanding community interactions. ValueScope not only delineates differing social norms among communities but also effectively traces their evolution and the influence of significant external events like the U.S. presidential elections and the emergence of new sub-communities. The framework thus highlights the pivotal role of social norms in shaping online interactions, presenting a substantial advance in both the theory and application of social norm studies in digital spaces.