Harmonizing Code-mixed Conversations: Personality-assisted Code-mixed Response Generation in Dialogues (2401.12995v1)
Abstract: Code-mixing, the blending of multiple languages within a single conversation, introduces a distinctive challenge, particularly in the context of response generation. Capturing the intricacies of code-mixing proves to be a formidable task, given the wide-ranging variations influenced by individual speaking styles and cultural backgrounds. In this study, we explore response generation within code-mixed conversations. We introduce a novel approach centered on harnessing the Big Five personality traits acquired in an unsupervised manner from the conversations to bolster the performance of response generation. These inferred personality attributes are seamlessly woven into the fabric of the dialogue context, using a novel fusion mechanism, PA3. It uses an effective two-step attention formulation to fuse the dialogue and personality information. This fusion not only enhances the contextual relevance of generated responses but also elevates the overall performance of the model. Our experimental results, grounded in a dataset comprising of multi-party Hindi-English code-mix conversations, highlight the substantial advantages offered by personality-infused models over their conventional counterparts. This is evident in the increase observed in ROUGE and BLUE scores for the response generation task when the identified personality is seamlessly integrated into the dialogue context. Qualitative assessment for personality identification and response generation aligns well with our quantitative results.
- Towards code-mixed Hinglish dialogue generation. In Proceedings of the 3rd Workshop on Natural Language Processing for Conversational AI, pages 271–280, Online. Association for Computational Linguistics.
- What code-switching strategies are effective in dialog systems? In Proceedings of the Society for Computation in Linguistics 2020, pages 254–264.
- Firoj Alam and Giuseppe Riccardi. 2014. Fusion of acoustic, linguistic and psycholinguistic features for speaker personality traits recognition. In 2014 IEEE international conference on acoustics, speech and signal processing (ICASSP), pages 955–959. IEEE.
- Multi-label emotion classification on code-mixed text: Data and methods. IEEE Access, 10:8779–8789.
- A dataset for building code-mixed goal oriented conversation systems. In Proceedings of the 27th International Conference on Computational Linguistics, pages 3766–3780, Santa Fe, New Mexico, USA. Association for Computational Linguistics.
- Do multilingual users prefer chat-bots that code-mix? let’s nudge and find out! Proc. ACM Hum.-Comput. Interact., 4(CSCW1).
- Multi-modal sarcasm detection and humor classification in code-mixed conversations. IEEE Transactions on Affective Computing.
- Arlin Benjamin Jr. 2020. Type A/B Personalities.
- Myers Isabel Briggs and Peter B. Myers. 1995. Gifts Differing : Understanding Personality Type. Davies-Black Publishing.
- Humor detection in english-urdu code-mixed language. In 2023 3rd International Conference on Artificial Intelligence (ICAI), pages 26–31. IEEE.
- James N. Butcher and Carolyn L. Williams. 2009. Personality assessment with the mmpi-2: Historical roots, international adaptations, and current challenges. Applied Psychology: Health and Well-Being, 1(1):105–135.
- Fabio Celli and Bruno Lepri. 2018. Is big five better than mbti? a personality computing challenge using twitter data. In CLiC-it.
- Listener’s social identity matters in personalised response generation. In Proceedings of the 13th International Conference on Natural Language Generation, pages 205–215, Dublin, Ireland. Association for Computational Linguistics.
- Listener’s social identity matters in personalised response generation. arXiv preprint arXiv:2010.14342.
- A survey on dialogue systems: Recent advances and new frontiers. SIGKDD Explor. Newsl., 19(2):25–35.
- Paul T Costa and Robert R McCrae. 1992. Normal personality assessment in clinical practice: The neo personality inventory. Psychological assessment, 4(1):5.
- Paul T Costa Jr and Robert R McCrae. 2008. The Revised Neo Personality Inventory (neo-pi-r). Sage Publications, Inc.
- J M Digman. 1990. Personality structure: Emergence of the five-factor model. Annual Review of Psychology, 41(1):417–440.
- Wizard of wikipedia: Knowledge-powered conversational agents. arXiv preprint arXiv:1811.01241.
- A survey of natural language generation. ACM Comput. Surv., 55(8).
- Suman Dowlagar and Radhika Mamidi. 2023. A code-mixed task-oriented dialog dataset for medical domain. Computer Speech and Language, 78:101449.
- A survey of response generation of dialogue systems. International Journal of Computer and Information Engineering, 14(12):461–472.
- Multitask learning for multilingual intent detection and slot filling in dialogue systems. Information Fusion, 91:299–315.
- Predicting personality with social media. In CHI’11 extended abstracts on human factors in computing systems, pages 253–262.
- Alexa, let’s work together: Introducing the first alexa prize taskbot challenge on conversational task assistance.
- Axial attention in multidimensional transformers.
- Personalization in goal-oriented dialog. arXiv preprint arXiv:1706.07503.
- Gabriele Kasper and Johannes Wagner. 2014. Conversation analysis in applied linguistics. Annual Review of Applied Linguistics, 34:171–212.
- Humor detection in English-Hindi code-mixed social media content : Corpus and baseline system. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan. European Language Resources Association (ELRA).
- Private traits and attributes are predictable from digital records of human behavior. Proceedings of the national academy of sciences, 110(15):5802–5805.
- Dialogue agents 101: A beginner’s guide to critical ingredients for designing effective conversational systems.
- When did you become so smart, oh wise one?! sarcasm explanation in multi-modal multi-party dialogues. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 5956–5968, Dublin, Ireland. Association for Computational Linguistics.
- Explaining (sarcastic) utterances to enhance affect understanding in multimodal dialogues. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 37, pages 12986–12994.
- BART: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 7871–7880, Online. Association for Computational Linguistics.
- A persona-based neural conversation model. arXiv preprint arXiv:1603.06155.
- Chin-Yew Lin. 2004. ROUGE: A package for automatic evaluation of summaries. In Text Summarization Branches Out, pages 74–81, Barcelona, Spain. Association for Computational Linguistics.
- Multilingual denoising pre-training for neural machine translation. Transactions of the Association for Computational Linguistics, 8:726–742.
- Ro{bert}a: A robustly optimized {bert} pretraining approach.
- Attention-informed mixed-language training for zero-shot cross-lingual task-oriented dialogue systems. Proceedings of the AAAI Conference on Artificial Intelligence, 34(05):8433–8440.
- Managing speaker identity and user profiles in a spoken dialogue system. Procesamiento del lenguaje natural, (43):77–84.
- Detecting offensive speech in conversational code-mixed dialogue on social media: A contextual dataset and benchmark experiments. Expert Systems with Applications, 215:119342.
- Using linguistic cues for the automatic recognition of personality in conversation and text. Journal of artificial intelligence research, 30:457–500.
- Mary L McHugh. 2012. Interrater reliability: the kappa statistic. Biochemia medica, 22(3):276–282.
- Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781.
- Overview of the hasoc subtrack at fire 2021: Hate speech and offensive content identification in english and indo-aryan languages and conversational hate speech. In Proceedings of the 13th Annual Meeting of the Forum for Information Retrieval Evaluation, pages 1–3.
- Bleu: a method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pages 311–318, Philadelphia, Pennsylvania, USA. Association for Computational Linguistics.
- Exploring the limits of transfer learning with a unified text-to-text transformer. J. Mach. Learn. Res., 21(1).
- Recipes for building an open-domain chatbot. arXiv preprint arXiv:2004.13637.
- Personality, gender, and age in the language of social media: The open-vocabulary approach. PloS one, 8(9):e73791.
- Get to the point: Summarization with pointer-generator networks. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1073–1083, Vancouver, Canada. Association for Computational Linguistics.
- Knowing what to say: Towards knowledge grounded code-mixed response generation for open-domain conversations. Knowledge-Based Systems, 249:108900.
- Empathic response generation in chatbots. In Proceedings of 4th Swiss Text Analytics Conference (SwissText 2019), 18-19 June 2019, Wintherthur, Switzerland. 18-19 June 2019.
- Naf’an Tarihoran and Iin Ratna Sumirat. 2022. The impact of social media on the use of code mixing by generation z. International Journal of Interactive Mobile Technologies (iJIM), 16(7):54–69.
- Mary WJ Tay. 1989. Code switching and code mixing as a communicative strategy in multilingual discourse. World Englishes, 8(3):407–417.
- William Turnbull. 2003. Language in action: Psychological models of conversation. Routledge.
- Attention is all you need. In Advances in Neural Information Processing Systems, volume 30. Curran Associates, Inc.
- Retrieve and refine: Improved sequence generation models for dialogue. arXiv preprint arXiv:1808.04776.
- Context-aware self-attention networks. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01):387–394.
- Personalizing dialogue agents: I have a dog, do you have pets too? In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 2204–2213, Melbourne, Australia. Association for Computational Linguistics.
- Personalizing dialogue agents: I have a dog, do you have pets too? arXiv preprint arXiv:1801.07243.
- Bertscore: Evaluating text generation with bert. In International Conference on Learning Representations.
- Shivani Kumar (13 papers)
- Tanmoy Chakraborty (224 papers)