SIG: Speaker Identification in Literature via Prompt-Based Generation (2312.14590v2)
Abstract: Identifying speakers of quotations in narratives is an important task in literary analysis, with challenging scenarios including the out-of-domain inference for unseen speakers, and non-explicit cases where there are no speaker mentions in surrounding context. In this work, we propose a simple and effective approach SIG, a generation-based method that verbalizes the task and quotation input based on designed prompt templates, which also enables easy integration of other auxiliary tasks that further bolster the speaker identification performance. The prediction can either come from direct generation by the model, or be determined by the highest generation probability of each speaker candidate. Based on our approach design, SIG supports out-of-domain evaluation, and achieves open-world classification paradigm that is able to accept any forms of candidate input. We perform both cross-domain evaluation and in-domain evaluation on PDNC, the largest dataset of this task, where empirical results suggest that SIG outperforms previous baselines of complicated designs, as well as the zero-shot ChatGPT, especially excelling at those hard non-explicit scenarios by up to 17% improvement. Additional experiments on another dataset WP further corroborate the efficacy of SIG.
- An Annotated Dataset of Coreference in English Literature. In Calzolari, N.; Béchet, F.; Blache, P.; Choukri, K.; Cieri, C.; Declerck, T.; Goggi, S.; Isahara, H.; Maegaard, B.; Mariani, J.; Mazo, H.; Moreno, A.; Odijk, J.; and Piperidis, S., eds., Proceedings of The 12th Language Resources and Evaluation Conference, LREC 2020, Marseille, France, May 11-16, 2020, 44–54. European Language Resources Association.
- Language Models are Few-Shot Learners. In Larochelle, H.; Ranzato, M.; Hadsell, R.; Balcan, M.; and Lin, H., eds., Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual.
- A Chinese Dataset for Identifying Speakers in Novels. In INTERSPEECH, 1561–1565. Graz, Austria.
- A Neural-Network-Based Approach to Identifying Speakers in Novels. In Hermansky, H.; Cernocký, H.; Burget, L.; Lamel, L.; Scharenborg, O.; and Motlícek, P., eds., Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021, 4114–4118. ISCA.
- A Neural-Network-Based Approach to Identifying Speakers in Novels. In Interspeech, 4114–4118.
- Template-Based Named Entity Recognition Using BART. In Zong, C.; Xia, F.; Li, W.; and Navigli, R., eds., Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, Online Event, August 1-6, 2021, volume ACL/IJCNLP 2021 of Findings of ACL, 1835–1845. Association for Computational Linguistics.
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Burstein, J.; Doran, C.; and Solorio, T., eds., Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, Minneapolis, MN, USA, June 2-7, 2019, Volume 1 (Long and Short Papers), 4171–4186. Association for Computational Linguistics.
- Automatic Attribution of Quoted Speech in Literary Narrative. In Fox, M.; and Poole, D., eds., Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2010, Atlanta, Georgia, USA, July 11-15, 2010. AAAI Press.
- BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 7871–7880. Online: Association for Computational Linguistics.
- RoBERTa: A Robustly Optimized BERT Pretraining Approach. CoRR, abs/1907.11692.
- Text2Event: Controllable Sequence-to-Structure Generation for End-to-end Event Extraction. In Zong, C.; Xia, F.; Li, W.; and Navigli, R., eds., Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), 2795–2806. Online: Association for Computational Linguistics.
- Graph Refinement for Coreference Resolution. In Findings of the Association for Computational Linguistics: ACL 2022, 2732–2742. Dublin, Ireland: Association for Computational Linguistics.
- A Two-stage Sieve Approach for Quote Attribution. In Lapata, M.; Blunsom, P.; and Koller, A., eds., Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017, Valencia, Spain, April 3-7, 2017, Volume 1: Long Papers, 460–470. Association for Computational Linguistics.
- A Chapter-Wise Understanding System for Text-To-Speech in Chinese Novels. In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021, Toronto, ON, Canada, June 6-11, 2021, 6069–6073. IEEE.
- CoNLL-2012 Shared Task: Modeling Multilingual Unrestricted Coreference in OntoNotes. In Joint Conference on EMNLP and CoNLL - Shared Task, 1–40. Jeju Island, Korea: Association for Computational Linguistics.
- TVShowGuess: Character Comprehension in Stories as Speaker Guessing. In Carpuat, M.; de Marneffe, M.-C.; and Meza Ruiz, I. V., eds., Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 4267–4287. Seattle, United States: Association for Computational Linguistics.
- Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference. CoRR, abs/2001.07676.
- Automatic Conversion of a Chinese Fairy Story into a Script - A Preliminary Report and Proposal. In 2019 International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2019, Kaohsiung, Taiwan, November 21-23, 2019, 1–6. IEEE.
- The Project Dialogism Novel Corpus: A Dataset for Quotation Attribution in Literary Texts. In Calzolari, N.; Béchet, F.; Blache, P.; Choukri, K.; Cieri, C.; Declerck, T.; Goggi, S.; Isahara, H.; Maegaard, B.; Mariani, J.; Mazo, H.; Odijk, J.; and Piperidis, S., eds., Proceedings of the Thirteenth Language Resources and Evaluation Conference, LREC 2022, Marseille, France, 20-25 June 2022, 5838–5848. European Language Resources Association.
- Improving Automatic Quotation Attribution in Literary Novels. In Rogers, A.; Boyd-Graber, J. L.; and Okazaki, N., eds., Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), ACL 2023, Toronto, Canada, July 9-14, 2023, 737–746. Association for Computational Linguistics.
- Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. In Koyejo, S.; Mohamed, S.; Agarwal, A.; Belgrave, D.; Cho, K.; and Oh, A., eds., Advances in Neural Information Processing Systems, volume 35, 24824–24837. Curran Associates, Inc.
- Semisupervised Feature Selection Based on Relevance and Redundancy Criteria. IEEE Transactions on Neural Networks and Learning Systems, 28(9): 1974–1984.
- Revealing the Myth of Higher-Order Inference in Coreference Resolution. In Webber, B.; Cohn, T.; He, Y.; and Liu, Y., eds., Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), 8527–8533. Online: Association for Computational Linguistics.
- FanfictionNLP: A Text Processing Pipeline for Fanfiction. In Proceedings of the Third Workshop on Narrative Understanding, 13–23. Virtual: Association for Computational Linguistics.
- End-to-End Chinese Speaker Identification. In Carpuat, M.; de Marneffe, M.; and Ruíz, I. V. M., eds., Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2022, Seattle, WA, United States, July 10-15, 2022, 2274–2285. Association for Computational Linguistics.
- End-to-End Chinese Speaker Identification. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2274–2285. Seattle, United States: Association for Computational Linguistics.
- Few-Shot Character Understanding in Movies as an Assessment to Meta-Learning of Theory-of-Mind. arXiv:2211.04684.