EventNet-ITA: Italian Frame Parsing for Events (2305.10892v2)
Abstract: This paper introduces EventNet-ITA, a large, multi-domain corpus annotated full-text with event frames for Italian. Moreover, we present and thoroughly evaluate an efficient multi-label sequence labeling approach for Frame Parsing. Covering a wide range of individual, social and historical phenomena, with more than 53,000 annotated sentences and over 200 modeled frames, EventNet-ITA constitutes the first systematic attempt to provide the Italian language with a publicly available resource for Frame Parsing of events, useful for a broad spectrum of research and application tasks. Our approach achieves a promising 0.9 strict F1-score for frame classification and 0.72 for frame element classification, on top of minimizing computational requirements. The annotated corpus and the frame parsing model are released under open license.
- David Ahn. 2006. The stages of event extraction. In Proceedings of the Workshop on Annotating and Reasoning about Time and Events, pages 1–8.
- The berkeley framenet project. In Proceedings of the 17th international conference on Computational linguistics-Volume 1, pages 86–90. Association for Computational Linguistics.
- Developing a large scale framenet for italian: the iframenet experience. CLiC-it 2017 11-12 December 2017, Rome, page 59.
- Evalita 2011: The frame labelingover italian texts task. In International Workshop on Evaluation of Natural Language and Speech Tool for Italian, pages 195–204. Springer.
- Automatic induction of framenet lexical units in italian. In CEUR WORKSHOP PROCEEDINGS, volume 2769. CEUR-WS.
- Tommaso Caselli. 2018. Italian event detection goes deep learning. arXiv preprint arXiv:1810.02229.
- Annotating events, temporal expressions and relations in italian: the it-timeml experience for the ita-timebank. In Proceedings of the 5th Linguistic Annotation Workshop, pages 143–151. Association for Computational Linguistics.
- Eventi: Evaluation of events and temporal information at evalita 2014. EVENTI: EValuation of Events and Temporal INformation at Evalita 2014, pages 27–34.
- Event extraction via dynamic multi-pooling convolutional neural networks. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 167–176.
- Jacob Cohen. 1960. A coefficient of agreement for nominal scales. Educational and psychological measurement, 20(1):37–46.
- Linguistic Data Consortium et al. 2005. Ace (automatic content extraction) english annotation guidelines for events. version 5.4. 3. ACE.
- Agata Cybulska and Piek Vossen. 2011. Historical event extraction from text. In Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, pages 39–43.
- Frame-semantic parsing. Computational linguistics, 40(1):9–56.
- A global database of historic and real-time flood events based on social media. Scientific data, 6(1):311.
- BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 4171–4186, Minneapolis, Minnesota. Association for Computational Linguistics.
- The automatic content extraction (ace) program-tasks, data, and evaluation. In Lrec, volume 2, page 1. Lisbon.
- Overview of linguistic resources for the tac kbp 2015 evaluations: Methodologies and results. In Tac.
- Charles J Fillmore and Collin F Baker. 2001. Frame semantics for text understanding. In Proceedings of WordNet and Other Lexical Resources Workshop, NAACL, volume 6.
- Charles J Fillmore et al. 1976. Frame semantics and the nature of language. In Annals of the New York Academy of Sciences: Conference on the origin and development of language and speech, volume 280, pages 20–32. New York.
- Biomedical event extraction with hierarchical knowledge graphs. In Findings of the Association for Computational Linguistics: EMNLP 2020, pages 1277–1285.
- T-pas: A resource of corpus-derived types predicate-argument structures for linguistic analysis and semantic processing. In Proceedings of LREC, pages 890–895.
- Overview of tac-kbp2016 tri-lingual edl and its impact on end-to-end cold-start kbp. Proceedings of TAC.
- Historical thesaurus of the Oxford English dictionary. Oxford University Press.
- Event extraction from historical texts: A new dataset for black rebellions. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, pages 2390–2400, Online. Association for Computational Linguistics.
- Building an italian framenet through semi-automatic corpus analysis. In LREC.
- Lexit: A computational resource on italian argument structure. In LREC, pages 3712–3718.
- Enriching the isst-tanl corpus with semantic frames. In LREC, pages 3719–3726.
- Biomedical event extraction based on knowledge-driven tree-lstm. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 1421–1430.
- Shasha Liao and Ralph Grishman. 2010. Using document level cross-event inference to improve event extraction. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pages 789–797. Association for Computational Linguistics.
- Text2event: Controllable sequence-to-structure generation for end-to-end event extraction. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 2795–2806.
- Scent mining: Extracting olfactory events, smell sources and qualities. In Proceedings of the 7th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, pages 135–140.
- Sociofillmore: A tool for discovering perspectives. In The 60th Annual Meeting of the Association for Computational Linguistics Proceedings of System Demonstrations, pages 240–250. Association for Computational Linguistics.
- Frame semantics for social nlp in italian: Analyzing responsibility framing in femicide news reports. In Italian Conference on Computational Linguistics 2021: CLiC-it 2021. CEUR Workshop Proceedings (CEUR-WS. org).
- Event nugget annotation: Processes and issues. In Proceedings of the The 3rd Workshop on EVENTS: Definition, Detection, Coreference, and Representation, pages 66–76.
- Joint event extraction via recurrent neural networks. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 300–309.
- Trung Minh Nguyen and Thien Huu Nguyen. 2019. One for all: Neural joint modeling of entities and events. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 33, pages 6851–6858.
- Event detection with neural networks: A rigorous empirical evaluation. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 999–1004, Brussels, Belgium. Association for Computational Linguistics.
- Alessio Palmero Aprosio and Giovanni Moretti. 2018. Tint 2.0: an all-inclusive suite for nlp in italian. In Proceedings of the Fifth Italian Conference on Computational Linguistics (CLiC-it 2018).
- Structured prediction as translation between augmented natural languages. arXiv preprint arXiv:2101.05779.
- Biomedical event extraction as sequence labeling. In Proceedings of the 2020 conference on empirical methods in natural language processing (emnlp), pages 5357–5367.
- Event-based access to historical italian war memoirs. Journal on Computing and Cultural Heritage (JOCCH), 14(1):1–23.
- Framenet ii: Extended theory and practice.
- Framenet ii: Extended theory and practice. Technical report, International Computer Science Institute.
- Timeml annotation guidelines. Version, 1(1):31.
- Hacking history via event extraction. In Proceedings of the sixth international conference on Knowledge capture, pages 161–162.
- Literary event detection. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 3623–3634, Florence, Italy. Association for Computational Linguistics.
- Rachele Sprugnoli and Sara Tonelli. 2017. One, no one and one hundred thousand events: Defining and processing events in an inter-disciplinary perspective. Natural Language Engineering, 23(4):485–506.
- Rachele Sprugnoli and Sara Tonelli. 2019. Novel event detection and classification for historical texts. Computational Linguistics, 45(2):229–265.
- Frame-semantic parsing with softmax-margin segmental rnns and a syntactic scaffold. arXiv preprint arXiv:1706.09528.
- Syntactic scaffolds for semantic structures. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 3772–3782.
- Sara Tonelli and Emanuele Pianta. 2008. Frame information transfer from english to italian. In 6th International Conference on Language Resources and Evaluation (LREC 2008).
- Semi-automatic development of framenet for italian. In Proceedings of the FrameNet Workshop and Masterclass, Milano, Italy.
- Massive choice, ample tasks (MaChAmp): A toolkit for multi-task learning in NLP. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, pages 176–197, Online. Association for Computational Linguistics.
- Ace 2005 multilingual training corpus. Linguistic Data Consortium, Philadelphia, 57:45.