Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
143 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

EventNet-ITA: Italian Frame Parsing for Events (2305.10892v2)

Published 18 May 2023 in cs.CL

Abstract: This paper introduces EventNet-ITA, a large, multi-domain corpus annotated full-text with event frames for Italian. Moreover, we present and thoroughly evaluate an efficient multi-label sequence labeling approach for Frame Parsing. Covering a wide range of individual, social and historical phenomena, with more than 53,000 annotated sentences and over 200 modeled frames, EventNet-ITA constitutes the first systematic attempt to provide the Italian language with a publicly available resource for Frame Parsing of events, useful for a broad spectrum of research and application tasks. Our approach achieves a promising 0.9 strict F1-score for frame classification and 0.72 for frame element classification, on top of minimizing computational requirements. The annotated corpus and the frame parsing model are released under open license.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (54)
  1. David Ahn. 2006. The stages of event extraction. In Proceedings of the Workshop on Annotating and Reasoning about Time and Events, pages 1–8.
  2. The berkeley framenet project. In Proceedings of the 17th international conference on Computational linguistics-Volume 1, pages 86–90. Association for Computational Linguistics.
  3. Developing a large scale framenet for italian: the iframenet experience. CLiC-it 2017 11-12 December 2017, Rome, page 59.
  4. Evalita 2011: The frame labelingover italian texts task. In International Workshop on Evaluation of Natural Language and Speech Tool for Italian, pages 195–204. Springer.
  5. Automatic induction of framenet lexical units in italian. In CEUR WORKSHOP PROCEEDINGS, volume 2769. CEUR-WS.
  6. Tommaso Caselli. 2018. Italian event detection goes deep learning. arXiv preprint arXiv:1810.02229.
  7. Annotating events, temporal expressions and relations in italian: the it-timeml experience for the ita-timebank. In Proceedings of the 5th Linguistic Annotation Workshop, pages 143–151. Association for Computational Linguistics.
  8. Eventi: Evaluation of events and temporal information at evalita 2014. EVENTI: EValuation of Events and Temporal INformation at Evalita 2014, pages 27–34.
  9. Event extraction via dynamic multi-pooling convolutional neural networks. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 167–176.
  10. Jacob Cohen. 1960. A coefficient of agreement for nominal scales. Educational and psychological measurement, 20(1):37–46.
  11. Linguistic Data Consortium et al. 2005. Ace (automatic content extraction) english annotation guidelines for events. version 5.4. 3. ACE.
  12. Agata Cybulska and Piek Vossen. 2011. Historical event extraction from text. In Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, pages 39–43.
  13. Frame-semantic parsing. Computational linguistics, 40(1):9–56.
  14. A global database of historic and real-time flood events based on social media. Scientific data, 6(1):311.
  15. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 4171–4186, Minneapolis, Minnesota. Association for Computational Linguistics.
  16. The automatic content extraction (ace) program-tasks, data, and evaluation. In Lrec, volume 2, page 1. Lisbon.
  17. Overview of linguistic resources for the tac kbp 2015 evaluations: Methodologies and results. In Tac.
  18. Charles J Fillmore and Collin F Baker. 2001. Frame semantics for text understanding. In Proceedings of WordNet and Other Lexical Resources Workshop, NAACL, volume 6.
  19. Charles J Fillmore et al. 1976. Frame semantics and the nature of language. In Annals of the New York Academy of Sciences: Conference on the origin and development of language and speech, volume 280, pages 20–32. New York.
  20. Biomedical event extraction with hierarchical knowledge graphs. In Findings of the Association for Computational Linguistics: EMNLP 2020, pages 1277–1285.
  21. T-pas: A resource of corpus-derived types predicate-argument structures for linguistic analysis and semantic processing. In Proceedings of LREC, pages 890–895.
  22. Overview of tac-kbp2016 tri-lingual edl and its impact on end-to-end cold-start kbp. Proceedings of TAC.
  23. Historical thesaurus of the Oxford English dictionary. Oxford University Press.
  24. Event extraction from historical texts: A new dataset for black rebellions. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, pages 2390–2400, Online. Association for Computational Linguistics.
  25. Building an italian framenet through semi-automatic corpus analysis. In LREC.
  26. Lexit: A computational resource on italian argument structure. In LREC, pages 3712–3718.
  27. Enriching the isst-tanl corpus with semantic frames. In LREC, pages 3719–3726.
  28. Biomedical event extraction based on knowledge-driven tree-lstm. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 1421–1430.
  29. Shasha Liao and Ralph Grishman. 2010. Using document level cross-event inference to improve event extraction. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pages 789–797. Association for Computational Linguistics.
  30. Text2event: Controllable sequence-to-structure generation for end-to-end event extraction. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 2795–2806.
  31. Scent mining: Extracting olfactory events, smell sources and qualities. In Proceedings of the 7th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, pages 135–140.
  32. Sociofillmore: A tool for discovering perspectives. In The 60th Annual Meeting of the Association for Computational Linguistics Proceedings of System Demonstrations, pages 240–250. Association for Computational Linguistics.
  33. Frame semantics for social nlp in italian: Analyzing responsibility framing in femicide news reports. In Italian Conference on Computational Linguistics 2021: CLiC-it 2021. CEUR Workshop Proceedings (CEUR-WS. org).
  34. Event nugget annotation: Processes and issues. In Proceedings of the The 3rd Workshop on EVENTS: Definition, Detection, Coreference, and Representation, pages 66–76.
  35. Joint event extraction via recurrent neural networks. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 300–309.
  36. Trung Minh Nguyen and Thien Huu Nguyen. 2019. One for all: Neural joint modeling of entities and events. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 33, pages 6851–6858.
  37. Event detection with neural networks: A rigorous empirical evaluation. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 999–1004, Brussels, Belgium. Association for Computational Linguistics.
  38. Alessio Palmero Aprosio and Giovanni Moretti. 2018. Tint 2.0: an all-inclusive suite for nlp in italian. In Proceedings of the Fifth Italian Conference on Computational Linguistics (CLiC-it 2018).
  39. Structured prediction as translation between augmented natural languages. arXiv preprint arXiv:2101.05779.
  40. Biomedical event extraction as sequence labeling. In Proceedings of the 2020 conference on empirical methods in natural language processing (emnlp), pages 5357–5367.
  41. Event-based access to historical italian war memoirs. Journal on Computing and Cultural Heritage (JOCCH), 14(1):1–23.
  42. Framenet ii: Extended theory and practice.
  43. Framenet ii: Extended theory and practice. Technical report, International Computer Science Institute.
  44. Timeml annotation guidelines. Version, 1(1):31.
  45. Hacking history via event extraction. In Proceedings of the sixth international conference on Knowledge capture, pages 161–162.
  46. Literary event detection. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 3623–3634, Florence, Italy. Association for Computational Linguistics.
  47. Rachele Sprugnoli and Sara Tonelli. 2017. One, no one and one hundred thousand events: Defining and processing events in an inter-disciplinary perspective. Natural Language Engineering, 23(4):485–506.
  48. Rachele Sprugnoli and Sara Tonelli. 2019. Novel event detection and classification for historical texts. Computational Linguistics, 45(2):229–265.
  49. Frame-semantic parsing with softmax-margin segmental rnns and a syntactic scaffold. arXiv preprint arXiv:1706.09528.
  50. Syntactic scaffolds for semantic structures. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 3772–3782.
  51. Sara Tonelli and Emanuele Pianta. 2008. Frame information transfer from english to italian. In 6th International Conference on Language Resources and Evaluation (LREC 2008).
  52. Semi-automatic development of framenet for italian. In Proceedings of the FrameNet Workshop and Masterclass, Milano, Italy.
  53. Massive choice, ample tasks (MaChAmp): A toolkit for multi-task learning in NLP. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, pages 176–197, Online. Association for Computational Linguistics.
  54. Ace 2005 multilingual training corpus. Linguistic Data Consortium, Philadelphia, 57:45.
Citations (2)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com