TacoERE: Cluster-aware Compression for Event Relation Extraction (2405.06890v1)
Abstract: Event relation extraction (ERE) is a critical and fundamental challenge for natural language processing. Existing work mainly focuses on directly modeling the entire document, which cannot effectively handle long-range dependencies and information redundancy. To address these issues, we propose a cluster-aware compression method for improving event relation extraction (TacoERE), which explores a compression-then-extraction paradigm. Specifically, we first introduce document clustering for modeling event dependencies. It splits the document into intra- and inter-clusters, where intra-clusters aim to enhance the relations within the same cluster, while inter-clusters attempt to model the related events at arbitrary distances. Secondly, we utilize cluster summarization to simplify and highlight important text content of clusters for mitigating information redundancy and event distance. We have conducted extensive experiments on both pre-trained LLMs, such as RoBERTa, and LLMs, such as ChatGPT and GPT-4, on three ERE datasets, i.e., MAVEN-ERE, EventStoryLine and HiEve. Experimental results demonstrate that TacoERE is an effective method for ERE.
- Docbert: Bert for document classification. arXiv preprint arXiv:1904.08398.
- Docbert: Bert for document classification. ArXiv, abs/1904.08398.
- Longformer: The long-document transformer. arXiv preprint arXiv:2004.05150.
- Knowledge-enriched event causality identification via latent structure induction networks. In Proceedings of ACL-IJCNLP, pages 4862–4872.
- Faithful to the original: Fact aware neural abstractive summarization. In Proceedings of AAAI, 32.
- ERGO: Event relational graph transformer for document-level event causality identification. In Proceedings of COLING, pages 2118–2128.
- Palm: Scaling language modeling with pathways. arXiv preprint arXiv:2204.02311.
- BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of NAACL, pages 4171–4186.
- GSum: A general framework for guided neural abstractive summarization. In Proceedings of NAACL, pages 4830–4842.
- Towards event-level causal relation identification. In Proceedings of SIGIR, page 1828–1833.
- Modeling document-level causal structures for event causal relation identification. In Proceedings of NAACL, pages 1808–1817.
- Making pre-trained language models better few-shot learners. In Proceedings of ACL-IJCNLP, pages 3816–3830.
- Deep feature-based text clustering and its explanation. IEEE Transactions on Knowledge and Data Engineering, 34(8):3669–3680.
- Frame semantic-enhanced sentence modeling for sentence-level extractive text summarization. In Proceedings of EMNLP, pages 4045–4052.
- Frame semantics guided network for abstractive sentence summarization. Knowledge-Based Systems, 221:106973.
- EventOA: An event ontology alignment benchmark based on FrameNet and Wikidata. In Findings of the ACL, pages 10038–10052.
- Joint event and temporal relation extraction with shared representations and structured prediction. In Proceedings of EMNLP-IJCNLP, pages 434–444.
- Marti A. Hearst and Christian Plaunt. 1993. Subtopic structuring for full-length document access. In Proceedings of the SIGIR, page 59–68.
- Question answering as global reasoning over semantic abstractions. In In Proceedings of AAAI.
- Chin-Yew Lin. 2004. ROUGE: A package for automatic evaluation of summaries. In Text Summarization Branches Out, pages 74–81.
- Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692.
- Selecting optimal context sentences for event-event relation extraction. In Proceedings of AAAI, 36(10):11058–11066.
- A well-composed text is half done! composition sampling for diverse conditional generation. In Proceedings of ACL, pages 1319–1339.
- Planning with learned entity prompts for abstractive summarization. In Proceedings of TACL, 9:1475–1492.
- Joint reasoning for temporal and causal relations. In Proceedings of ACL, pages 2278–2288.
- OpenAI. 2023. Gpt-4 technical report.
- Setrank: Learning a permutation-invariant ranking model for information retrieval. In Proceedings of SIGIR, pages 499–508.
- Ramakanth Pasunuru and Mohit Bansal. 2018. Multi-reward reinforced summarization with saliency and entailment. In Proceedings of the NAACL, pages 646–653.
- A deep reinforced model for abstractive summarization. In Proceedings of the ICLR.
- Enhancement of short text clustering by iterative classification. In Natural Language Processing and Information Systems, pages 105–117.
- Get to the point: Summarization with pointer-generator networks. In Proceedings of ACL, pages 1073–1083.
- Lamda: Language models for dialog applications. arXiv preprint arXiv:2201.08239.
- Minh Tran Phu and Thien Huu Nguyen. 2021. Graph convolutional networks for event causality identification with rich document-level structures. In Proceedings of NAACL, pages 3480–3490.
- Attention is all you need. In Proceedings of NeurIPS, 30.
- Joint constrained learning for event-event relation extraction. In Proceedings of the EMNLP, pages 696–706.
- BiSET: Bi-directional selective encoding with template for abstractive summarization. In Proceedings of ACL, pages 2153–2162.
- MAVEN-ERE: A unified large-scale dataset for event coreference, temporal, causal, and subevent relation extraction. In Proceedings of the EMNLP, pages 926–941.
- Ronald J. Williams. 1992. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Mach. Learn., 8(3–4):229–256.
- Document-level relation extraction with sentences importance estimation and focusing. In Proceedings of NAACL, pages 2920–2929.
- Discriminative reasoning for document-level relation extraction. In Findings of ACL-IJCNLP, pages 1653–1663.
- Temporal common sense acquisition with minimal supervision. In Proceedings of ACL, pages 7579–7589.
- Document-level relation extraction with adaptive thresholding and localized context pooling. In Proceedings of AAAI, 35(16):14612–14620.
- The Storyline Annotation and Representation Scheme (StaR): A Proposal. In Proceedings of the 2nd Workshop on Computing News Storylines. PID https://github.com/tommasoc80/EventStoryLine.
- Goran Glavas and Jan Snajder and Marie-Francine Moens and Parisa KordJamshidi. 2014. HiEve: A Corpus for Extracting Event Hierarchies from News Stories. In Proceedings of LREC. PID http://takelab.fer.hr/hievents.rar.
- MAVEN-ERE: A Unified Large-scale Dataset for Event Coreference, Temporal, Causal, and Subevent Relation Extraction. In Proceedings of EMNLP. PID https://github.com/THU-KEG/MAVEN-ERE.
- Yong Guan (18 papers)
- Xiaozhi Wang (51 papers)
- Lei Hou (127 papers)
- Juanzi Li (144 papers)
- Jeff Pan (4 papers)
- Jiaoyan Chen (85 papers)
- Freddy Lecue (36 papers)