Hyperbolic Relevance Matching for Neural Keyphrase Extraction (2205.02047v2)
Abstract: Keyphrase extraction is a fundamental task in natural language processing and information retrieval that aims to extract a set of phrases with important information from a source document. Identifying important keyphrase is the central component of the keyphrase extraction task, and its main challenge is how to represent information comprehensively and discriminate importance accurately. In this paper, to address these issues, we design a new hyperbolic matching model (HyperMatch) to represent phrases and documents in the same hyperbolic space and explicitly estimate the phrase-document relevance via the Poincar\'e distance as the important score of each phrase. Specifically, to capture the hierarchical syntactic and semantic structure information, HyperMatch takes advantage of the hidden representations in multiple layers of RoBERTa and integrates them as the word embeddings via an adaptive mixing layer. Meanwhile, considering the hierarchical structure hidden in the document, HyperMatch embeds both phrases and documents in the same hyperbolic space via a hyperbolic phrase encoder and a hyperbolic document encoder. This strategy can further enhance the estimation of phrase-document relevance due to the good properties of hyperbolic space. In this setting, the keyphrase extraction can be taken as a matching problem and effectively implemented by minimizing a hyperbolic margin-based triplet loss. Extensive experiments are conducted on six benchmarks and demonstrate that HyperMatch outperforms the state-of-the-art baselines.
- Select, extract and generate: Neural keyphrase generation with layer-wise coverage attention. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing., pages 1389–1404.
- Probing bert in hyperbolic spaces. In International Conference on Learning Representations.
- APo-VAE: Text generation in hyperbolic space. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 416–431.
- What’s so special about bert’s layers? a closer look at the nlp pipeline in monolingual and multilingual models. In EMNLP (Findings), pages 4339–4350. Association for Computational Linguistics.
- Bert: Pre-training of deep bidirectional transformers for language understanding. In NAACL-HLT, pages 4171–4186. Association for Computational Linguistics.
- Embedding text in hyperbolic spaces. In TextGraphs@NAACL-HLT, pages 59–69. Association for Computational Linguistics.
- Hyperbolic neural networks. In NeurIPS, pages 5350–5360.
- Extracting key terms from noisy and multitheme documents. In WWW, pages 661–670. ACM.
- Hyperbolic attention networks. In ICLR (Poster). OpenReview.net.
- Matthias Hamann. 2018. On the tree-likeness of hyperbolic spaces. In Mathematical Proceedings of the Cambridge Philosophical Society, volume 164, pages 345–361.
- Kazi Saidul Hasan and Vincent Ng. 2014. Automatic keyphrase extraction: A survey of the state of the art. In ACL (1), pages 1262–1273. The Association for Computer Linguistics.
- C. Hopper and B. Andrews. 2011. The Ricci Flow in Riemannian Geometry.
- Anette Hulth. 2003. Improved automatic keyword extraction given more linguistic knowledge. In EMNLP.
- Anette Hulth. 2004. Enhancing linguistically oriented automatic keyword extraction. In HLT-NAACL (Short Papers). The Association for Computational Linguistics.
- What does bert learn about the structure of language? In ACL (1), pages 3651–3657. Association for Computational Linguistics.
- Karen Spärck Jones. 2004. A statistical interpretation of term specificity and its application in retrieval. J. Documentation, 60(5):493–502.
- Hyperbolic image embeddings. In CVPR, pages 6417–6427. IEEE.
- Semeval-2010 task 5 : Automatic keyphrase extraction from scientific articles. In SemEval@ACL, pages 21–26. The Association for Computer Linguistics.
- Applying graph-based keyword extraction to document retrieval. In IJCNLP, pages 864–868. Asian Federation of Natural Language Processing / ACL.
- M. Krapivin and M. Marchese. 2009. Large dataset for keyphrase extraction.
- On the sentence embeddings from pre-trained language models. In EMNLP (1), pages 9119–9130. Association for Computational Linguistics.
- The geometry of graphs and some of its algorithmic applications. Combinatorica, 15(2):215–245.
- Unsupervised approaches for automatic keyword extraction using meeting transcripts. In HLT-NAACL, pages 620–628. The Association for Computational Linguistics.
- Keyphrase prediction with pre-trained language model.
- Roberta: A robustly optimized bert pretraining approach. CoRR, abs/1907.11692.
- Clustering to find exemplar terms for keyphrase extraction. In EMNLP, pages 257–266. ACL.
- Continuous hierarchical representations with poincaré variational auto-encoders. In NeurIPS, pages 12544–12555.
- Human-competitive tagging using automatic keyphrase extraction. In Internat. Conference of Empirical Methods in Natural Language Processing, EMNLP-2009,.
- Deep keyphrase generation. In ACL, pages 582–592. Association for Computational Linguistics.
- Rada Mihalcea and Paul Tarau. 2004. Textrank: Bringing order into text. In EMNLP, pages 404–411. ACL.
- Keyphrase extraction with span-based feature representations. CoRR, abs/2002.05407.
- Chau Q. Nguyen and Tuoi T. Phan. 2009. An ontology-based approach for key phrase extraction. In ACL/IJCNLP (Short Papers), pages 181–184. The Association for Computer Linguistics.
- Thuy Dung Nguyen and Min-Yen Kan. 2007. Keyphrase extraction in scientific publications. In ICADL, volume 4822 of Lecture Notes in Computer Science, pages 317–326.
- Maximilian Nickel and Douwe Kiela. 2017. Poincaré embeddings for learning hierarchical representations. In NIPS, pages 6338–6347.
- Pytorch: An imperative style, high-performance deep learning library. In NeurIPS, pages 8024–8035.
- Deep contextualized word representations. In NAACL-HLT, pages 2227–2237. Association for Computational Linguistics.
- Nils Reimers and Iryna Gurevych. 2019. Sentence-bert: Sentence embeddings using siamese bert-networks. Cite arxiv:1908.10084Comment: Published at EMNLP 2019.
- Representation tradeoffs for hyperbolic embeddings. CoRR, abs/1804.03329.
- Rik Sarkar. 2011. Low distortion delaunay embedding of trees in hyperbolic plane. In Graph Drawing, volume 7034 of Lecture Notes in Computer Science, pages 355–366.
- Importance Estimation from Multiple Perspectives for Keyphrase Extraction. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, Online and Punta Cana, Dominican Republic. Association for Computational Linguistics.
- Joint keyphrase chunking and salience ranking with bert. CoRR, abs/2004.13639.
- Divgraphpointer: A graph pointer network for extracting diverse keyphrases. In SIGIR, pages 755–764.
- Poincare glove: Hyperbolic word embeddings. In ICLR (Poster). OpenReview.net.
- Xiaojun Wan and Jianguo Xiao. 2008. Collabrank: Towards a collaborative approach to single-document keyphrase extraction. In COLING, pages 969–976.
- Incorporating multimodal information in open-domain web keyphrase extraction. In EMNLP (1), pages 1790–1800. Association for Computational Linguistics.
- Bilateral multi-perspective matching for natural language sentences. In IJCAI, pages 4144–4150.
- Huggingface’s transformers: State-of-the-art natural language processing. CoRR, abs/1910.03771.
- Unikeyphrase: A unified extraction and generation framework for keyphrase prediction. CoRR, abs/2106.04847.
- Open domain web keyphrase extraction beyond language modeling. In EMNLP/IJCNLP (1), pages 5174–5183. Association for Computational Linguistics.
- Keyphrase extraction with dynamic graph convolutional networks and diversified inference. CoRR, abs/2010.12828.
- Extractive summarization as text matching. In ACL, pages 6197–6208. Association for Computational Linguistics.
- Parsing all: Syntax and semantics, dependencies and spans. In EMNLP (Findings), pages 4438–4449. Association for Computational Linguistics.
- Hypertext: Endowing fasttext with hyperbolic geometry. In EMNLP (Findings), pages 1166–1171. Association for Computational Linguistics.