ILCiteR: Evidence-grounded Interpretable Local Citation Recommendation (2403.08737v1)
Abstract: Existing machine learning approaches for local citation recommendation directly map or translate a query, typically a claim or an entity mention, to citation-worthy research papers. Under such a formulation, it is hard to pinpoint why a specific paper should be cited for a particular query, which limits recommendation interpretability. To address this, we introduce the evidence-grounded local citation recommendation task, where the target latent space comprises evidence spans for recommending specific papers. Using a distantly-supervised evidence retrieval and multi-step re-ranking framework, our proposed system, ILCiteR, recommends papers to cite for a query, grounded on similar evidence spans extracted from the existing research literature. Unlike past formulations that simply output recommendations, ILCiteR retrieves ranked lists of (evidence span, recommended paper) pairs. Furthermore, previously proposed neural models for citation recommendation require expensive training on massive labeled data, ideally after every significant update to the pool of candidate papers. In contrast, ILCiteR relies solely on distant supervision from a dynamic evidence database and pre-trained Transformer-based LLMs, without any model training. We contribute a novel dataset for the evidence-grounded local citation recommendation task and demonstrate the efficacy of our proposed conditional neural rank-ensembling approach for re-ranking evidence spans.
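To make the evidence-grounded formulation concrete, the following is a minimal sketch of returning ranked (evidence span, recommended paper) pairs for a query from an evidence database, with no model training. It is an illustration only, not the ILCiteR pipeline: bag-of-words cosine similarity stands in for the retrieval and re-ranking stages, and the database entries, paper identifiers, and function names (`EVIDENCE_DB`, `tokenize`, `cosine`, `recommend`) are all hypothetical.

```python
import math
from collections import Counter

# Hypothetical evidence database: each entry pairs an evidence span
# (text that preceded a citation) with the paper that was cited there.
EVIDENCE_DB = [
    ("we fine-tune a pretrained transformer encoder on scientific text", "paper_A"),
    ("bm25 is a strong lexical baseline for document retrieval", "paper_B"),
    ("graph convolutional networks aggregate features from neighbouring nodes", "paper_C"),
]


def tokenize(text):
    """Lowercase whitespace tokenization (a deliberately simple assumption)."""
    return text.lower().split()


def cosine(tokens_a, tokens_b):
    """Cosine similarity between two bag-of-words term-count vectors."""
    counts_a, counts_b = Counter(tokens_a), Counter(tokens_b)
    dot = sum(counts_a[t] * counts_b[t] for t in counts_a)
    norm_a = math.sqrt(sum(v * v for v in counts_a.values()))
    norm_b = math.sqrt(sum(v * v for v in counts_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def recommend(query, db, top_k=2):
    """Return up to top_k (evidence span, paper, score) triples, best first.

    Spans with zero similarity to the query are dropped, so every
    recommendation comes with a span that explains it.
    """
    query_tokens = tokenize(query)
    scored = [
        (cosine(query_tokens, tokenize(span)), span, paper)
        for span, paper in db
    ]
    scored.sort(key=lambda item: item[0], reverse=True)
    return [(span, paper, score) for score, span, paper in scored[:top_k] if score > 0]


results = recommend("lexical retrieval with bm25", EVIDENCE_DB)
for span, paper, score in results:
    print(f"{paper}: {span} (score={score:.2f})")
```

Because the database is only read at query time, adding new (span, paper) pairs changes the recommendations without any retraining, which is the property the abstract attributes to a dynamic evidence database.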