PaperWeaver: Enriching Topical Paper Alerts by Contextualizing Recommended Papers with User-collected Papers (2403.02939v2)
Abstract: With the rapid growth of scholarly archives, researchers subscribe to "paper alert" systems that periodically provide them with recommendations of recently published papers that are similar to previously collected papers. However, researchers sometimes struggle to make sense of nuanced connections between recommended papers and their own research context, as existing systems only present paper titles and abstracts. To help researchers spot these connections, we present PaperWeaver, an enriched paper alerts system that provides contextualized text descriptions of recommended papers based on user-collected papers. PaperWeaver employs a computational method based on LLMs to infer users' research interests from their collected papers, extract context-specific aspects of papers, and compare recommended and collected papers on these aspects. Our user study (N=15) showed that participants using PaperWeaver were able to better understand the relevance of recommended papers and triage them more confidently when compared to a baseline that presented the related work sections from recommended papers.
- Shaaron Ainsworth. 2006. DeFT: A conceptual framework for considering learning with multiple representations. Learning and instruction 16, 3 (2006), 183–198.
- Shaaron Ainsworth. 2008. The educational value of multiple-representations when learning complex scientific concepts. In Visualization: Theory and practice in science education. Springer, 191–208.
- Anthropic. 2023. Introducing Claude 2.1. https://www.anthropic.com/index/claude-2-1 Accessed: 2023-11-21.
- David Paul Ausubel. 2012. The acquisition and retention of knowledge: A cognitive view. Springer Science & Business Media.
- A multitask, multilingual, multimodal evaluation of chatgpt on reasoning, hallucination, and interactivity. arXiv preprint arXiv:2302.04023 (2023).
- SciBERT: A Pretrained Language Model for Scientific Text. In Conference on Empirical Methods in Natural Language Processing. https://api.semanticscholar.org/CorpusID:202558505
- Michael J. Black. 2022. Michael J. Black on Twitter. https://twitter.com/Michael_J_Black/status/1593133722316189696 Accessed: 2023-03-28.
- Ann M Blair. 2010. Too much to know: Managing scholarly information before the modern age. Yale University Press.
- Richard E. Boyatzis. 1998. Transforming Qualitative Information: Thematic Analysis and Code Development.
- TLDR: Extreme Summarization of Scientific Documents. ArXiv abs/2004.15011 (2020). https://api.semanticscholar.org/CorpusID:216867622
- SOLVENT: A Mixed Initiative System for Finding Analogies between Research Papers. Proc. ACM Hum.-Comput. Interact. 2, CSCW, Article 31 (Nov. 2018), 21 pages. https://doi.org/10.1145/3274300
- CiteSee: Augmenting Citations in Scientific Papers with Persistent and Personalized Historical Context. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems. 1–15.
- Apolo: Interactive Large Graph Sensemaking by Combining Machine Learning and Visualization. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (San Diego, California, USA) (KDD ’11). Association for Computing Machinery, New York, NY, USA, 739–742. https://doi.org/10.1145/2020408.2020524
- Structural Scaffolds for Citation Intent Classification in Scientific Publications. ArXiv abs/1904.01608 (2019). https://api.semanticscholar.org/CorpusID:102483154
- SPECTER: Document-level Representation Learning using Citation-informed Transformers. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Online, 2270–2282. https://doi.org/10.18653/v1/2020.acl-main.207
- Lynne M. Connelly. 2013. Grounded theory. Medsurg nursing : official journal of the Academy of Medical-Surgical Nurses 22 2 (2013), 124, 127.
- Scim: Intelligent Skimming Support for Scientific Papers. Proceedings of the 28th International Conference on Intelligent User Interfaces (2022). https://api.semanticscholar.org/CorpusID:254591867
- Dedre Gentner and Russell Landers. 1985. ANALOGICAL REMINDING: A GOOD MATCH IS HARD TO FIND.. In Unknown Host Publication Title. IEEE, 607–613.
- Mary L Gick and Keith J Holyoak. 1980. Analogical problem solving. Cognitive psychology 12, 3 (1980), 306–355.
- Mary L. Gick and Keith J. Holyoak. 1983. Schema induction and analogical transfer. Cognitive Psychology 15, 1 (1983), 1 – 38. https://doi.org/10.1016/0010-0285(83)90002-6
- Towards multi-document summarization in the open-domain. https://api.semanticscholar.org/CorpusID:258865156
- Sandra G Hart and Lowell E Staveland. 1988. Development of NASA-TLX (Task Load Index): Results of empirical and theoretical research. In Advances in psychology. Vol. 52. Elsevier, 139–183.
- Tutorons: Generating context-relevant, on-demand explanations and demonstrations of online code. In 2015 IEEE Symposium on Visual Languages and Human-Centric Computing (VL/HCC). IEEE, 3–12.
- Augmenting scientific papers with just-in-time, position-sensitive definitions of terms and symbols. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems. 1–18.
- Paul Hemp. 2009. Death by information overload. Harvard business review 87 9 (2009), 82–9, 121. https://api.semanticscholar.org/CorpusID:584292
- Cochrane handbook for systematic reviews of interventions. (2008).
- Accelerating Innovation Through Analogy Mining. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (Halifax, NS, Canada) (KDD ’17). ACM, New York, NY, USA, 235–243. https://doi.org/10.1145/3097983.3098038
- From Who You Know to What You Read: Augmenting Scientific Recommendations with Implicit Social Networks. (2022), 1–16.
- Synergi: A Mixed-Initiative System for Scholarly Synthesis and Sensemaking. In Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology (San Francisco, CA, USA) (UIST ’23). Association for Computing Machinery, New York, NY, USA, 19 pages. https://doi.org/10.1145/3586183.3606759
- Threddy: An Interactive System for Personalized Thread-based Exploration and Organization of Scientific Literature. Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology (2022). https://api.semanticscholar.org/CorpusID:251402552
- Augmenting Scientific Creativity with Retrieval across Knowledge Domains. In Second Workshop on Bridging Human-Computer Interaction and Natural Language Processing at NAACL 2022. arXiv. https://doi.org/10.48550/ARXIV.2206.01328
- Augmenting Scientific Creativity with an Analogical Search Engine. ACM Trans. Comput.-Hum. Interact. (mar 2022). https://doi.org/10.1145/3530013 Just Accepted.
- ComLittee: Literature Discovery with Personal Elected Author Committees. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (Hamburg, Germany) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 738, 20 pages. https://doi.org/10.1145/3544548.3581371
- FeedLens: Polymorphic Lenses for Personalizing Exploratory Search over Knowledge Graphs (UIST ’22).
- DAPIE: Interactive Step-by-Step Explanatory Dialogues to Answer Children’s Why and How Questions. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems. 1–22.
- The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces. ArXiv abs/2303.14334 (2023). https://api.semanticscholar.org/CorpusID:257766269
- S2ORC: The Semantic Scholar Open Research Corpus. In Annual Meeting of the Association for Computational Linguistics. https://api.semanticscholar.org/CorpusID:215416146
- Explaining Relationships Between Scientific Documents. In Annual Meeting of the Association for Computational Linguistics. https://api.semanticscholar.org/CorpusID:236459799
- ACCoRD: A Multi-Document Approach to Generating Diverse Descriptions of Scientific Concepts. ArXiv abs/2205.06982 (2022). https://api.semanticscholar.org/CorpusID:248811750
- CSFCube - A Test Collection of Computer Science Research Articles for Faceted Query by Example. ArXiv abs/2103.12906 (2021). https://api.semanticscholar.org/CorpusID:232335540
- Citances: Citation sentences for semantic analysis of bioscience text. In Proceedings of the SIGIR, Vol. 4. Citeseer, 81–88.
- Elisabeth Pain. 2016. How to keep up with the scientific literature. Science (2016). https://api.semanticscholar.org/CorpusID:158399837
- Relatedly: Scaffolding Literature Reviews with Existing Related Work Sections. arXiv preprint arXiv:2302.06754 (2023).
- FoundWright: A System to Help People Re-find Pages from Their Web-history. ArXiv abs/2305.07930 (2023). https://api.semanticscholar.org/CorpusID:258685533
- AngleKindling: Supporting Journalistic Angle Ideation with Large Language Models. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (¡conf-loc¿, ¡city¿Hamburg¡/city¿, ¡country¿Germany¡/country¿, ¡/conf-loc¿) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 225, 16 pages. https://doi.org/10.1145/3544548.3580907
- PaperQuest: A visualization tool to support literature review. In Proceedings of the 2016 CHI Conference Extended Abstracts on Human Factors in Computing Systems. 2264–2271.
- Bursting Scientific Filter Bubbles: Boosting Innovation via Novel Author Discovery. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (New Orleans, LA, USA) (CHI ’22). Association for Computing Machinery, New York, NY, USA, Article 309, 13 pages. https://doi.org/10.1145/3491102.3501905
- CiteRead: Integrating Localized Citation Contexts into Scientific Paper Reading. In 27th International Conference on Intelligent User Interfaces. 707–719.
- Comparing Speech and Keyboard Text Entry for Short Messages in Two Languages on Touchscreen Phones. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 1, 4, Article 159 (jan 2018), 23 pages.
- LaMP: When Large Language Models Meet Personalization. arXiv preprint arXiv:2304.11406 (2023).
- SciRepEval: A Multi-Format Benchmark for Scientific Document Representations. ArXiv abs/2211.13308 (2022).
- Understanding and supporting academic literature review workflows with litsense. In Proceedings of the International Conference on Advanced Visual Interfaces. 1–5.
- H Holden Thorp. 2023. ChatGPT is fun, but not an author. , 313–313 pages.
- Axis: Generating explanations at scale with learnersourcing and machine learning. In Proceedings of the Third (2016) ACM Conference on Learning@ Scale. 379–388.
- Chris Woolston. 2019. PhDs: the tortuous truth. Nature 575 (2019), 403 – 406. https://api.semanticscholar.org/CorpusID:207986664
- C-Pack: Packaged Resources To Advance General Chinese Embedding. arXiv:2309.07597 [cs.CL]