Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
173 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

PaperWeaver: Enriching Topical Paper Alerts by Contextualizing Recommended Papers with User-collected Papers (2403.02939v2)

Published 5 Mar 2024 in cs.DL, cs.AI, cs.CL, and cs.HC

Abstract: With the rapid growth of scholarly archives, researchers subscribe to "paper alert" systems that periodically provide them with recommendations of recently published papers that are similar to previously collected papers. However, researchers sometimes struggle to make sense of nuanced connections between recommended papers and their own research context, as existing systems only present paper titles and abstracts. To help researchers spot these connections, we present PaperWeaver, an enriched paper alerts system that provides contextualized text descriptions of recommended papers based on user-collected papers. PaperWeaver employs a computational method based on LLMs to infer users' research interests from their collected papers, extract context-specific aspects of papers, and compare recommended and collected papers on these aspects. Our user study (N=15) showed that participants using PaperWeaver were able to better understand the relevance of recommended papers and triage them more confidently when compared to a baseline that presented the related work sections from recommended papers.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (56)
  1. Shaaron Ainsworth. 2006. DeFT: A conceptual framework for considering learning with multiple representations. Learning and instruction 16, 3 (2006), 183–198.
  2. Shaaron Ainsworth. 2008. The educational value of multiple-representations when learning complex scientific concepts. In Visualization: Theory and practice in science education. Springer, 191–208.
  3. Anthropic. 2023. Introducing Claude 2.1. https://www.anthropic.com/index/claude-2-1 Accessed: 2023-11-21.
  4. David Paul Ausubel. 2012. The acquisition and retention of knowledge: A cognitive view. Springer Science & Business Media.
  5. A multitask, multilingual, multimodal evaluation of chatgpt on reasoning, hallucination, and interactivity. arXiv preprint arXiv:2302.04023 (2023).
  6. SciBERT: A Pretrained Language Model for Scientific Text. In Conference on Empirical Methods in Natural Language Processing. https://api.semanticscholar.org/CorpusID:202558505
  7. Michael J. Black. 2022. Michael J. Black on Twitter. https://twitter.com/Michael_J_Black/status/1593133722316189696 Accessed: 2023-03-28.
  8. Ann M Blair. 2010. Too much to know: Managing scholarly information before the modern age. Yale University Press.
  9. Richard E. Boyatzis. 1998. Transforming Qualitative Information: Thematic Analysis and Code Development.
  10. TLDR: Extreme Summarization of Scientific Documents. ArXiv abs/2004.15011 (2020). https://api.semanticscholar.org/CorpusID:216867622
  11. SOLVENT: A Mixed Initiative System for Finding Analogies between Research Papers. Proc. ACM Hum.-Comput. Interact. 2, CSCW, Article 31 (Nov. 2018), 21 pages. https://doi.org/10.1145/3274300
  12. CiteSee: Augmenting Citations in Scientific Papers with Persistent and Personalized Historical Context. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems. 1–15.
  13. Apolo: Interactive Large Graph Sensemaking by Combining Machine Learning and Visualization. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (San Diego, California, USA) (KDD ’11). Association for Computing Machinery, New York, NY, USA, 739–742. https://doi.org/10.1145/2020408.2020524
  14. Structural Scaffolds for Citation Intent Classification in Scientific Publications. ArXiv abs/1904.01608 (2019). https://api.semanticscholar.org/CorpusID:102483154
  15. SPECTER: Document-level Representation Learning using Citation-informed Transformers. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Online, 2270–2282. https://doi.org/10.18653/v1/2020.acl-main.207
  16. Lynne M. Connelly. 2013. Grounded theory. Medsurg nursing : official journal of the Academy of Medical-Surgical Nurses 22 2 (2013), 124, 127.
  17. Scim: Intelligent Skimming Support for Scientific Papers. Proceedings of the 28th International Conference on Intelligent User Interfaces (2022). https://api.semanticscholar.org/CorpusID:254591867
  18. Dedre Gentner and Russell Landers. 1985. ANALOGICAL REMINDING: A GOOD MATCH IS HARD TO FIND.. In Unknown Host Publication Title. IEEE, 607–613.
  19. Mary L Gick and Keith J Holyoak. 1980. Analogical problem solving. Cognitive psychology 12, 3 (1980), 306–355.
  20. Mary L. Gick and Keith J. Holyoak. 1983. Schema induction and analogical transfer. Cognitive Psychology 15, 1 (1983), 1 – 38. https://doi.org/10.1016/0010-0285(83)90002-6
  21. Towards multi-document summarization in the open-domain. https://api.semanticscholar.org/CorpusID:258865156
  22. Sandra G Hart and Lowell E Staveland. 1988. Development of NASA-TLX (Task Load Index): Results of empirical and theoretical research. In Advances in psychology. Vol. 52. Elsevier, 139–183.
  23. Tutorons: Generating context-relevant, on-demand explanations and demonstrations of online code. In 2015 IEEE Symposium on Visual Languages and Human-Centric Computing (VL/HCC). IEEE, 3–12.
  24. Augmenting scientific papers with just-in-time, position-sensitive definitions of terms and symbols. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems. 1–18.
  25. Paul Hemp. 2009. Death by information overload. Harvard business review 87 9 (2009), 82–9, 121. https://api.semanticscholar.org/CorpusID:584292
  26. Cochrane handbook for systematic reviews of interventions. (2008).
  27. Accelerating Innovation Through Analogy Mining. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (Halifax, NS, Canada) (KDD ’17). ACM, New York, NY, USA, 235–243. https://doi.org/10.1145/3097983.3098038
  28. From Who You Know to What You Read: Augmenting Scientific Recommendations with Implicit Social Networks. (2022), 1–16.
  29. Synergi: A Mixed-Initiative System for Scholarly Synthesis and Sensemaking. In Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology (San Francisco, CA, USA) (UIST ’23). Association for Computing Machinery, New York, NY, USA, 19 pages. https://doi.org/10.1145/3586183.3606759
  30. Threddy: An Interactive System for Personalized Thread-based Exploration and Organization of Scientific Literature. Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology (2022). https://api.semanticscholar.org/CorpusID:251402552
  31. Augmenting Scientific Creativity with Retrieval across Knowledge Domains. In Second Workshop on Bridging Human-Computer Interaction and Natural Language Processing at NAACL 2022. arXiv. https://doi.org/10.48550/ARXIV.2206.01328
  32. Augmenting Scientific Creativity with an Analogical Search Engine. ACM Trans. Comput.-Hum. Interact. (mar 2022). https://doi.org/10.1145/3530013 Just Accepted.
  33. ComLittee: Literature Discovery with Personal Elected Author Committees. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (Hamburg, Germany) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 738, 20 pages. https://doi.org/10.1145/3544548.3581371
  34. FeedLens: Polymorphic Lenses for Personalizing Exploratory Search over Knowledge Graphs (UIST ’22).
  35. DAPIE: Interactive Step-by-Step Explanatory Dialogues to Answer Children’s Why and How Questions. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems. 1–22.
  36. The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces. ArXiv abs/2303.14334 (2023). https://api.semanticscholar.org/CorpusID:257766269
  37. S2ORC: The Semantic Scholar Open Research Corpus. In Annual Meeting of the Association for Computational Linguistics. https://api.semanticscholar.org/CorpusID:215416146
  38. Explaining Relationships Between Scientific Documents. In Annual Meeting of the Association for Computational Linguistics. https://api.semanticscholar.org/CorpusID:236459799
  39. ACCoRD: A Multi-Document Approach to Generating Diverse Descriptions of Scientific Concepts. ArXiv abs/2205.06982 (2022). https://api.semanticscholar.org/CorpusID:248811750
  40. CSFCube - A Test Collection of Computer Science Research Articles for Faceted Query by Example. ArXiv abs/2103.12906 (2021). https://api.semanticscholar.org/CorpusID:232335540
  41. Citances: Citation sentences for semantic analysis of bioscience text. In Proceedings of the SIGIR, Vol. 4. Citeseer, 81–88.
  42. Elisabeth Pain. 2016. How to keep up with the scientific literature. Science (2016). https://api.semanticscholar.org/CorpusID:158399837
  43. Relatedly: Scaffolding Literature Reviews with Existing Related Work Sections. arXiv preprint arXiv:2302.06754 (2023).
  44. FoundWright: A System to Help People Re-find Pages from Their Web-history. ArXiv abs/2305.07930 (2023). https://api.semanticscholar.org/CorpusID:258685533
  45. AngleKindling: Supporting Journalistic Angle Ideation with Large Language Models. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (¡conf-loc¿, ¡city¿Hamburg¡/city¿, ¡country¿Germany¡/country¿, ¡/conf-loc¿) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 225, 16 pages. https://doi.org/10.1145/3544548.3580907
  46. PaperQuest: A visualization tool to support literature review. In Proceedings of the 2016 CHI Conference Extended Abstracts on Human Factors in Computing Systems. 2264–2271.
  47. Bursting Scientific Filter Bubbles: Boosting Innovation via Novel Author Discovery. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (New Orleans, LA, USA) (CHI ’22). Association for Computing Machinery, New York, NY, USA, Article 309, 13 pages. https://doi.org/10.1145/3491102.3501905
  48. CiteRead: Integrating Localized Citation Contexts into Scientific Paper Reading. In 27th International Conference on Intelligent User Interfaces. 707–719.
  49. Comparing Speech and Keyboard Text Entry for Short Messages in Two Languages on Touchscreen Phones. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 1, 4, Article 159 (jan 2018), 23 pages.
  50. LaMP: When Large Language Models Meet Personalization. arXiv preprint arXiv:2304.11406 (2023).
  51. SciRepEval: A Multi-Format Benchmark for Scientific Document Representations. ArXiv abs/2211.13308 (2022).
  52. Understanding and supporting academic literature review workflows with litsense. In Proceedings of the International Conference on Advanced Visual Interfaces. 1–5.
  53. H Holden Thorp. 2023. ChatGPT is fun, but not an author. , 313–313 pages.
  54. Axis: Generating explanations at scale with learnersourcing and machine learning. In Proceedings of the Third (2016) ACM Conference on Learning@ Scale. 379–388.
  55. Chris Woolston. 2019. PhDs: the tortuous truth. Nature 575 (2019), 403 – 406. https://api.semanticscholar.org/CorpusID:207986664
  56. C-Pack: Packaged Resources To Advance General Chinese Embedding. arXiv:2309.07597 [cs.CL]
Citations (6)

Summary

We haven't generated a summary for this paper yet.