Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
169 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

An Ecosystem for Personal Knowledge Graphs: A Survey and Research Roadmap (2304.09572v2)

Published 19 Apr 2023 in cs.AI and cs.IR

Abstract: This paper presents an ecosystem for personal knowledge graphs (PKGs), commonly defined as resources of structured information about entities related to an individual, their attributes, and the relations between them. PKGs are a key enabler of secure and sophisticated personal data management and personalized services. However, there are challenges that need to be addressed before PKGs can achieve widespread adoption. One of the fundamental challenges is the very definition of what constitutes a PKG, as there are multiple interpretations of the term. We propose our own definition of a PKG, emphasizing the aspects of (1) data ownership by a single individual and (2) the delivery of personalized services as the primary purpose. We further argue that a holistic view of PKGs is needed to unlock their full potential, and propose a unified framework for PKGs, where the PKG is a part of a larger ecosystem with clear interfaces towards data services and data sources. A comprehensive survey and synthesis of existing work is conducted, with a mapping of the surveyed work into the proposed unified ecosystem. Finally, we identify open challenges and research opportunities for the ecosystem as a whole, as well as for the specific aspects of PKGs, which include population, representation and management, and utilization.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (74)
  1. Information archiving with bookmarks: Personal web space construction and organization, in: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 41–48.
  2. Improving web search ranking by incorporating user behavior information, in: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 19–26.
  3. Using a personal health library–enabled mhealth recommender system for self-management of diabetes among underserved populations: Use case for knowledge graphs and linked data. JMIR Formative Research 5, e24738.
  4. Dbpedia: A nucleus for a web of open data, in: The Semantic Web: 6th International Semantic Web Conference, 2nd Asian Semantic Web Conference, ISWC 2007+ ASWC 2007, Busan, Korea, November 11-15, 2007. Proceedings, Springer. pp. 722–735.
  5. Entity-Oriented Search. Springer Publishing Company, Incorporated.
  6. Personal knowledge graphs: A research agenda, in: Proceedings of the 2019 ACM SIGIR International Conference on Theory of Information Retrieval, pp. 217–220.
  7. Taking email to task: The design and evaluation of a task management centered email tool, in: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 345–352.
  8. Improved search engines and navigation preference in personal information management. ACM Trans. Inf. Syst. 26.
  9. Personal information management, in: CHI ’04 Extended Abstracts on Human Factors in Computing Systems, pp. 1598–1599.
  10. Folder versus tag preference in personal information management. J. Am. Soc. Inf. Sci. Technol. 64, 1995–2012.
  11. The use of attention resources in navigation versus search. Personal Ubiquitous Comput. 17, 583–590.
  12. PKG API: A tool for personal knowledge graph management. arXiv:2402.07540.
  13. The semantic web. Scientific American 284, 34–43.
  14. “Stuff goes into the computer and doesn’t come out”: A cross-tool study of personal information management, in: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 583–590.
  15. Freebase: A collaboratively created graph database for structuring human knowledge, in: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, p. 1247–1250.
  16. As we may think. Atlantic Monthly 176, 641–649.
  17. Work and personal e-mail use by university employees: Pim practices across domain boundaries. J. Am. Soc. Inf. Sci. Technol. 64, 1029–1044.
  18. Personal research knowledge graphs, in: Companion Proceedings of the Web Conference 2022, pp. 763–768.
  19. Better to organize personal information by folders or by tags?: The devil is in the details, in: Proceedings of the American Society for Information Science and Technology, pp. 1–13.
  20. Neural open information extraction, in: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 407–413.
  21. A large-scale evaluation and analysis of personalized search strategies, in: Proceedings of the 16th International Conference on World Wide Web, pp. 581–590.
  22. Identifying relations for open information extraction, in: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 1535–1545.
  23. Role-based Access Control. Artech House.
  24. An empirical characterisation of file retrieval. Int. J. Hum.-Comput. Stud. 74, 1–13.
  25. A neural architecture for person ontology population. arXiv:2001.08013.
  26. Automatic typing of dbpedia entities, in: Proceedings of the 11th International Conference on The Semantic Web, pp. 65–81.
  27. Bias in conversational search: The double-edged sword of the personalized knowledge graph, in: Proceedings of the 2020 ACM SIGIR on International Conference on Theory of Information Retrieval, pp. 133–136.
  28. The nepomuk project-on the way to the social semantic desktop, in: International Conference on Semantic Technologies: I-Semantics 2007.
  29. Lifelogging: Personal big data. Found. Trends Inf. Retr. 8, 1–125.
  30. Personalized health knowledge graph, in: Joint Proceedings of the International Workshops on Contextualized Knowledge Graphs, and Semantic Statistics co-located with 17th International Semantic Web Conference.
  31. Lost in email: Pulling users down a path of interaction, in: Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, pp. 3981–3984.
  32. Rdf* and sparql*: An alternative approach to annotate statements in RDF, in: Proceedings of the ISWC 2017 Posters & Demonstrations and Industry Tracks co-located with 16th International Semantic Web Conference (ISWC 2017), CEUR-WS.org.
  33. Linked data: Evolving the web into a global data space. Morgan & Claypool Publishers.
  34. Knowledge Graphs. Number 22 in Synthesis Lectures on Data, Semantics, and Knowledge, Morgan & Claypool. URL: https://kgbook.org/, doi:10.2200/S01125ED1V01Y202109DSK022.
  35. The even more irresistible sroiq , 57–67.
  36. OntoNotes: The 90% solution, in: Proceedings of the Human Language Technology Conference of the NAACL, pp. 57–60.
  37. Open research knowledge graph: Next generation infrastructure for semantic scholarly knowledge, in: Proceedings of the 10th International Conference on Knowledge Capture, p. 243–246.
  38. Knowledge base population: Successful approaches and challenges, in: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pp. 1148–1158.
  39. Report on the NSF-sponsored workshop on Personal Information Management, Seattle, WA, 2005. Personal Information Management 2005: A Special Workshop Sponsored by the National Science Foundation .
  40. Personal information management, in: Encyclopedia of Library and Information Science, Fourth Edition. CRC Press. doi:10.1081/E-ELIS4-120053695.
  41. Once found, what then? a study of “keeping” behaviors in the personal use of web information, in: Proceedings of the American Society for Information Science and Technology, pp. 391–402.
  42. No noun phrase left behind: Detecting and typing unlinkable entities, in: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 893–903.
  43. Goal-oriented end-to-end conversational models with profile features in a real-world setting, in: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Industry Papers), pp. 48–55.
  44. Learning personalized end-to-end goal-oriented dialog, in: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 6794–6801.
  45. Paraconsistent owl and related logics. Semantic Web 4, 395–427.
  46. Personalizing web search using long term browsing history, in: Proceedings of the Fourth ACM International Conference on Web Search and Data Mining, pp. 25–34.
  47. Training millions of personalized dialogue agents, in: The 2018 Conference on Empirical Methods in Natural Language Processing.
  48. Echo chambers and epistemic bubbles. Episteme 17, 141–161.
  49. Understanding the privacy-personalization dilemma for web search: A user perspective, in: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 3427–3430.
  50. Web-scale distributional similarity and entity set expansion, in: Proceeding of the 2009 Conference on Empirical Methods in Natural Language Processing, pp. 938–947.
  51. What you seek is what you get: Extraction of class attributes from query logs, in: Proceedings of the 20th International Joint Conference on Artificial Intelligence, pp. 2832–2837.
  52. Personal health knowledge graphs for patients arXiv:2004.00071.
  53. Solid: a platform for decentralized social applications based on linked data. MIT CSAIL & Qatar Computing Research Institute, Tech. Rep. .
  54. Information extraction. Found. Trends Databases 1, 261–377. URL: https://doi.org/10.1561/1900000003, doi:10.1561/1900000003.
  55. Personal health knowledge graph for clinically relevant diet recommendations. arXiv cs.HC/2110.10131.
  56. Implicit user modeling for personalized search, in: Proceedings of the 14th ACM International Conference on Information and Knowledge Management, pp. 824–831.
  57. Privacy protection in personalized search. SIGIR Forum 41, 4–17.
  58. Probabilistic models for personalizing web search, in: Proceedings of the Fifth ACM International Conference on Web Search and Data Mining, pp. 433–442.
  59. The perfect search engine is not enough: A study of orienteering behavior in directed search, in: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 415–422.
  60. Personalizing search via automated analysis of interests and activities, in: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 449–456.
  61. Potential for personalization. ACM Trans. Comput. Hum. Interact. 17, 4:1–4:31.
  62. Discovering and using groups to improve personalized search, in: Proceedings of the Second ACM International Conference on Web Search and Data Mining, pp. 15–24.
  63. Listening between the lines: Learning personal attributes from conversations, in: The World Wide Web Conference, pp. 1818–1828.
  64. Charm: Inferring personal attributes from conversations, in: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 5391–5404.
  65. Personal Knowledge Graphs (PKGs): Methodology, tools and applications. The Institution of Engineering and Technology.
  66. Data augmentation for fairness in personal knowledge base population, in: Trends and Applications in Knowledge Discovery and Data Mining, pp. 143–152.
  67. Tagging might not be slower than filing in folders, in: CHI ’12 Extended Abstracts on Human Factors in Computing Systems, pp. 2063–2068.
  68. Wikidata: a free collaborative knowledgebase. Communications of the ACM 57, 78–85.
  69. Machine knowledge: Creation and curation of comprehensive knowledge bases. Found. Trends Databases 10, 108–490.
  70. Interactions with search systems. Cambridge University Press.
  71. Am i wasting my time organizing email? a study of email refinding, in: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 3449–3458.
  72. Transfer of frames from English FrameNet to construct Chinese FrameNet: A bilingual corpus-based approach, in: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018).
  73. Personal knowledge base construction from text-based lifelogs, in: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 185–194.
  74. Personalizing dialogue agents: I have a dog, do you have pets too?, in: The 56th Annual Meeting of the Association for Computational Linguistics.
Citations (9)

Summary

We haven't generated a summary for this paper yet.