Identifying and Consolidating Knowledge Engineering Requirements (2306.15124v1)
Abstract: Knowledge engineering is the process of creating and maintaining knowledge-producing systems. Throughout the history of computer science and AI, knowledge engineering workflows have been widely used because high-quality knowledge is assumed to be crucial for reliable intelligent agents. However, the landscape of knowledge engineering has changed, presenting four challenges: unaddressed stakeholder requirements, mismatched technologies, adoption barriers for new organizations, and misalignment with software engineering practices. In this paper, we propose to address these challenges by developing a reference architecture using a mainstream software methodology. By studying the requirements of different stakeholders and eras, we identify 23 essential quality attributes for evaluating reference architectures. We assess three candidate architectures from recent literature based on these attributes. Finally, we discuss the next steps towards a comprehensive reference architecture, including prioritizing quality attributes, integrating components with complementary strengths, and supporting missing socio-technical requirements. As this endeavor requires a collaborative effort, we invite all knowledge engineering researchers and practitioners to join us.
- A review on language models as knowledge bases. arXiv preprint arXiv:2204.06031 (2022).
- A classification of software reference architectures: Analyzing their success and effectiveness. In 2009 Joint Working IEEE/IFIP Conference on Software Architecture & European Conference on Software Architecture. IEEE, 141–150.
- Pouya Ataei and Alan Litchfield. 2022. The state of big data reference architectures: a systematic literature review. IEEE Access (2022).
- ReFinED: An Efficient Zero-shot-capable Approach to End-to-End Entity Linking. arXiv preprint arXiv:2207.04108 (2022).
- Software architecture in practice (fourth ed.). Addison-Wesley Professional.
- On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?. In Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency (Virtual Event, Canada) (FAccT ’21). Association for Computing Machinery, New York, NY, USA, 610–623. https://doi.org/10.1145/3442188.3445922
- The semantic web. Scientific american 284, 5 (2001), 34–43.
- Emerging Architectures for Modern Data Infrastructure. https://future.com/emerging-architectures-modern-data-infrastructure/. Accessed: 2022-12-02.
- The concept of reference architectures. Systems Engineering 13, 1 (2010), 14–27.
- Xin Luna Dong. 2018. Challenges and innovations in building a product knowledge graph. In Proceedings of the 24th ACM SIGKDD International conference on knowledge discovery & data mining. 2869–2869.
- Edward A Feigenbaum. 1977. The art of artificial intelligence: Themes and case studies of knowledge engineering. In Proceedings of the Fifth International Joint Conference on Artificial Intelligence, Vol. 2. Boston.
- Edward A Feigenbaum. 1992. A personal view of expert systems: Looking back and looking ahead. Knowledge Systems Laboratory, Department of Computer Science, Stanford ….
- Three decades of software reference architectures: A systematic mapping study. Journal of Systems and Software 179 (2021), 111004.
- Olaf Hartig. 2022. ”Reflections on Linked Data Querying and other Related Topics”. https://olafhartig.de/slides/Slides-DKG-SWSA-Talk.pdf. Accessed: 2022-03-17.
- Semantic web for the working ontologist: Effective modeling for linked data, RDFS, and OWL. Morgan & Claypool.
- James A Hendler. 2009. Tonight’s Dessert: Semantic Web Layer Cakes. In European Semantic Web Conference. Springer, 1–1.
- Aidan Hogan. 2020. The semantic web: Two decades on. Semantic Web 11, 1 (2020), 169–185.
- LOTUS: Adaptive text search for big linked data. In European Semantic Web Conference. Springer, 470–485.
- KGTK: a toolkit for large knowledge graph manipulation and analysis. In International Semantic Web Conference. Springer, Cham, 278–293.
- Democratising Knowledge Representation with BioCypher. arXiv preprint arXiv:2212.13543 (2022).
- Achieving Knowledge-as-a-Service in IIoT-driven smart manufacturing: A crowdsourcing-based continuous enrichment method for Industrial Knowledge Graph. Advanced Engineering Informatics 51 (2022), 101494.
- Benefits and drawbacks of software reference architectures: A case study. Information and software technology 88 (2017), 37–52.
- Aggregating empirical evidence about the benefits and drawbacks of software reference architectures. In 2015 ACM/IEEE international symposium on empirical software engineering and measurement (ESEM). IEEE, 1–10.
- A software reference architecture for semantic-aware Big Data systems. Information and software technology 90 (2017), 75–92.
- An aspect-oriented reference architecture for software engineering environments. Journal of Systems and Software 84, 10 (2011), 1670–1684.
- Elements of a theory of human problem solving. Psychological review 65, 3 (1958), 151.
- Industry-scale Knowledge Graphs: Lessons and Challenges: Five diverse technology companies show how it’s done. Queue 17, 2 (2019), 48–75.
- Protégé-2000: an open-source ontology-development and knowledge-acquisition environment.. In AMIA… annual symposium proceedings. AMIA Symposium. 953–953.
- Pekka Pääkkönen and Daniel Pakkala. 2020. Extending reference architecture of big data systems towards machine learning in edge computing environments. Journal of Big Data 7, 1 (2020), 1–29.
- Language models as knowledge bases? arXiv preprint arXiv:1909.01066 (2019).
- F.P. Ramsey. 1929. Knowledge. In F.P. Ramsey: Philosophical Papers. Cambridge University Press, 110–111.
- Knowledge engineering and management: the CommonKADS methodology. MIT press.
- Gytė Tamašauskaitė and Paul Groth. 2022. Defining a Knowledge Graph Development Process Through a Systematic Review. ACM Transactions on Software Engineering and Methodology (2022).
- Software architecture: foundations, theory, and practice. John Wiley & Sons, Inc.
- WDQS Search Team. 2022. ”WDQS Backend Alternatives: The Process, Details and Results”. https://www.wikidata.org/wiki/File:WDQS_Backend_Alternatives_working_paper.pdf. Accessed: 2022-08-15.
- Karim Tharani. 2021. Much more than a mere technology: A systematic review of Wikidata in libraries. The Journal of Academic Librarianship 47, 2 (2021), 102326.
- Google Books Ngram Viewer. 2022. ”Percentage of n-grams in books published in English between 1955 and 2019 that are ‘expert systems’, ‘ontology’, ‘metadata’, or ‘machine learning’.”. https://books.google.com/ngrams/graph?content=expert+systems%2Contology%2Cmachine+learning%2Cmetadata&year_start=1955&year_end=2019&corpus=26&smoothing=1. Accessed: 2022-08-10.
- Haohan Wang and Bhiksha Raj. 2017. On the origin of deep learning. arXiv preprint arXiv:1702.07800 (2017).
- Kgxboard: Explainable and interactive leaderboard for evaluation of knowledge graph completion models. arXiv preprint arXiv:2208.11024 (2022).
- All You Need to Know to Build a Product Knowledge Graph. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 4090–4091.