Logic Mill -- A Knowledge Navigation System (2301.00200v2)
Abstract: Logic Mill is a scalable, openly accessible software system that identifies semantically similar documents within a single domain-specific corpus or across multi-domain corpora. It uses advanced natural language processing (NLP) techniques to generate numerical representations of documents; currently, these representations are derived from a large pretrained language model. The system focuses on scientific publications and patent documents and contains more than 200 million documents. It is easily accessible via a simple Application Programming Interface (API) or via a web interface, is continuously updated, and can be extended to text corpora from other domains. We see this system as a general-purpose tool for future research applications in the social sciences and other fields.
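The core mechanism the abstract describes — representing each document as a numerical vector and ranking other documents by vector similarity — can be sketched as follows. This is a minimal illustration with toy four-dimensional vectors and plain cosine similarity; the system's actual embeddings come from a large pretrained language model and are searched at scale, not computed like this.

```python
from math import sqrt

def cosine_similarity(a, b):
    """Cosine similarity between two document embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Toy "embeddings" for three documents (hypothetical values for illustration).
doc_patent = [0.9, 0.1, 0.3, 0.0]   # e.g. a patent document
doc_paper  = [0.8, 0.2, 0.4, 0.1]   # a semantically related publication
doc_other  = [0.0, 0.9, 0.1, 0.8]   # an unrelated document

# The related pair scores higher than the unrelated pair,
# which is how nearest-neighbour retrieval ranks candidates.
print(cosine_similarity(doc_patent, doc_paper))
print(cosine_similarity(doc_patent, doc_other))
```

In practice, systems of this kind precompute embeddings for the whole corpus and use an approximate nearest-neighbour index rather than pairwise comparison, since exact search over hundreds of millions of vectors would be too slow.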