Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Classifying Organizations for Food System Ontologies using Natural Language Processing (2309.10880v1)

Published 19 Sep 2023 in cs.CL, cs.AI, cs.CY, and cs.IR

Abstract: Our research explores the use of NLP methods to automatically classify entities for the purpose of knowledge graph population and integration with food system ontologies. We have created NLP models that can automatically classify organizations with respect to categories associated with environmental issues as well as Standard Industrial Classification (SIC) codes, which are used by the U.S. government to characterize business activities. As input, the NLP models are provided with text snippets retrieved by the Google search engine for each organization, which serves as a textual description of the organization that is used for learning. Our experimental results show that NLP models can achieve reasonably good performance for these two classification tasks, and they rely on a general framework that could be applied to many other classification problems as well. We believe that NLP models represent a promising approach for automatically harvesting information to populate knowledge graphs and aligning the information with existing ontologies through shared categories and concepts.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (10)
  1. Tianyu Jiang (13 papers)
  2. Sonia Vinogradova (1 paper)
  3. Nathan Stringham (3 papers)
  4. E. Louise Earl (1 paper)
  5. Allan D. Hollander (1 paper)
  6. Patrick R. Huber (1 paper)
  7. Ellen Riloff (6 papers)
  8. R. Sandra Schillo (1 paper)
  9. Giorgio A. Ubbiali (1 paper)
  10. Matthew Lange (3 papers)

Summary

We haven't generated a summary for this paper yet.