Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Commonsense Properties from Query Logs and Question Answering Forums (1905.10989v4)

Published 27 May 2019 in cs.CL, cs.AI, and cs.DB

Abstract: Commonsense knowledge about object properties, human behavior and general concepts is crucial for robust AI applications. However, automatic acquisition of this knowledge is challenging because of sparseness and bias in online sources. This paper presents Quasimodo, a methodology and tool suite for distilling commonsense properties from non-standard web sources. We devise novel ways of tapping into search-engine query logs and QA forums, and combining the resulting candidate assertions with statistical cues from encyclopedias, books and image tags in a corroboration step. Unlike prior work on commonsense knowledge bases, Quasimodo focuses on salient properties that are typically associated with certain objects or concepts. Extensive evaluations, including extrinsic use-case studies, show that Quasimodo provides better coverage than state-of-the-art baselines with comparable quality.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Julien Romero (7 papers)
  2. Simon Razniewski (49 papers)
  3. Koninika Pal (3 papers)
  4. Jeff Z. Pan (78 papers)
  5. Archit Sakhadeo (4 papers)
  6. Gerhard Weikum (75 papers)
Citations (60)

Summary

We haven't generated a summary for this paper yet.