
PatentEval: Understanding Errors in Patent Generation (2406.06589v2)

Published 5 Jun 2024 in cs.CL and cs.AI

Abstract: In this work, we introduce a comprehensive error typology specifically designed for evaluating two distinct tasks in machine-generated patent texts: claims-to-abstract generation, and the generation of the next claim given previous ones. We have also developed a benchmark, PatentEval, for systematically assessing LLMs in this context. Our study includes a comparative analysis, annotated by humans, of various models. These range from those specifically adapted during training for tasks within the patent domain to the latest general-purpose LLMs. Furthermore, we explored and evaluated some metrics to approximate human judgments in patent text evaluation, analyzing the extent to which these metrics align with expert assessments. These approaches provide valuable insights into the capabilities and limitations of current LLMs in the specialized field of patent text generation.

Authors (4)
  1. You Zuo (2 papers)
  2. Kim Gerdes (3 papers)
  3. Benoît Sagot (60 papers)
  4. Eric Villemonte de La Clergerie (1 paper)
Citations (1)