Retrieval-Augmented Chain-of-Thought in Semi-structured Domains (2310.14435v1)

Published 22 Oct 2023 in cs.CL and cs.AI

Abstract: Applying existing question answering (QA) systems to specialized domains like law and finance presents challenges that necessitate domain expertise. Although LLMs have shown impressive language comprehension and in-context learning capabilities, their inability to handle very long inputs/contexts is well known. Tasks specific to these domains need significant background knowledge, leading to contexts that can often exceed the maximum length that existing LLMs can process. This study explores leveraging the semi-structured nature of legal and financial data to efficiently retrieve relevant context, enabling the use of LLMs for domain-specialized QA. The resulting system outperforms contemporary models and also provides useful explanations for the answers, encouraging the integration of LLMs into legal and financial NLP systems for future research.

Authors (3)
  1. Vaibhav Mavi (4 papers)
  2. Abulhair Saparov (17 papers)
  3. Chen Zhao (249 papers)
Citations (3)