Satyrn: A Platform for Analytics Augmented Generation (2406.12069v2)

Published 17 Jun 2024 in cs.CL

Abstract: LLMs are capable of producing documents, and retrieval augmented generation (RAG) has shown itself to be a powerful method for improving accuracy without sacrificing fluency. However, not all information can be retrieved from text. We propose an approach that uses the analysis of structured data to generate fact sets that are used to guide generation in much the same way that retrieved documents are used in RAG. This analytics augmented generation (AAG) approach supports the ability to utilize standard analytic techniques to generate facts that are then converted to text and passed to an LLM. We present a neurosymbolic platform, Satyrn, that leverages AAG to produce accurate, fluent, and coherent reports grounded in large scale databases. In our experiments, we find that Satyrn generates reports in which over 86% of claims are accurate while maintaining high levels of fluency and coherence, even when using smaller LLMs such as Mistral-7B, as compared to GPT-4 Code Interpreter in which just 57% of claims are accurate.
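
As an illustration of the AAG flow the abstract describes (analyze structured data, turn the results into text facts, pass those facts to an LLM), here is a minimal sketch. It is not Satyrn's implementation: the `ROWS` table, the `derive_facts` and `build_prompt` helpers, and the prompt wording are all hypothetical, and the LLM call itself is omitted so the example stays self-contained.

```python
from statistics import mean

# Hypothetical structured data standing in for a database table.
ROWS = [
    {"region": "North", "year": 2023, "sales": 120.0},
    {"region": "North", "year": 2024, "sales": 150.0},
    {"region": "South", "year": 2023, "sales": 200.0},
    {"region": "South", "year": 2024, "sales": 180.0},
]


def derive_facts(rows):
    """Run simple analytics and return each result as a plain-text fact."""
    facts = []
    for region in sorted({r["region"] for r in rows}):
        by_year = {r["year"]: r["sales"] for r in rows if r["region"] == region}
        first, last = min(by_year), max(by_year)
        # Percentage change between the earliest and latest year.
        change = (by_year[last] - by_year[first]) / by_year[first] * 100
        facts.append(
            f"Average sales in {region} were {mean(by_year.values()):.1f}."
        )
        facts.append(
            f"Sales in {region} changed by {change:+.1f}% from {first} to {last}."
        )
    return facts


def build_prompt(facts, request):
    """Place the fact set where retrieved documents would go in RAG."""
    fact_block = "\n".join(f"- {fact}" for fact in facts)
    return (
        "Write a short report using only the facts below.\n"
        f"Facts:\n{fact_block}\n"
        f"Request: {request}\n"
    )


facts = derive_facts(ROWS)
prompt = build_prompt(facts, "Summarize regional sales trends.")
```

The resulting `prompt` string would then be sent to whichever LLM is in use. Because the numbers are computed by ordinary analytics rather than by the model, the model only has to verbalize them.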

Authors (7)
  1. Marko Sterbentz (4 papers)
  2. Cameron Barrie (3 papers)
  3. Shubham Shahi (2 papers)
  4. Abhratanu Dutta (4 papers)
  5. Donna Hooshmand (2 papers)
  6. Harper Pack (3 papers)
  7. Kristian J. Hammond (2 papers)