Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

h2oGPT: Democratizing Large Language Models (2306.08161v2)

Published 13 Jun 2023 in cs.CL, cs.AI, cs.HC, cs.IR, and cs.LG

Abstract: Applications built on top of LLMs such as GPT-4 represent a revolution in AI due to their human-level capabilities in natural language processing. However, they also pose many significant risks such as the presence of biased, private, or harmful text, and the unauthorized inclusion of copyrighted material. We introduce h2oGPT, a suite of open-source code repositories for the creation and use of LLMs based on Generative Pretrained Transformers (GPTs). The goal of this project is to create the world's best truly open-source alternative to closed-source approaches. In collaboration with and as part of the incredible and unstoppable open-source community, we open-source several fine-tuned h2oGPT models from 7 to 40 Billion parameters, ready for commercial use under fully permissive Apache 2.0 licenses. Included in our release is 100\% private document search using natural language. Open-source LLMs help boost AI development and make it more accessible and trustworthy. They lower entry hurdles, allowing people and groups to tailor these models to their needs. This openness increases innovation, transparency, and fairness. An open-source strategy is needed to share AI benefits fairly, and H2O.ai will continue to democratize AI and LLMs.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (15)
  1. Arno Candel (2 papers)
  2. Jon McKinney (2 papers)
  3. Philipp Singer (21 papers)
  4. Pascal Pfeiffer (7 papers)
  5. Maximilian Jeblick (6 papers)
  6. Prithvi Prabhu (1 paper)
  7. Jeff Gambera (1 paper)
  8. Mark Landry (5 papers)
  9. Shivam Bansal (2 papers)
  10. Ryan Chesler (2 papers)
  11. Chun Ming Lee (2 papers)
  12. Marcos V. Conde (99 papers)
  13. Pasha Stetsenko (1 paper)
  14. Olivier Grellier (1 paper)
  15. SriSatish Ambati (2 papers)
Citations (7)
Youtube Logo Streamline Icon: https://streamlinehq.com