
GPTZoo: A Large-scale Dataset of GPTs for the Research Community (2405.15630v1)

Published 24 May 2024 in cs.SE

Abstract: The rapid advancements in LLMs have revolutionized natural language processing, with GPTs, customized versions of ChatGPT available on the GPT Store, emerging as a prominent technology for specific domains and tasks. To support academic research on GPTs, we introduce GPTZoo, a large-scale dataset comprising 730,420 GPT instances. Each instance includes rich metadata with 21 attributes describing its characteristics, as well as instructions, knowledge files, and third-party services utilized during its development. GPTZoo aims to provide researchers with a comprehensive and readily available resource to study the real-world applications, performance, and potential of GPTs. To facilitate efficient retrieval and analysis of GPTs, we also developed an automated command-line interface (CLI) that supports keyword-based searching of the dataset. To promote open research and innovation, the GPTZoo dataset will undergo continuous updates, and we are granting researchers public access to GPTZoo and its associated tools.
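
The abstract mentions a CLI that supports keyword-based searching of the dataset but does not describe its interface. As a rough illustration of what keyword-based retrieval over GPT metadata could look like, here is a minimal sketch. The record fields (`gpt_id`, `name`, `description`, `instructions`), the sample entries, and the `keyword_search` function are all hypothetical and are not taken from GPTZoo or its tooling.

```python
from dataclasses import dataclass


@dataclass
class GPTRecord:
    # Hypothetical subset of metadata. The abstract says each instance has
    # 21 attributes but does not name them, so these fields are assumptions.
    gpt_id: str
    name: str
    description: str
    instructions: str = ""


def keyword_search(records, keyword, fields=("name", "description", "instructions")):
    """Return records whose selected text fields contain the keyword.

    Matching is a case-insensitive substring test; this is an assumed
    behavior, not the documented behavior of the GPTZoo CLI.
    """
    needle = keyword.casefold()
    return [
        record
        for record in records
        if any(needle in getattr(record, name).casefold() for name in fields)
    ]


# Made-up sample data for illustration only.
SAMPLE = [
    GPTRecord("g-001", "SQL Helper", "Writes and explains SQL queries."),
    GPTRecord("g-002", "Travel Planner", "Builds itineraries.",
              instructions="Ask about budget first."),
    GPTRecord("g-003", "Code Reviewer", "Reviews Python and SQL code."),
]

if __name__ == "__main__":
    for hit in keyword_search(SAMPLE, "sql"):
        print(hit.gpt_id, hit.name)
```

Restricting `fields` narrows the search, for example to names only; a real tool over 730,420 instances would more likely query an index or database than scan a list in memory.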

Authors (4)
  1. Xinyi Hou (16 papers)
  2. Yanjie Zhao (39 papers)
  3. Shenao Wang (15 papers)
  4. Haoyu Wang (309 papers)
Citations (4)
