GPTZoo: A Large-scale Dataset of GPTs for the Research Community (2405.15630v1)
Abstract: The rapid advancements in LLMs have revolutionized natural language processing, with GPTs, customized versions of ChatGPT available on the GPT Store, emerging as a prominent technology for specific domains and tasks. To support academic research on GPTs, we introduce GPTZoo, a large-scale dataset comprising 730,420 GPT instances. Each instance includes rich metadata with 21 attributes describing its characteristics, as well as instructions, knowledge files, and third-party services utilized during its development. GPTZoo aims to provide researchers with a comprehensive and readily available resource to study the real-world applications, performance, and potential of GPTs. To facilitate efficient retrieval and analysis of GPTs, we also developed an automated command-line interface (CLI) that supports keyword-based searching of the dataset. To promote open research and innovation, the GPTZoo dataset will undergo continuous updates, and we are granting researchers public access to GPTZoo and its associated tools.
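The keyword-based searching mentioned in the abstract can be illustrated with a minimal sketch. This is not the GPTZoo CLI itself: the record fields (`id`, `name`, `description`), the sample data, and the `search_gpts` helper are all hypothetical, chosen only to show the general idea of case-insensitive keyword matching over GPT metadata.

```python
from typing import Iterable, Sequence

# Hypothetical records. The field names are illustrative only and are
# NOT the actual GPTZoo schema (the paper lists 21 metadata attributes).
SAMPLE_GPTS = [
    {"id": "g-001", "name": "SQL Helper", "description": "Writes and explains SQL queries."},
    {"id": "g-002", "name": "Travel Planner", "description": "Builds itineraries for city trips."},
    {"id": "g-003", "name": "Code Reviewer", "description": "Reviews Python and SQL code."},
]


def search_gpts(
    records: Iterable[dict],
    keyword: str,
    fields: Sequence[str] = ("name", "description"),
) -> list[dict]:
    """Return records whose selected text fields contain the keyword.

    Matching is a case-insensitive substring test; missing fields are
    treated as empty strings.
    """
    needle = keyword.casefold()
    return [
        record
        for record in records
        if any(needle in str(record.get(field, "")).casefold() for field in fields)
    ]


if __name__ == "__main__":
    for hit in search_gpts(SAMPLE_GPTS, "sql"):
        print(hit["id"], hit["name"])
```

A substring scan like this is linear in the number of records; for a dataset of several hundred thousand instances, an inverted index or a full-text search engine would likely be a better fit, though the source does not say which approach the actual tool uses.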
Authors: Xinyi Hou, Yanjie Zhao, Shenao Wang, Haoyu Wang