
Introducing ChatSQC: Enhancing Statistical Quality Control with Augmented AI (2308.13550v2)

Published 22 Aug 2023 in cs.HC

Abstract: We introduce ChatSQC, a chatbot system that combines OpenAI's large language models (LLMs) with a dedicated knowledge base in Statistical Quality Control (SQC). Our research focuses on enhancing LLMs with specific SQC references and shows how data preprocessing parameters and LLM selection affect the quality of generated responses. By illustrating this process, we hope to motivate wider community engagement in refining LLM design and output appraisal techniques. We also highlight potential research opportunities within the SQC domain that ChatSQC can facilitate, thereby broadening the application spectrum of SQC. A primary goal of our work is to provide a template and proof of concept for how LLMs can be used by our community. To continuously improve ChatSQC, we ask the SQC community to provide feedback, highlight potential issues, request additional features, and/or contribute via pull requests through our public GitHub repository. Additionally, the team will continue to explore adding supplementary reference material that would further improve the contextual understanding of the chatbot. Overall, ChatSQC serves as a testament to the transformative potential of AI within SQC, and we hope it will spur further advances in the integration of AI in this field.

