Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Evidence-centered Assessment for Writing with Generative AI (2401.08964v1)

Published 17 Jan 2024 in cs.HC

Abstract: We propose a learning analytics-based methodology for assessing the collaborative writing of humans and generative artificial intelligence. Framed by the evidence-centered design, we used elements of knowledge-telling, knowledge transformation, and cognitive presence to identify assessment claims; we used data collected from the CoAuthor writing tool as potential evidence for these claims; and we used epistemic network analysis to make inferences from the data about the claims. Our findings revealed significant differences in the writing processes of different groups of CoAuthor users, suggesting that our method is a plausible approach to assessing human-AI collaborative writing.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (45)
  1. Daniel Naber. A rule-based style and grammar checker. 01 2003.
  2. A brief introduction to evidence-centered design. US Department of Education, 06 2003.
  3. The extended mind. Analysis, 58(1):7–19, 1998.
  4. OpenAI. Gpt-4 technical report, 2023.
  5. James V. Wertsch. Mind As Action. Oxford University Press, Incorporated, New York, UNITED STATES, 1998.
  6. Lodge J. M.and Howard S.and Bearman M.and Dawson P and Associates. Assessment reform for the age of artificial intelligence. Tertiary Education Quality and Standards Agency, 2023.
  7. Automated essay scoring: A survey of the state of the art. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI-19, pages 6300–6308. International Joint Conferences on Artificial Intelligence Organization, 7 2019.
  8. Advances in writing analytics: Mapping the state of the field. In Companion Proceedings of the 9th International Conference on Learning Analytics & Knowledge, LAK ’19. Association for Computing Machinery, 03 2019.
  9. On the use of bert for automated essay scoring: Joint learning of multi-scale essay representation. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 3416–3425, Seattle, United States, July 2022. Association for Computational Linguistics.
  10. Towards automated analysis of rhetorical categories in students essay writings using bloom’s taxonomy. In LAK23: 13th International Learning Analytics and Knowledge Conference, LAK2023, page 418–429. Association for Computing Machinery, 2023.
  11. Let’s not forget: Learning analytics are about learning. TechTrends, 59, 01 2015.
  12. Analytics of self-regulated learning scaffolding: effects on learning processes. Frontiers in Psychology, 14, 2023.
  13. Assessment in the age of artificial intelligence. Computers and Education: Artificial Intelligence, 3:100075, 2022.
  14. Sunder Pichai. An important next step on our ai journey, 2023.
  15. The psychology of written composition. Psychology of Education and Instruction Series. Erlbaum, 1987.
  16. Mladen Raković. Automatic Identification of Knowledge Transforming Content in Argument Essays Developed from Multiple Sources. Phd thesis, Simon Fraser University, British Columbia, CA, sep 2019.
  17. Critical inquiry in a text-based environment: Computer conferencing in higher education. The Internet and Higher Education, 2(2):87–105, 1999.
  18. Automatic analysis of cognitive presence in online discussions: An approach using deep learning and explainable artificial intelligence. Computers and Education: Artificial Intelligence, 2:100037, 2021.
  19. Uncovering associations between cognitive presence and speech acts: A network-based approach. In LAK22: 12th International Learning Analytics and Knowledge Conference, LAK22, page 315–325. Association for Computing Machinery, 2022.
  20. Lessons learned from implementing remotely invigilated online exams. Journal of University Teaching and Learning Practice, 16(1):137–155, 2019.
  21. Coauthor: Designing a human-ai collaborative writing dataset for exploring language model capabilities. CoRR, abs/2201.06796, 2022.
  22. Natural language processing - writing analytics. In The Handbook of Learning Analytics, pages 96–104. SoLAR, 2 edition, 2022.
  23. Writing process differences in subgroups reflected in keystroke logs. Journal of Educational and Behavioral Statistics, 44(5):571–596, 2019.
  24. Using keystroke analytics to understand cognitive processes during writing. International Educational Data Mining Society, 2021.
  25. Harnessing the potential of trace data and linguistic analysis to predict learner performance in a multi-text writing task. Journal of Computer Assisted Learning, 39(3):703–718, 2023.
  26. Semi-Markov Processes and Reliability. 01 2001.
  27. Keystroke logging in writing research. Written Communication, 30(3):358–392, 2013.
  28. Analysis of collaborative writing processes using revision maps and probabilistic topic models. In Proceedings of 3rd International Conference on Learning Analytics and Knowledge, LAK ’13, page 38–47. Association for Computing Machinery, 2013.
  29. Understanding revisions in student writing through revision graphs. In Artificial Intelligence in Education, pages 332–336, Cham, 2018. Springer International Publishing.
  30. Visual representation of co-authorship with GPT-3: Studying human-machine interaction for effective writing. In Proceedings of the 16th International Conference on Educational Data Mining, pages 183–193. International Educational Data Mining Society, July 2023.
  31. Epistemic Network Analysis: A Worked Example of Theory-Based Learning Analytics, pages 175–187. 01 2017.
  32. Towards a fuller picture: Triangulation and integration of the measurement of self-regulated learning based on trace and think aloud data. Journal of Computer Assisted Learning, 39(4):1303–1324, 2023.
  33. Daniel Chandler. An introduction to genre theory. 1997.
  34. Survey of hallucination in natural language generation. ACM Computing Surveys, 55(12):1–38, mar 2023.
  35. How we code. In Advances in Quantitative Ethnography, pages 62–77. Springer International Publishing, 2021.
  36. Sentence-bert: Sentence embeddings using siamese bert-networks. CoRR, abs/1908.10084, 2019.
  37. Information mining and similarity computation for semi- / un-structured sentences from the social data. Digital Communications and Networks, 7, 08 2020.
  38. rENA: Epistemic Network Analysis, 2022.
  39. The mathematical foundations of epistemic network analysis. In Advances in Quantitative Ethnography, Communications in Computer and Information Science, pages 91–105. Springer, 2021.
  40. Mixed Effects Models and Extensions in Ecology With R, volume 1-574. 01 2009.
  41. Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1):1–48, 2015.
  42. lmerTest package: Tests in linear mixed effects models. Journal of Statistical Software, 82(13):1–26, 2017.
  43. F. E. Satterthwaite. An approximate distribution of estimates of variance components. Biometrics Bulletin, 2(6):110–114, December 1946.
  44. Sample quantiles in statistical packages. The American Statistician, 50:361–365, 11 1996.
  45. J. Cohen. Statistical Power Analysis for the Behavioral Sciences. Lawrence Erlbaum Associates, 1988.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Yixin Cheng (9 papers)
  2. Kayley Lyons (2 papers)
  3. Guanliang Chen (11 papers)
  4. Zachari Swiecki (6 papers)
  5. Dragan Gasevic (12 papers)
Citations (5)
X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets