March 3, 2026

Case Study on Using AI for Feedback and Evaluation in an Undergraduate Statistics Course: A Pragmatic Workflow Approach

Key Points

AI provides motivational feedback and identifies areas for improvement in student research papers, enhancing learning outcomes.
Evaluation followed a four-step workflow: defining red flags, scanning submissions, rubric-based assessment, and final review by a subject matter expert.
The structured approach uses natural language processing to complement human judgment rather than replace it, while addressing ethical considerations.
Strategically guided AI reduces expert workload while ensuring detailed and constructive feedback for students.

Abstract

Artificial Intelligence (AI) is increasingly utilized in student assessment, particularly for creating and grading tests. However, current applications largely focus on multiple-choice questionnaires. In this contribution, we present an innovative case study employing AI, specifically natural language processing techniques, for the evaluation and feedback of research papers submitted by students. We designed and implemented a structured experiment involving second-year undergraduate students enrolled in an introductory statistics course for social sciences. Students were required to submit a 20-page research paper, including a literature review, descriptive statistics, hypothesis formulation, and testing. The aim was not to fully automate evaluation, but rather to create a pragmatic workflow that meaningfully complements human judgment while addressing institutional, ethical, and pedagogical considerations. Through iterative testing with actual student assignments and previously graded submissions, we assessed AI’s ability to deliver motivational feedback, pinpoint areas for improvement, provide rubric-aligned scoring, and detect inconsistencies or possible academic misconduct. In an initial implementation, AI reliably delivered surface-level praise and basic rubric-driven evaluations but struggled with deeper contextual judgments and accurate fraud detection without explicit guidance. To overcome these limitations, we established a four-step workflow: (1) defining red flags based on prior subject matter expert (SME) knowledge; (2) scanning submissions for red flags, inconsistencies, and suspicious data; (3) rubric-based evaluation supported by justifications and specific quotes; and (4) SME intervention to finalize feedback using the AI-generated insights. 
Key insights include the effectiveness of “chain-of-verification” prompting, the necessity of developing domain-specific red-flag rubrics collaboratively with faculty, and the need to balance substantial feedback against time efficiency. Properly guided AI should help reduce expert workload, improve efficiency, and foster student engagement. Instead of replacing human expertise, our proposed model strategically leverages AI to highlight areas where expert intervention is most beneficial. In environments where students increasingly expect detailed feedback but faculty face time constraints, AI can offer a supportive, motivational, and pedagogically appropriate “light-touch” solution. This presentation provides practical examples, detailed workflow diagrams, and valuable insights for educators aiming to responsibly and effectively integrate AI into statistics education.
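
To make the shape of the four-step workflow concrete, the following is a minimal sketch and not the authors' implementation. It swaps the AI (natural language processing) components for simple rule-based checks so that it runs on its own. Every name in it (`RED_FLAGS`, `RUBRIC`, `scan`, `rubric_evidence`, `review`), both example red flags, and the rubric keywords are assumptions made for illustration; the abstract does not list the actual red flags or rubric criteria.

```python
import re
from dataclasses import dataclass, field

# Step 1 (assumed): red flags are defined up front by the subject matter
# expert (SME). The two checks below are invented examples.
P_VALUE = re.compile(r"p\s*=\s*(-?\d+(?:\.\d+)?)")


def impossible_p_values(text):
    """Return reported p-values that fall outside the interval [0, 1]."""
    return [v for v in P_VALUE.findall(text) if not 0.0 <= float(v) <= 1.0]


def chatbot_boilerplate(text):
    """Return telltale phrases that suggest unedited chatbot output."""
    phrases = ["as an ai language model"]
    return [p for p in phrases if p in text.lower()]


RED_FLAGS = {
    "impossible_p_value": impossible_p_values,
    "chatbot_boilerplate": chatbot_boilerplate,
}

# Hypothetical rubric: criterion name -> keywords used to locate quotes.
RUBRIC = {
    "hypothesis": ["null hypothesis", "h0"],
    "descriptive_statistics": ["mean", "median", "standard deviation"],
}


@dataclass
class Review:
    flags: dict = field(default_factory=dict)   # step 2 output
    rubric: dict = field(default_factory=dict)  # step 3 output
    priority: str = "normal"                    # hint for the SME in step 4


def scan(text):
    """Step 2: run every red-flag check and keep only the ones that fired."""
    hits = {name: check(text) for name, check in RED_FLAGS.items()}
    return {name: found for name, found in hits.items() if found}


def rubric_evidence(text):
    """Step 3: collect supporting quotes (whole sentences) per criterion."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    return {
        criterion: [s for s in sentences if any(k in s.lower() for k in keywords)]
        for criterion, keywords in RUBRIC.items()
    }


def review(text):
    """Steps 2 and 3 combined; the result is handed to the SME for step 4."""
    flags = scan(text)
    priority = "high" if flags else "normal"
    return Review(flags=flags, rubric=rubric_evidence(text), priority=priority)
```

The sketch never produces a grade. It only assembles flags and quotes and marks which submissions the expert may want to read first, in line with the stated goal of complementing human judgment. In the workflow described above, an AI model performs the scanning and rubric steps in place of these keyword rules.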

Authors

Ralitza Soultanova

Anna Riepe

RoSE Conference
