Analyzing large-scale student feedback is critical for higher education quality assurance, yet manual analysis is inefficient and subjective. This paper proposes an integrated framework that unifies supervised classification, unsupervised clustering, and retrieval-augmented generation (RAG) to produce evidence-grounded and actionable insights. Ensemble-based supervised models perform thematic classification, while multi-encoder embedding fusion enables unsupervised discovery of coherent feedback clusters. A multi-stage RAG module integrates category predictions and cluster structure to retrieve representative evidence and generate transparent summaries with citation traceability. The framework is evaluated on student feedback collected from a Central Asian university and two public benchmarks, EduRABSA and Coursera course reviews, covering seven thematic categories. The supervised ensemble achieves 83.0% accuracy and 0.829 Macro-F1 on the primary dataset, while unsupervised clustering attains a silhouette score of 0.271 under the best fusion strategy. Independent evaluation on external benchmarks yields ensemble accuracy of 81.1% on EduRABSA and 49.8% on Coursera, confirming the framework’s adaptability across diverse educational contexts. By leveraging supervised labels and unsupervised structure, the proposed framework enables evidence-grounded, category-aware LLM-based summaries that faithfully reflect the diversity and distribution of student feedback and support actionable educational decision-making.
Building similarity graph...
Analyzing shared references across papers
Loading...
Zhanerke Baimukanova
Yerassyl Saparbekov
Hyesong Ha
Information
Nazarbayev University
Building similarity graph...
Analyzing shared references across papers
Loading...
Baimukanova et al. (Tue,) studied this question.
www.synapsesocial.com/papers/69d894ad6c1944d70ce0597b — DOI: https://doi.org/10.3390/info17040351