The rapid proliferation of generative artificial intelligence (GenAI) tools has created an urgent need for instruments that evaluate their educational value as teachers, faculty, administrators, and instructional designers consider adopting them. While rubrics exist to assess mobile applications and virtual reality tools, no comparable instrument has been developed specifically for large language models (LLMs) and AI media generators. The authors reviewed existing evaluation rubrics for edtech and GenAI tools. Here, edtech means digital tools that support ethical teaching to improve student learning, and GenAI refers to neural networks that simulate human interactions by contextualizing relevant content based on learning needs. Grounded in Waks’ framework, the resulting Edu-GenAI Rubric comprises multiple dimensions organized into five value domains: Instrumental, Technical, Hedonic, Use, and Beneficial. The dimensions include accuracy, productivity, personalization, citation, user interface, user experience, sharing, and storage, along with ethical dimensions encompassing data privacy, data transparency, guardrails, fair use, and algorithmic discrimination. The Edu-GenAI Rubric offers decision-makers a preliminary, theory-informed instrument for evaluating GenAI tools in educational contexts, one that can be applied to institutional adoption decisions, developer benchmarking, and future research.
Cherner et al. studied this question.