May 16, 2026Open Access

Evaluating creative work with artificial intelligence: Evidence from constrained innovation tasks

Key Points

Key points are not available for this paper at this time.

Abstract

We study whether a large language model can reliably evaluate human creativity in constrained, innovation-like tasks. Using expert-generated creative outputs from a validated experiment with workers in cultural and creative industries, we embed ChatGPT as an evaluator and benchmark its assessments against expert human judgments obtained through the Consensual Assessment Technique. Study 1 supports AI reliability by showing that AI-based creativity evaluations exhibit internal consistency comparable to that of expert judges across repeated and independent runs, even under conservative scenarios. Replacing a human judge with an AI evaluator does not reduce inter-rater reliability across drawing, mathematical, and verbal tasks. Beyond reliability, AI evaluations display three additional features that are difficult to achieve with human-only panels: lower evaluative variability, systematically higher scores consistent with a potentially more inclusive evaluative stance, and task-independence of evaluative standards. Study 2 further supports task-independence by showing that AI evaluations are structured along fluency, flexibility, originality, and elaboration, with dimension weights that adapt to task-specific constraints. • We test AI evaluation of human creativity on outputs from a controlled experiment. • We study constrained, innovation-like creative tasks. • Replacing one human judge with AI preserves panel reliability. • AI scores are less dispersed, higher on average, and task-independent. • AI evaluation is structured by fluency, flexibility, originality, and elaboration.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Valerio Fedele Addis

Giuseppe Attanasi

Giovanni Di Bartolomeo

Journals

Technovation

Actions

Institutions

Sapienza University of Rome

Corvinus University of Budapest

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Evaluating creative work with artificial intelligence: Evidence from constrained innovation tasks

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study