This dataset contains 5,997 annotated code samples (3,200 synthetic violations, 1,847 real-world violations from 15 open-source projects, and 950 negative samples) for benchmarking automated usability heuristic evaluation tools. Annotations include violation type, severity, and heuristic category.
Nwasra et al. (Tue,) studied this question.