What question did this study set out to answer?

The research aims to create a new dataset for improving the evaluation of machine learning models in handwritten character recognition.

January 25, 2026Open Access

The dataset for extending EMNIST evaluation

Key Points

The research aims to create a new dataset for improving the evaluation of machine learning models in handwritten character recognition.
Developed a novel dataset combined with existing NIST Databases.
Analyzed popular machine learning models trained on the EMNIST-letters dataset.
Discussed evaluation issues related to accuracy comparisons on test sets.
Proposed evaluation methods on new independently constructed data.
Identified limitations in current evaluation methods for state-of-the-art models.
Demonstrated the potential for deeper insights using the new dataset and evaluation process.
Included publicly available source codes and datasets for the research community.

Abstract

The paper describes the dataset for a deeper evaluation of the machine learning models for handwritten character recognition. For that purpose, we build a dataset that, combined with existing NIST Databases, offers possibilities for additional analysis of the models built on these data. The paper summarizes the most popular publicly available machine learning models, trained on the EMNIST-letters dataset. We discuss issues related to the evaluation of state-of-the-art results that have been made by comparing accuracy achieved on the test set built in cross-validation setting. We propose additional evaluation on new, independently constructed data, unaffiliated with the NIST database authors. The dataset and source codes have been made available using Gdansk Tech University repository Most Wiedzy.

The dataset for extending EMNIST evaluation

Key Points

Abstract

Cite This Study