What question did this study set out to answer?

This study evaluates the effectiveness of self-supervised contrastive learning methods for stock keeping unit recognition.

March 18, 2026Open Access

Contrastive Learning in Stock Keeping Unit Image Recognition

Key Points

This study evaluates the effectiveness of self-supervised contrastive learning methods for stock keeping unit recognition.
Evaluated three methods: SimCLR, MoCo v2, and BYOL.
Conducted experiments on the RP2K benchmark and InSKU dataset.
Applied linear probing and full fine-tuning techniques.
SimCLR achieved the highest Top-1 accuracy of 94.98% with linear evaluation.
BYOL attained the highest performance of 99.22% with full fine-tuning.
MoCo v2 showed competitive performance under a reduced training budget after dataset filtering.

Abstract

Self-supervised contrastive learning has become an effective approach for visual representation learning when large-scale annotation is impractical. In this study, we evaluate three widely used methods—SimCLR, MoCo v2, and BYOL—for large-scale stock keeping unit (SKU) recognition in retail environments. Experiments are conducted on the RP2K benchmark and a domain-specific in-house dataset (InSKU) using both linear probing and full fine-tuning. Under the original RP2K configuration with extended self-supervised pre-training, SimCLR achieves the highest Top-1 accuracy under linear evaluation (94.98%). In contrast, BYOL attains the highest performance under full fine-tuning (99.22% Top-1 accuracy). After filtering and deduplicating the dataset to reduce class imbalance and near-duplicate samples, MoCo v2 achieves competitive, and in some cases superior, linear performance under a reduced training budget. Cross-domain evaluation on InSKU indicates that SimCLR generalises more effectively under frozen-encoder constraints, whereas BYOL and MoCo v2 require full adaptation. These results highlight the sensitivity of contrastive representations to dataset composition, optimisation regime, and domain shift, providing practical guidance for deployment in dynamic retail settings.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Wiktor Kępiński

Grzegorz Sarwas

Journals

Applied Sciences

Actions

Institutions

Warsaw University of Technology

Omnikon (Poland)

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Contrastive Learning in Stock Keeping Unit Image Recognition

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study