Accurate Single-cell (SC) image classification is critical for characterizing cellular heterogeneity and supporting disease diagnostics. Conventional convolutional models often struggle due to limited data, subtle morphological differences between cell types, and class imbalance. In this work, we propose a Hybrid Inception Vision Transformer (HiViT) that combines Inception convolutional feature extraction with transformer-based attention mechanism to capture bothfine-grained texture and long-range structural context. Our framework incorporates adaptive uncertainty-aware learning via Monte Carlo dropout and data balancing through augmentation. We evaluate HiViT on the White BloodCell (WBC) classification Berkeley SC Computational Microscopy (BSCCM) dataset, covering Lymphocyte, Granulocyte, and Monocyte classes. The model achieves overall superior performance compared to classical machine learningand deep learning baselines, with class-wise recalls of 90.31% (Lymphocyte), 97.97% (Granulocyte), and 81.21% (Monocyte). Experiments highlight the effectiveness of hybrid CNN–ViT architectures for robust and uncertainty-awareSC classification, providing a foundation for extending to other biomedical image-driven analysis and diagnostic tasks.
Building similarity graph...
Analyzing shared references across papers
Loading...
Saqib Nazir
ARDHENDU; id_orcid 0000-0003-0276-9000 BEHERA
Building similarity graph...
Analyzing shared references across papers
Loading...
Nazir et al. (Tue,) studied this question.