March 3, 2026

Hybrid Inception-ViT Networks for Fine-Grained Single-Cell Image Classification

Key Points

The HiViT model significantly improves single-cell image classification accuracy, particularly highlighting class recalls for various white blood cells.
Class-wise recalls achieved were 90.31% for lymphocytes, 97.97% for granulocytes, and 81.21% for monocytes, demonstrating notable performance.
Evaluation utilized the Berkeley SC Computational Microscopy dataset, featuring multiple classes of white blood cells, enhancing diagnostic utility.
Findings suggest that hybrid CNN–ViT architectures could be essential in advancing biomedical image analysis and improving disease diagnostics.

Abstract

Accurate Single-cell (SC) image classification is critical for characterizing cellular heterogeneity and supporting disease diagnostics. Conventional convolutional models often struggle due to limited data, subtle morphological differences between cell types, and class imbalance. In this work, we propose a Hybrid Inception Vision Transformer (HiViT) that combines Inception convolutional feature extraction with transformer-based attention mechanism to capture bothfine-grained texture and long-range structural context. Our framework incorporates adaptive uncertainty-aware learning via Monte Carlo dropout and data balancing through augmentation. We evaluate HiViT on the White BloodCell (WBC) classification Berkeley SC Computational Microscopy (BSCCM) dataset, covering Lymphocyte, Granulocyte, and Monocyte classes. The model achieves overall superior performance compared to classical machine learningand deep learning baselines, with class-wise recalls of 90.31% (Lymphocyte), 97.97% (Granulocyte), and 81.21% (Monocyte). Experiments highlight the effectiveness of hybrid CNN–ViT architectures for robust and uncertainty-awareSC classification, providing a foundation for extending to other biomedical image-driven analysis and diagnostic tasks.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Saqib Nazir

ARDHENDU; id_orcid 0000-0003-0276-9000 BEHERA

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Hybrid Inception-ViT Networks for Fine-Grained Single-Cell Image Classification

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study