Few-Shot Class-Incremental Learning (FSCIL) aims to sequentially learn new classes from very few labelled samples while preventing the forgetting of previously acquired knowledge, which has important practical value for remote sensing scene classification (RSSC). Recent studies have shown that applying a Vision Transformer (ViT) pre-trained on natural image datasets to FSCIL tasks can achieve significantly superior performance. Nevertheless, a substantial domain distribution gap exists between natural images and remote sensing images, which leads to severe performance degradation when such models are directly transferred to RSSC. To address the domain gap alongside FSCIL’s inherent stability–plasticity dilemma and overfitting under data scarcity, we propose a Dynamic Expansion Mixture-of-Experts with Pre-trained Vision Transformer (DEM-ViT) framework. Specifically, to alleviate the domain discrepancy, DEM-ViT incorporates an Adapter-Based Mixture-of-Experts (ABMoE) module, which captures the diverse visual patterns of remote sensing scenes through feature reconstruction in the representation space and collaborative learning among multiple experts. Furthermore, to address the stability–plasticity dilemma in FSCIL, we propose a Dynamic Expert Expansion (DEE) strategy, which progressively expands the model capacity along the incremental sessions. DEE provides sufficient space for learning new knowledge while mitigating the forgetting of previous knowledge. In addition, we propose a Semantic-Guided Feature Alignment (SGFA) method to reduce the risk of overfitting under data-scarce conditions. SGFA leverages textual information to construct robust text prototypes and uses them to calibrate the visual feature space. Extensive experiments across three benchmarks indicate that our framework exhibits highly competitive performance compared with state-of-the-art methods.
Building similarity graph...
Analyzing shared references across papers
Loading...
Wu et al. (Sun,) studied this question.
www.synapsesocial.com/papers/69df2c2fe4eeef8a2a6b134f — DOI: https://doi.org/10.3390/rs18081145
Yunhao Wu
Xiang Li
Jianlin Zhang
Remote Sensing
University of Chinese Academy of Sciences
Institute of Optics and Electronics, Chinese Academy of Sciences
Building similarity graph...
Analyzing shared references across papers
Loading...