Explore open access research and scholarly works from NERC Open Research Archive

Advanced Search

Self-organising explainable multi-view representation learning for remote sensing scene classification

Gu, Xiaowei ORCID: https://orcid.org/0000-0001-9116-4761; Kerim, Abdulrahman ORCID: https://orcid.org/0000-0003-0141-9543; Zhang, Jinghao ORCID: https://orcid.org/0000-0001-5394-1814; Han, Jungong; Shen, Qiang ORCID: https://orcid.org/0000-0001-9333-4605; Atkinson, Peter M. ORCID: https://orcid.org/0000-0002-5489-6880; Zhang, Ce ORCID: https://orcid.org/0000-0001-5100-3584. 2026 Self-organising explainable multi-view representation learning for remote sensing scene classification. Applied Soft Computing, 190, 114579. 19, pp. 10.1016/j.asoc.2026.114579

Abstract
Remote sensing scene classification is widely considered to be a challenging task due to high intraclass variability and interclass similarity in remotely sensed imagery. While existing deep neural networks achieve promising performance, they often lack transparency and generalisation capability. To enhance interpretability without sacrificing accuracy, a novel self-organising transparent multi-view representation learning framework based on evolving fuzzy neural encoders for remote sensing scene classification is introduced in this paper. The framework leverages multiple pre-trained convolutional neural network backbones with different architectures to extract image embeddings from multiple views. The multi-view image embeddings are projected into a lower-dimensional feature space using multilayer evolving fuzzy neural networks trained in a supervised or self-supervised fashion as encoders and subsequently fused for scene classification. Extensive experiments on six benchmark datasets (Optimal-31, WHU-RS, UCMerced, AID, RSI-CB256, and PatternNet) demonstrate the framework’s superior performance, achieving average accuracies of 99.81 %, 98.83 %, 97.86 %, 98.37 %, 99.84 %, and 98.83 %, respectively, without fine-tuning to the specific context. Ablation studies confirm the complementary contributions of the multi-view, supervised, and self-supervised components in the proposed framework. The proposed framework provides an effective solution for remote sensing scene classification, achieving high accuracy with enhanced transparency and interpretability.
Documents
Full text not available from this repository.
Information
Library
Metrics

Altmetric Badge

Dimensions Badge

Share
Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email
View Item