CMID: A Unified Self-Supervised Learning Framework for Remote Sensing Image Understanding
نویسندگان
چکیده
Self-supervised learning (SSL) has gained widespread attention in the remote sensing (RS) and earth observation (EO) communities owing to its ability learn task-agnostic representations without human-annotated labels. Nevertheless, most existing RS SSL methods are limited either global semantic separable or local spatial perceptible representations. We argue that this strategy is suboptimal realm of RS, since required for different downstream tasks often varied complex. In study, we proposed a unified framework better suited images representation learning. The framework, Contrastive Mask Image Distillation (CMID), capable with both separability perceptibility by combining contrastive (CL) masked image modeling (MIM) self-distillation way. Furthermore, our CMID architecture-agnostic, which compatible convolutional neural networks (CNN) vision transformers (ViT), allowing be easily adapted variety deep (DL) applications understanding. Comprehensive experiments have been carried out on four (i.e. scene classification, segmentation, object-detection, change detection) results show models pre-trained using achieve performance than other state-of-the-art multiple tasks. code will made available at https://github.com/NJU-LHRS/official-CMID facilitate research speed up development DL applications.
منابع مشابه
Deep Self-taught Learning for Remote Sensing Image Classification
This paper addresses the land cover classification task for remote sensing images by deep self-taught learning. Our selftaught learning approach learns suitable feature representations of the input data using sparse representation and undercomplete dictionary learning. We propose a deep learning framework which extracts representations in multiple layers and use the output of the deepest layer ...
متن کاملSupervised Object Knowledge Learning for Image Understanding
An object learning system for image understanding is proposed in this paper. The knowledge acquisition system is designed as a supervised learning task. Therefore, the role of the user as teacher of the system is emphasized, which allows to obtain the object description as well as to select the best recognition strategy for each specific object. An object description is acquired by considering ...
متن کاملA unified semi-supervised dimensionality reduction framework for manifold learning
We present a general framework of semi-supervised dimensionality reduction for manifold learning which naturally generalizes existing supervised and unsupervised learning frameworks which apply the spectral decomposition. Algorithms derived under our framework are able to employ both labeled and unlabeled examples and are able to handle complex problems where data form separate clusters of mani...
متن کاملA Unified Framework Of Distributed Spatial Database Orienting The Mass Remote Sensing Image Data
During the process of applying remote sensing image data, how to save the mass remote sensing image data of different times and use them effectively is an important problem. Researches show that the traditional large commercial relational database cannot storage the mass remote sensing data effectively and also cannot support unlimited data storage. In this paper, we combine the file system, re...
متن کاملA novel semi-supervised learning framework for hyperspectral image classification
In this paper, we propose a novel semi-supervised learning classification framework using box-based smooth ordering and Multiple 1D-embedding-based interpolation method in Ref. 25 for hyperspectral images. Due to the lack of labeled samples, conventional supervised approaches cannot generally perform efficient enough. On the other hand, obtaining labeled samples for hyperspectral image classifi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Geoscience and Remote Sensing
سال: 2023
ISSN: ['0196-2892', '1558-0644']
DOI: https://doi.org/10.1109/tgrs.2023.3268232