Cross-Modal Contrastive Learning for Domain Adaptation in 3D Semantic Segmentation
نویسندگان
چکیده
Domain adaptation for 3D point cloud has attracted a lot of interest since it can avoid the time-consuming labeling process data to some extent. A recent work named xMUDA leveraged multi-modal domain task semantic segmentation by mimicking predictions between 2D and modalities, outperformed previous single modality methods only using clouds. Based on it, in this paper, we propose novel cross-modal contrastive learning scheme further improve effects. By employing constraints from correspondences pixel features features, our method not facilitates interaction two different but also boosts feature representations both labeled source unlabeled target domain. Meanwhile, sufficiently utilize context information through learning, introduce neighborhood aggregation module enhance features. The employs attention aggregate nearby pixels image, which relieves mismatching arising projecting relative sparse dense image pixels. We evaluate three unsupervised scenarios, including country-to-country, day-to-night, dataset-to-dataset. Experimental results show that approach outperforms existing methods, demonstrates effectiveness proposed method.
منابع مشابه
Cross-Domain Learning for Semantic Concept Detection
Automatic semantic concept detection has become increasingly important to effectively index and search the exploding amount of multimedia content, such as those from the Web and TV broadcasts. The large and growing amount of unlabeled data in comparison with the small amount of labeled training data limits the applicability of classifiers based upon supervised learning. In addition, newly acqui...
متن کاملUnsupervised Domain Adaptation for Semantic Segmentation with GANs
Visual Domain Adaptation is a problem of immense importance in computer vision. Previous approaches showcase the inability of even deep neural networks to learn informative representations across domain shift. This problem is more severe for tasks where acquiring hand labeled data is extremely hard and tedious. In this work, we focus on adapting the representations learned by segmentation netwo...
متن کاملLearning Deep Semantic Embeddings for Cross-Modal Retrieval
Deep learning methods have been actively researched for cross-modal retrieval, with the softmax cross-entropy loss commonly applied for supervised learning. However, the softmax cross-entropy loss is known to result in large intra-class variances, which is not not very suited for cross-modal matching. In this paper, a deep architecture called Deep Semantic Embedding (DSE) is proposed, which is ...
متن کاملCross-Modal Semantic Priming
The cross-modal semantic priming paradigm is described, including its underlying rationale and the different tasks with which it is combined. The major characteristics of the paradigm—the type of stimuli used, the dependent and independent variables typically manipulated—are then introduced. Also, its main advantages and drawbacks are discussed. Finally, the most important areas of application ...
متن کاملDomain - Adaptation Technique for Semantic Role Labeling with Structural Learning
© 2014 Soojong Lim et al. 429 http://dx.doi.org/10.4218/etrij.14.0113.0645 Semantic role labeling (SRL) is a task in naturallanguage processing with the aim of detecting predicates in the text, choosing their correct senses, identifying their associated arguments, and predicting the semantic roles of the arguments. Developing a high-performance SRL system for a domain requires manually annotate...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2023
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v37i3.25400