Masked Discrimination for Self-supervised Learning on Point Clouds

نویسندگان

چکیده

Masked autoencoding has achieved great success for self-supervised learning in the image and language domains. However, mask based pretraining yet to show benefits point cloud understanding, likely due standard backbones like PointNet being unable properly handle training versus testing distribution mismatch introduced by masking during training. In this paper, we bridge gap proposing a discriminative Transformer framework, MaskPoint, clouds. Our key idea is represent as discrete occupancy values (1 if part of cloud; 0 not), perform simple binary classification between masked object points sampled noise proxy task. way, our approach robust sampling variance clouds, facilitates rich representations. We evaluate pretrained models across several downstream tasks, including 3D shape classification, segmentation, real-word detection, demonstrate state-of-the-art results while achieving significant speedup (e.g., 4.1 $$\times $$ on ScanNet) compared prior baseline. Code available at https://github.com/haotian-liu/MaskPoint .

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Voxel-Based Neighborhood for Spatial Shape Pattern Classification of Lidar Point Clouds with Supervised Learning

Improving the effectiveness of spatial shape features classification from 3D lidar data is very relevant because it is largely used as a fundamental step towards higher level scene understanding challenges of autonomous vehicles and terrestrial robots. In this sense, computing neighborhood for points in dense scans becomes a costly process for both training and classification. This paper propos...

متن کامل

Dynamic Graph CNN for Learning on Point Clouds

Point clouds provide a flexible and scalable geometric representation suitable for countless applications in computer graphics; they also comprise the raw output of most 3D data acquisition devices. Hence, the design of intelligent computational models that act directly on point clouds is critical, especially when efficiency considerations or noise preclude the possibility of expensive denoisin...

متن کامل

Weakly Supervised Learning of Foreground-Background Segmentation Using Masked RBMs

We propose an extension of the Restricted Boltzmann Machine (RBM) that allows the joint shape and appearance of foreground objects in cluttered images to be modeled independently of the background. We present a learning scheme that learns this representation directly from cluttered images with only very weak supervision. The model generates plausible samples and performs foreground-background s...

متن کامل

A Supervised Approach for Object Extraction from Terrestrial Laser Point Clouds Demonstrated on Trees

Terrestrial laser scanning is becoming a standard for 3D modeling of complex scenes. Results of the scan contain detailed geometric information about the scene; however, the lack of semantic details is still a gap in making this data useable for mapping. In this paper we propose a framework for object recognition in laser scans. The 3D point cloud, which is the natural representation of scanner...

متن کامل

Supervised Learning for Self-Generating Neural Networks

In this paper, supervised learning for Self-Generating Neural Networks (SGNN) method, which was originally developed for the purpose of unsupervised learning, is discussed. An information analytical method is proposed to assign weights to attributes in the training examples if class information is available. This significantly improves the learning speed and the accuracy of the SGNN classiier. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2022

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-031-20086-1_38