Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


CrossMoCo: Multi-modal Momentum Contrastive Learning for Point Cloud

Sneha Paul, Zachary Patterson, Nizar Bouguila

2023-06-08 · 20th Conference on Robots and Vision (CRV) 2023

Tasks: 3D Point Cloud Linear Classification · Few-Shot Learning · Self-Supervised Learning · Contrastive Learning · Few-Shot 3D Point Cloud Classification · 3D Object Classification · 3D Point Cloud Classification

Paper · PDF · Code

Abstract

A point cloud is 3D geometric data that lacks a specific structure and is permutation-invariant. Applications of point clouds have recently gained significant attention in the field of vision tasks. However, most existing works on point clouds rely on supervised learning over large labelled datasets, which are costly and laborious to collect. To this end, unsupervised learning, for example, self-supervised learning, has shown promising performance in various 2D computer vision tasks and holds potential for 3D computer vision applications. In this study, we introduce a novel self-supervised method called CrossMoCo, which learns representations of unlabelled point cloud data in a multi-modal setup that also utilizes 2D rendered images of the point clouds. CrossMoCo outperforms existing methods for multi-modal self-supervised learning on point clouds by introducing two new concepts: momentum contrastive learning with more negative samples, and multiple-view intra-modal contrastive learning. The first component learns from an online encoder and a momentum encoder with a large number of negative samples, which provides consistent learning signals. The second component enforces consistency between different views of samples from the same modality, thereby improving the multi-modal representation. We conduct extensive studies on two popular benchmark datasets (ModelNet40 and ScanObjectNN) for linear classification and few-shot learning tasks. Our results demonstrate that CrossMoCo achieves superior performance over existing methods for both tasks on both datasets, with up to 4.36% improvement on linear classification and up to 9.2% on few-shot tasks. Our code is available at https://github.com/snehaputul/CrossMoCo.
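The abstract's first component — an online encoder paired with a momentum encoder and a large pool of negative samples — follows the general MoCo recipe. Below is a minimal NumPy sketch of those two pieces: an exponential-moving-average (EMA) update of the momentum encoder's parameters, and an InfoNCE-style contrastive loss over one positive key and a queue of negatives. Function names and shapes are illustrative assumptions, not taken from the paper's released code.

```python
import numpy as np

def momentum_update(online_params, momentum_params, m=0.999):
    # MoCo-style EMA: the momentum encoder slowly tracks the online encoder.
    return [m * p_m + (1.0 - m) * p_o
            for p_o, p_m in zip(online_params, momentum_params)]

def info_nce_loss(q, k_pos, queue, temperature=0.07):
    # q:     (d,)   query embedding from the online encoder
    # k_pos: (d,)   positive key from the momentum encoder
    # queue: (K, d) queued negative keys from past batches
    q = q / np.linalg.norm(q)
    k_pos = k_pos / np.linalg.norm(k_pos)
    queue = queue / np.linalg.norm(queue, axis=1, keepdims=True)
    l_pos = q @ k_pos            # similarity to the positive key
    l_neg = queue @ q            # similarities to K negative keys
    logits = np.concatenate([[l_pos], l_neg]) / temperature
    logits -= logits.max()       # numerical stability
    # Cross-entropy with the positive key as the "correct class".
    return -np.log(np.exp(logits[0]) / np.exp(logits).sum())
```

In the full method this loss would be applied both across modalities (point cloud vs. rendered image) and, per the second component, between different views within the same modality; the sketch shows only the shared contrastive core.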

Related Papers

- GLAD: Generalizable Tuning for Vision-Language Models (2025-07-17)
- A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys (2025-07-17)
- SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts (2025-07-17)
- HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals (2025-07-17)
- Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management (2025-07-17)
- SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation (2025-07-17)
- Similarity-Guided Diffusion for Contrastive Sequential Recommendation (2025-07-16)
- LLM-Driven Dual-Level Multi-Interest Modeling for Recommendation (2025-07-15)