TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Compressing 3DCNNs Based on Tensor Train Decomposition

Compressing 3DCNNs Based on Tensor Train Decomposition

Dingheng Wang, Guangshe Zhao, Guoqi Li, Lei Deng, Yang Wu

2019-12-08QuantizationHand Gesture RecognitionNeural Network CompressionHand-Gesture Recognition
PaperPDF

Abstract

Three dimensional convolutional neural networks (3DCNNs) have been applied in many tasks, e.g., video and 3D point cloud recognition. However, due to the higher dimension of convolutional kernels, the space complexity of 3DCNNs is generally larger than that of traditional two dimensional convolutional neural networks (2DCNNs). To miniaturize 3DCNNs for the deployment in confining environments such as embedded devices, neural network compression is a promising approach. In this work, we adopt the tensor train (TT) decomposition, a straightforward and simple in situ training compression method, to shrink the 3DCNN models. Through proposing tensorizing 3D convolutional kernels in TT format, we investigate how to select appropriate TT ranks for achieving higher compression ratio. We have also discussed the redundancy of 3D convolutional kernels for compression, core significance and future directions of this work, as well as the theoretical computation complexity versus practical executing time of convolution in TT. In the light of multiple contrast experiments based on VIVA challenge, UCF11, and UCF101 datasets, we conclude that TT decomposition can compress 3DCNNs by around one hundred times without significant accuracy loss, which will enable its applications in extensive real world scenarios.

Results

TaskDatasetMetricValueModel
HandSHREC 2017 track on 3D Hand Gesture Recognition14 gestures accuracy731212163DCNN_VIVA_4
HandVIVA Hand Gestures DatasetAccuracy77.5Two 3DCNNs: LRN + HRN [11]
HandVIVA Hand Gestures DatasetAccuracy6.86
HandVIVA Hand Gestures DatasetAccuracy-CN23032403DCNN_VIVA_1
HandVIVA Hand Gestures DatasetAccuracy-CN-135855913DCNN_VIVA_2
Gesture RecognitionSHREC 2017 track on 3D Hand Gesture Recognition14 gestures accuracy731212163DCNN_VIVA_4
Gesture RecognitionVIVA Hand Gestures DatasetAccuracy77.5Two 3DCNNs: LRN + HRN [11]
Gesture RecognitionVIVA Hand Gestures DatasetAccuracy6.86
Gesture RecognitionVIVA Hand Gestures DatasetAccuracy-CN23032403DCNN_VIVA_1
Gesture RecognitionVIVA Hand Gestures DatasetAccuracy-CN-135855913DCNN_VIVA_2
QuantizationCIFAR-10MAP160327.043DCNN_VIVA_3
QuantizationKnowledge-based:All848096643DCNN_VIVA_5

Related Papers

Efficient Deployment of Spiking Neural Networks on SpiNNaker2 for DVS Gesture Recognition Using Neuromorphic Intermediate Representation2025-09-04An End-to-End DNN Inference Framework for the SpiNNaker2 Neuromorphic MPSoC2025-07-18Task-Specific Audio Coding for Machines: Machine-Learned Latent Features Are Codes for That Machine2025-07-17Angle Estimation of a Single Source with Massive Uniform Circular Arrays2025-07-17Quantized Rank Reduction: A Communications-Efficient Federated Learning Scheme for Network-Critical Applications2025-07-15MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization2025-07-14Lightweight Federated Learning over Wireless Edge Networks2025-07-13Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation2025-07-11