TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Learned Image Compression with Mixed Transformer-CNN Archi...

Learned Image Compression with Mixed Transformer-CNN Architectures

Jinming Liu, Heming Sun, Jiro Katto

2023-03-27CVPR 2023 1Image Compression
PaperPDFCodeCode(official)Code

Abstract

Learned image compression (LIC) methods have exhibited promising progress and superior rate-distortion performance compared with classical image compression standards. Most existing LIC methods are Convolutional Neural Networks-based (CNN-based) or Transformer-based, which have different advantages. Exploiting both advantages is a point worth exploring, which has two challenges: 1) how to effectively fuse the two methods? 2) how to achieve higher performance with a suitable complexity? In this paper, we propose an efficient parallel Transformer-CNN Mixture (TCM) block with a controllable complexity to incorporate the local modeling ability of CNN and the non-local modeling ability of transformers to improve the overall architecture of image compression models. Besides, inspired by the recent progress of entropy estimation models and attention modules, we propose a channel-wise entropy model with parameter-efficient swin-transformer-based attention (SWAtten) modules by using channel squeezing. Experimental results demonstrate our proposed method achieves state-of-the-art rate-distortion performances on three different resolution datasets (i.e., Kodak, Tecnick, CLIC Professional Validation) compared to existing LIC methods. The code is at https://github.com/jmliu206/LIC_TCM.

Results

TaskDatasetMetricValueModel
Image CompressionkodakBD-Rate over VTM-17.0-10.14LIC-TCM Large

Related Papers

Perception-Oriented Latent Coding for High-Performance Compressed Domain Semantic Inference2025-07-02Explicit Residual-Based Scalable Image Coding for Humans and Machines2025-06-24NIC-RobustBench: A Comprehensive Open-Source Toolkit for Neural Image Compression and Robustness Analysis2025-06-23LVPNet: A Latent-variable-based Prediction-driven End-to-end Framework for Lossless Compression of Medical Images2025-06-22DiffO: Single-step Diffusion for Image Compression at Ultra-Low Bitrates2025-06-19Fast Training-free Perceptual Image Compression2025-06-19ABC: Adaptive BayesNet Structure Learning for Computational Scalable Multi-task Image Compression2025-06-18Breaking the Multi-Enhancement Bottleneck: Domain-Consistent Quality Enhancement for Compressed Images2025-06-17