TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Densely Connected Parameter-Efficient Tuning for Referring...

Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation

Jiaqi Huang, Zunnan Xu, Ting Liu, Yong liu, Haonan Han, Kehong Yuan, Xiu Li

2025-01-15Referring Expression SegmentationTransfer LearningSemantic SegmentationImage Segmentation
PaperPDFCode(official)

Abstract

In the domain of computer vision, Parameter-Efficient Tuning (PET) is increasingly replacing the traditional paradigm of pre-training followed by full fine-tuning. PET is particularly favored for its effectiveness in large foundation models, as it streamlines transfer learning costs and optimizes hardware utilization. However, the current PET methods are mainly designed for single-modal optimization. While some pioneering studies have undertaken preliminary explorations, they still remain at the level of aligned encoders (e.g., CLIP) and lack exploration of misaligned encoders. These methods show sub-optimal performance with misaligned encoders, as they fail to effectively align the multimodal features during fine-tuning. In this paper, we introduce DETRIS, a parameter-efficient tuning framework designed to enhance low-rank visual feature propagation by establishing dense interconnections between each layer and all preceding layers, which enables effective cross-modal feature interaction and adaptation to misaligned encoders. We also suggest using text adapters to improve textual features. Our simple yet efficient approach greatly surpasses state-of-the-art methods with 0.9% to 1.8% backbone parameter updates, evaluated on challenging benchmarks. Our project is available at \url{https://github.com/jiaqihuang01/DETRIS}.

Results

TaskDatasetMetricValueModel
Instance SegmentationRefCOCOIoU81DETRIS
Instance SegmentationRefCOCO testAOverall IoU81.9DETRIS
Instance SegmentationRefCoCo valOverall IoU81DETRIS
Instance SegmentationRefCOCO testBOverall IoU79DETRIS
Instance SegmentationRefCOCOg-testOverall IoU75.3DETRIS
Instance SegmentationRefCOCO+ valOverall IoU75.2DETRIS
Instance SegmentationRefCOCO+ test BOverall IoU70.2DETRIS
Instance SegmentationRefCOCO+ testAOverall IoU78.6DETRIS
Instance SegmentationRefCOCOg-valOverall IoU74.6DETRIS
Referring Expression SegmentationRefCOCOIoU81DETRIS
Referring Expression SegmentationRefCOCO testAOverall IoU81.9DETRIS
Referring Expression SegmentationRefCoCo valOverall IoU81DETRIS
Referring Expression SegmentationRefCOCO testBOverall IoU79DETRIS
Referring Expression SegmentationRefCOCOg-testOverall IoU75.3DETRIS
Referring Expression SegmentationRefCOCO+ valOverall IoU75.2DETRIS
Referring Expression SegmentationRefCOCO+ test BOverall IoU70.2DETRIS
Referring Expression SegmentationRefCOCO+ testAOverall IoU78.6DETRIS
Referring Expression SegmentationRefCOCOg-valOverall IoU74.6DETRIS

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21RaMen: Multi-Strategy Multi-Modal Learning for Bundle Construction2025-07-18Disentangling coincident cell events using deep transfer learning and compressive sensing2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17Best Practices for Large-Scale, Pixel-Wise Crop Mapping and Transfer Learning Workflows2025-07-16