TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/JAFAR: Jack up Any Feature at Any Resolution

JAFAR: Jack up Any Feature at Any Resolution

Paul Couairon, Loick Chambon, Louis Serrano, Jean-Emmanuel Haugeard, Matthieu Cord, Nicolas Thome

2025-06-10Feature Upsampling
PaperPDFCode(official)

Abstract

Foundation Vision Encoders have become essential for a wide range of dense vision tasks. However, their low-resolution spatial feature outputs necessitate feature upsampling to produce the high-resolution modalities required for downstream tasks. In this work, we introduce JAFAR, a lightweight and flexible feature upsampler that enhances the spatial resolution of visual features from any Foundation Vision Encoder to an arbitrary target resolution. JAFAR employs an attention-based module designed to promote semantic alignment between high-resolution queries, derived from low-level image features, and semantically enriched low-resolution keys, using Spatial Feature Transform (SFT) modulation. Notably, despite the absence of high-resolution supervision, we demonstrate that learning at low upsampling ratios and resolutions generalizes remarkably well to significantly higher output scales. Extensive experiments show that JAFAR effectively recovers fine-grained spatial details and consistently outperforms existing feature upsampling methods across a diverse set of downstream tasks. Project page at https://jafar-upsampler.github.io

Results

TaskDatasetMetricValueModel
Representation LearningImageNetADCC73.3JAFAR
Representation LearningImageNetAverage Drop17.4JAFAR
Representation LearningImageNetAverage Increase30.9JAFAR

Related Papers

Benchmarking Feature Upsampling Methods for Vision Foundation Models using Interactive Segmentation2025-05-04LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models2025-04-18LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention2024-11-29Lighten CARAFE: Dynamic Lightweight Upsampling with Guided Reassemble Kernels2024-10-29EfficientCD: A New Strategy For Change Detection Based With Bi-temporal Layers Exchanged2024-07-22FADE: A Task-Agnostic Upsampling Operator for Encoder-Decoder Architectures2024-07-18A Refreshed Similarity-based Upsampler for Direct High-Ratio Feature Upsampling2024-07-02LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors2024-03-21