Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Image Captioning on ScanRefer Dataset

Metric: ROUGE-L (higher is better)

Results

| # | Model | ROUGE-L | Extra Data | Paper | Date | Code |
|---|-------|---------|------------|-------|------|------|
| 1 | 3D CoCa | 61.98 | No | 3D CoCa: Contrastive Learners are 3D Captioners | 2025-04-13 | Code |
| 2 | Vote2Cap-DETR++ | 60 | No | Vote2Cap-DETR++: Decoupling Localization and Des... | 2023-09-06 | Code |
| 3 | See It All | 59.44 | No | See It All: Contextualized Late Aggregation for ... | 2024-08-14 | - |
| 4 | Vote2Cap-DETR | 59.33 | No | End-to-End 3D Dense Captioning with Vote2Cap-DETR | 2023-01-06 | Code |
| 5 | 3DJCG | 59.02 | No | - | - | - |
| 6 | BiCA | 56.1 | No | Bi-directional Contextual Attention for 3D Dense... | 2024-08-13 | - |
| 7 | MORE | 55.41 | No | MORE: Multi-Order RElation Mining for Dense Capt... | 2022-03-10 | Code |
| 8 | SpaCap3d | 55.03 | No | Spatiality-guided Transformer for 3D Dense Capti... | 2022-04-22 | Code |
| 9 | Scan2Cap | 54.95 | No | Scan2Cap: Context-aware Dense Captioning in RGB-... | 2020-12-03 | - |
| 10 | 3D-VLP | 51.17 | No | - | - | Code |
| 11 | χ-Tran2Cap | 44.97 | No | X-Trans2Cap: Cross-Modal Knowledge Transfer usin... | 2022-03-02 | Code |
| 12 | Contextual | 44.71 | No | Contextual Modeling for 3D Dense Captioning on P... | 2022-10-08 | - |