TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Image Captioning/ScanRefer Dataset

Image Captioning on ScanRefer Dataset

Metric: CIDEr (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕CIDEr▼Extra DataPaperDate↕Code
13D CoCa85.42No3D CoCa: Contrastive Learners are 3D Captioners2025-04-13Code
2See It All83.14NoSee It All: Contextualized Late Aggregation for ...2024-08-14-
3BiCA80.14NoBi-directional Contextual Attention for 3D Dense...2024-08-13-
4Vote2Cap-DETR++76.36NoVote2Cap-DETR++: Decoupling Localization and Des...2023-09-06Code
5Vote2Cap-DETR71.45NoEnd-to-End 3D Dense Captioning with Vote2Cap-DETR2023-01-06Code
63DJCG60.86No---
7MORE58.89NoMORE: Multi-Order RElation Mining for Dense Capt...2022-03-10Code
8SpaCap3d58.06NoSpatiality-guided Transformer for 3D Dense Capti...2022-04-22Code
9Scan2Cap53.73NoScan2Cap: Context-aware Dense Captioning in RGB-...2020-12-03-
10Contextual50.29NoContextual Modeling for 3D Dense Captioning on P...2022-10-08-
113D-VLP50.02No--Code
12χ-Tran2Cap41.52NoX-Trans2Cap: Cross-Modal Knowledge Transfer usin...2022-03-02Code