TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Image Captioning/ScanRefer Dataset

Image Captioning on ScanRefer Dataset

Metric: BLEU-4 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕BLEU-4▼Extra DataPaperDate↕Code
13D CoCa45.56No3D CoCa: Contrastive Learners are 3D Captioners2025-04-13Code
2See It All42.17NoSee It All: Contextualized Late Aggregation for ...2024-08-14-
3Vote2Cap-DETR++41.37NoVote2Cap-DETR++: Decoupling Localization and Des...2023-09-06Code
4BiCA40.16NoBi-directional Contextual Attention for 3D Dense...2024-08-13-
53DJCG39.67No---
6Vote2Cap-DETR39.34NoEnd-to-End 3D Dense Captioning with Vote2Cap-DETR2023-01-06Code
7MORE35.41NoMORE: Multi-Order RElation Mining for Dense Capt...2022-03-10Code
8SpaCap3d35.3NoSpatiality-guided Transformer for 3D Dense Capti...2022-04-22Code
9Scan2Cap34.25NoScan2Cap: Context-aware Dense Captioning in RGB-...2020-12-03-
103D-VLP31.87No--Code
11Contextual26.64NoContextual Modeling for 3D Dense Captioning on P...2022-10-08-
12χ-Tran2Cap23.83NoX-Trans2Cap: Cross-Modal Knowledge Transfer usin...2022-03-02Code