Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Image Captioning on Nr3D

Metric: CIDEr (higher is better)

Results

# | Model | CIDEr | Extra Data | Paper | Date | Code
1 | 3D CoCa | 52.84 | No | 3D CoCa: Contrastive Learners are 3D Captioners | 2025-04-13 | Code
2 | BiCA | 48.77 | No | Bi-directional Contextual Attention for 3D Dense... | 2024-08-13 | -
3 | Vote2Cap-DETR++ | 47.08 | No | Vote2Cap-DETR++: Decoupling Localization and Des... | 2023-09-06 | Code
4 | Vote2Cap-DETR | 43.84 | No | End-to-End 3D Dense Captioning with Vote2Cap-DETR | 2023-01-06 | Code
5 | 3DJCG | 38.06 | No | - | - | -
6 | Contextual | 35.26 | No | Contextual Modeling for 3D Dense Captioning on P... | 2022-10-08 | -
7 | REMAN | 34.81 | No | - | - | -
8 | D3Net | 33.85 | No | D3Net: A Unified Speaker-Listener Architecture f... | 2021-12-02 | -
9 | SpaCap3d | 33.71 | No | Spatiality-guided Transformer for 3D Dense Capti... | 2022-04-22 | Code
10 | Scan2Cap | 27.47 | No | Scan2Cap: Context-aware Dense Captioning in RGB-... | 2020-12-03 | -