Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Open Vocabulary Object Detection on LVIS v1.0

Metric: AP novel-LVIS base training (higher is better)


Results

| # | Model | AP novel-LVIS base training | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | LaMI-DETR | 43.4 | No | LaMI-DETR: Open-Vocabulary Detection with Langua... | 2024-07-16 | Code |
| 2 | DITO | 40.4 | No | Region-centric Image-Language Pretraining for Op... | 2023-09-29 | Code |
| 3 | OV-DQUO (ViT-L/14) | 39.3 | No | OV-DQUO: Open-Vocabulary DETR with Denoising Tex... | 2024-05-28 | Code |
| 4 | CoDet (EVA02-L) | 37 | Yes | CoDet: Co-Occurrence Guided Region-Word Alignmen... | 2023-10-25 | Code |
| 5 | CLIPSelf | 34.9 | No | CLIPSelf: Vision Transformer Distills Itself for... | 2023-10-02 | Code |
| 6 | OVMR | 34.4 | Yes | OVMR: Open-Vocabulary Recognition with Multi-Mod... | 2024-06-07 | Code |
| 7 | DE-ViT | 34.3 | No | Detect Everything with Few Examples | 2023-09-22 | Code |
| 8 | CFM-ViT | 33.9 | No | Contrastive Feature Masking Open-Vocabulary Visi... | 2023-09-02 | - |
| 9 | CLIM (RN50x64) | 32.3 | No | CLIM: Contrastive Language-Image Mosaic for Regi... | 2023-12-18 | Code |
| 10 | RO-ViT | 32.1 | No | Region-Aware Pretraining for Open-Vocabulary Obj... | 2023-05-11 | Code |
| 11 | Prova (Swin-Base) | 31.5 | Yes | Comprehensive Multi-Modal Prototypes are Simple ... | 2024-12-23 | Code |
| 12 | RTGen | 30.2 | Yes | RTGen: Generating Region-Text Pairs for Open-Voc... | 2024-05-30 | Code |
| 13 | OV-DQUO (ViT-B/16) | 29.7 | No | OV-DQUO: Open-Vocabulary DETR with Denoising Tex... | 2024-05-28 | Code |
| 14 | ViLD-ensemble w/ ALIGN (Eb7-FPN) | 26.3 | No | Open-vocabulary Object Detection via Vision and ... | 2021-04-28 | Code |
| 15 | OWL-ViT (CLIP-L/14) | 25.6 | Yes | Simple Open-Vocabulary Object Detection with Vis... | 2022-05-12 | Code |
| 16 | POMP | 25.2 | No | Prompt Pre-Training with Twenty-Thousand Classes... | 2023-04-10 | Code |
| 17 | BARON | 22.6 | No | Aligning Bag of Regions for Open-Vocabulary Obje... | 2023-02-27 | Code |
| 18 | MEDet | 22.4 | No | Open Vocabulary Object Detection with Proposal M... | 2022-06-22 | Code |
| 19 | Region-CLIP (RN50x4-C4) | 22 | No | RegionCLIP: Region-based Language-Image Pretrain... | 2021-12-16 | Code |
| 20 | RALF | 21.9 | Yes | Retrieval-Augmented Open-Vocabulary Object Detec... | 2024-04-08 | Code |
| 21 | OADP | 21.7 | No | Object-Aware Distillation Pyramid for Open-Vocab... | 2023-03-10 | Code |
| 22 | X-Paste | 21.4 | No | X-Paste: Revisiting Scalable Copy-Paste for Inst... | 2022-12-07 | Code |
| 23 | Object-Centric-OVD | 21.1 | Yes | Bridging the Gap between Object and Image-level ... | 2022-07-07 | Code |
| 24 | ViLD-ensemble (R152-FPN) | 18.7 | No | Open-vocabulary Object Detection via Vision and ... | 2021-04-28 | Code |
| 25 | Detic | 17.8 | Yes | Detecting Twenty-thousand Classes using Image-le... | 2022-01-07 | Code |
| 26 | Region-CLIP (RN50-C4) | 17.1 | No | RegionCLIP: Region-based Language-Image Pretrain... | 2021-12-16 | Code |
| 27 | ViLD-ensemble (R50-FPN) | 16.6 | No | Open-vocabulary Object Detection via Vision and ... | 2021-04-28 | Code |
| 28 | ViLD (R50-FPN) | 16.1 | No | Open-vocabulary Object Detection via Vision and ... | 2021-04-28 | Code |