Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Object-Centric-OVD

Reported on 29 benchmarks across 6 tasks · 1 paper · 17 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Methodology: 20 results

3D on Objects365
mask AP50 · 2022-07-07
22.3
SOTA
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
3D on OpenImages-v4
mask AP50 · 2022-07-07
42.9
SOTA
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
3D on MSCOCO
AP · 2022-07-07
40.5
best: 58.8 (ScaleDet)
SOTA
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
2D Classification on Objects365
mask AP50 · 2022-07-07
22.3
SOTA
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
2D Classification on OpenImages-v4
mask AP50 · 2022-07-07
42.9
SOTA
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
2D Classification on MSCOCO
AP · 2022-07-07
40.5
best: 58.8 (ScaleDet)
SOTA
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
2D Object Detection on Objects365
mask AP50 · 2022-07-07
22.3
SOTA
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
2D Object Detection on OpenImages-v4
mask AP50 · 2022-07-07
42.9
SOTA
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
2D Object Detection on MSCOCO
AP · 2022-07-07
40.5
best: 58.8 (ScaleDet)
SOTA
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
16k on Objects365
mask AP50 · 2022-07-07
22.3
SOTA
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
16k on OpenImages-v4
mask AP50 · 2022-07-07
42.9
SOTA
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
16k on MSCOCO
AP · 2022-07-07
40.5
best: 58.8 (ScaleDet)
SOTA
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
3D on LVIS v1.0
AP novel-LVIS base training · uses extra data · 2022-07-07
21.1
best: 43.4 (LaMI-DETR)
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
3D on MSCOCO
AP 0.5 · 2022-07-07
36.9
best: 50.3 (Cooperative Foundational Models)
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
2D Classification on LVIS v1.0
AP novel-LVIS base training · uses extra data · 2022-07-07
21.1
best: 43.4 (LaMI-DETR)
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
2D Classification on MSCOCO
AP 0.5 · 2022-07-07
36.9
best: 50.3 (Cooperative Foundational Models)
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
2D Object Detection on LVIS v1.0
AP novel-LVIS base training · uses extra data · 2022-07-07
21.1
best: 43.4 (LaMI-DETR)
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
2D Object Detection on MSCOCO
AP 0.5 · 2022-07-07
36.9
best: 50.3 (Cooperative Foundational Models)
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
16k on LVIS v1.0
AP novel-LVIS base training · uses extra data · 2022-07-07
21.1
best: 43.4 (LaMI-DETR)
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
16k on MSCOCO
AP 0.5 · 2022-07-07
36.9
best: 50.3 (Cooperative Foundational Models)
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482

Computer Vision: 9 results

Object Detection on Objects365
mask AP50 · 2022-07-07
22.3
SOTA
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
Object Detection on OpenImages-v4
mask AP50 · 2022-07-07
42.9
SOTA
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
Object Detection on MSCOCO
AP · 2022-07-07
40.5
best: 58.8 (ScaleDet)
SOTA
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
Open Vocabulary Object Detection on Objects365
mask AP50 · 2022-07-07
22.3
SOTA
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
Open Vocabulary Object Detection on OpenImages-v4
mask AP50 · 2022-07-07
42.9
SOTA
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
Object Detection on LVIS v1.0
AP novel-LVIS base training · uses extra data · 2022-07-07
21.1
best: 43.4 (LaMI-DETR)
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
Object Detection on MSCOCO
AP 0.5 · 2022-07-07
36.9
best: 50.3 (Cooperative Foundational Models)
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
Open Vocabulary Object Detection on LVIS v1.0
AP novel-LVIS base training · uses extra data · 2022-07-07
21.1
best: 43.4 (LaMI-DETR)
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482
Open Vocabulary Object Detection on MSCOCO
AP 0.5 · 2022-07-07
36.9
best: 50.3 (Cooperative Foundational Models)
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection arXiv:2207.03482