Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SpaceLLaVA

Reported on 10 benchmarks across 2 tasks · 1 paper · 4 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing · 10 results

  • Visual Question Answering (VQA) on 6-DoF SpatialBench
    Orientation-rel · 2024-01-22
    30.9
    best: 54.6 (SoFar)
    SOTA
    SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities (arXiv:2401.12168)
  • Visual Question Answering (VQA) on 6-DoF SpatialBench
    Position-abs · 2024-01-22
    30.5
    best: 33.8 (SoFar)
    SOTA
    SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities (arXiv:2401.12168)
  • Visual Question Answering on 6-DoF SpatialBench
    Orientation-rel · 2024-01-22
    30.9
    best: 54.6 (SoFar)
    SOTA
    SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities (arXiv:2401.12168)
  • Visual Question Answering on 6-DoF SpatialBench
    Position-abs · 2024-01-22
    30.5
    best: 33.8 (SoFar)
    SOTA
    SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities (arXiv:2401.12168)
  • Visual Question Answering (VQA) on 6-DoF SpatialBench
    Orientation-abs · 2024-01-22
    24.9
    best: 31.3 (SoFar)
    SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities (arXiv:2401.12168)
  • Visual Question Answering (VQA) on 6-DoF SpatialBench
    Position-rel · 2024-01-22
    32.4
    best: 59.6 (SoFar)
    SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities (arXiv:2401.12168)
  • Visual Question Answering (VQA) on 6-DoF SpatialBench
    Total · 2024-01-22
    28.2
    best: 43.9 (SoFar)
    SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities (arXiv:2401.12168)
  • Visual Question Answering on 6-DoF SpatialBench
    Orientation-abs · 2024-01-22
    24.9
    best: 31.3 (SoFar)
    SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities (arXiv:2401.12168)
  • Visual Question Answering on 6-DoF SpatialBench
    Position-rel · 2024-01-22
    32.4
    best: 59.6 (SoFar)
    SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities (arXiv:2401.12168)
  • Visual Question Answering on 6-DoF SpatialBench
    Total · 2024-01-22
    28.2
    best: 43.9 (SoFar)
    SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities (arXiv:2401.12168)