LOGO-CAP (Ours) HRNet-W48
Reported on 6 benchmarks across 3 tasks
Note: results are matched by exact model name. Because different papers may use the same name for different model variants, some results listed here may belong to a different variant.
Computer Vision (2 results)
- 72.2
- 70.8 (best: 81.1, ViTPose (ViTAE-G, ensemble))
Methodology (2 results)
- AP: 72.2
- AP: 70.8 (best: 81.1, ViTPose (ViTAE-G, ensemble))
Audio (2 results)
- 70.8 (best: 81.1, ViTPose (ViTAE-G, ensemble))