Transformer local-attention (NesT-B)
Reported on 3 benchmarks across 1 task · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Computer Vision3 results
- Percentage correct· 2021-05-2697.2best: 99.5 (ViT-H/14)
- Percentage correct· uses extra data· 2021-05-2682.56best: 96.08 (EffNet-L2 (SAM))
- GFLOPs· 2021-05-2617.9best: 1478 (InternImage-H)