Spatial Token Mixer
0 benchmarks0 papers
Spatial Token Mixer (STM) is a module for vision transformers that aims to improve the efficiency of token mixing. STM is a type of depthwise convolution that operates on the spatial dimension of the tokens. STM is a drop-in replacement for the token mixing layers in vision transformers.
Benchmarks
No benchmarks available for this task.