Description
A FuseFormer block is used in the FuseFormer model for video inpainting. It is the same to standard Transformer block except that feed forward network is replaced with a Fusion Feed Forward Network (F3N). F3N brings no extra parameter into the standard feed forward net and the difference is that F3N inserts a soft-split and a soft composite operation between the two layer of MLPs.