Improving ProtoNet for Few-Shot Video Object Recognition: Winner of ORBIT Challenge 2022

Li Gu, Zhixiang Chi, Huan Liu, Yuanhao Yu, Yang Wang

2022-10-01Object Recognition Few-Shot Image Classification

Abstract

In this work, we present the winning solution for ORBIT Few-Shot Video Object Recognition Challenge 2022. Built upon the ProtoNet baseline, the performance of our method is improved with three effective techniques. These techniques include the embedding adaptation, the uniform video clip sampler and the invalid frame detection. In addition, we re-factor and re-implement the official codebase to encourage modularity, compatibility and improved performance. Our implementation accelerates the data loading in both training and testing.

Results

Task	Dataset	Metric	Value	Model
Image Classification	ORBIT Clutter Video Evaluation	Frame accuracy	71.69	ProtoNetsVideo
Few-Shot Image Classification	ORBIT Clutter Video Evaluation	Frame accuracy	71.69	ProtoNetsVideo

Related Papers

ViT-ProtoNet for Few-Shot Image Classification: A Multi-Benchmark Evaluation2025-07-12 GeoMag: A Vision-Language Model for Pixel-level Fine-Grained Remote Sensing Image Parsing2025-07-08 Out-of-distribution detection in 3D applications: a review2025-07-01 SASep: Saliency-Aware Structured Separation of Geometry and Feature for Open Set Learning on Point Clouds2025-06-16 Continual Hyperbolic Learning of Instances and Classes2025-06-12 DCIRNet: Depth Completion with Iterative Refinement for Dexterous Grasping of Transparent and Reflective Objects2025-06-11 Aligning Text, Images, and 3D Structure Token-by-Token2025-06-09 STSBench: A Spatio-temporal Scenario Benchmark for Multi-modal Large Language Models in Autonomous Driving2025-06-06