TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Improving ProtoNet for Few-Shot Video Object Recognition: ...

Improving ProtoNet for Few-Shot Video Object Recognition: Winner of ORBIT Challenge 2022

Li Gu, Zhixiang Chi, Huan Liu, Yuanhao Yu, Yang Wang

2022-10-01Object RecognitionFew-Shot Image Classification
PaperPDFCodeCodeCode(official)Code

Abstract

In this work, we present the winning solution for ORBIT Few-Shot Video Object Recognition Challenge 2022. Built upon the ProtoNet baseline, the performance of our method is improved with three effective techniques. These techniques include the embedding adaptation, the uniform video clip sampler and the invalid frame detection. In addition, we re-factor and re-implement the official codebase to encourage modularity, compatibility and improved performance. Our implementation accelerates the data loading in both training and testing.

Results

TaskDatasetMetricValueModel
Image ClassificationORBIT Clutter Video EvaluationFrame accuracy71.69ProtoNetsVideo
Few-Shot Image ClassificationORBIT Clutter Video EvaluationFrame accuracy71.69ProtoNetsVideo

Related Papers

ViT-ProtoNet for Few-Shot Image Classification: A Multi-Benchmark Evaluation2025-07-12GeoMag: A Vision-Language Model for Pixel-level Fine-Grained Remote Sensing Image Parsing2025-07-08Out-of-distribution detection in 3D applications: a review2025-07-01SASep: Saliency-Aware Structured Separation of Geometry and Feature for Open Set Learning on Point Clouds2025-06-16Continual Hyperbolic Learning of Instances and Classes2025-06-12DCIRNet: Depth Completion with Iterative Refinement for Dexterous Grasping of Transparent and Reflective Objects2025-06-11Aligning Text, Images, and 3D Structure Token-by-Token2025-06-09STSBench: A Spatio-temporal Scenario Benchmark for Multi-modal Large Language Models in Autonomous Driving2025-06-06