To Boost or Not to Boost? On the Limits of Boosted Trees for Object Detection

Eshed Ohn-Bar, Mohan M. Trivedi

2017-01-06Pedestrian Detection object-detection Object Detection Face Detection

Abstract

We aim to study the modeling limitations of the commonly employed boosted decision trees classifier. Inspired by the success of large, data-hungry visual recognition models (e.g. deep convolutional neural networks), this paper focuses on the relationship between modeling capacity of the weak learners, dataset size, and dataset properties. A set of novel experiments on the Caltech Pedestrian Detection benchmark results in the best known performance among non-CNN techniques while operating at fast run-time speed. Furthermore, the performance is on par with deep architectures (9.71% log-average miss rate), while using only HOG+LUV channels as features. The conclusions from this study are shown to generalize over different object detection domains as demonstrated on the FDDB face detection benchmark (93.37% accuracy). Despite the impressive performance, this study reveals the limited modeling capacity of the common boosted trees model, motivating a need for architectural changes in order to compete with multi-level and very deep architectures.

Results

Task	Dataset	Metric	Value	Model
Facial Recognition and Modelling	WIDER Face (Medium)	AP	0.772	LDCF+
Facial Recognition and Modelling	WIDER Face (Hard)	AP	0.564	LDCF+
Face Detection	WIDER Face (Medium)	AP	0.772	LDCF+
Face Detection	WIDER Face (Hard)	AP	0.564	LDCF+
Face Reconstruction	WIDER Face (Medium)	AP	0.772	LDCF+
Face Reconstruction	WIDER Face (Hard)	AP	0.564	LDCF+
3D	WIDER Face (Medium)	AP	0.772	LDCF+
3D	WIDER Face (Hard)	AP	0.564	LDCF+
3D Face Modelling	WIDER Face (Medium)	AP	0.772	LDCF+
3D Face Modelling	WIDER Face (Hard)	AP	0.564	LDCF+
3D Face Reconstruction	WIDER Face (Medium)	AP	0.772	LDCF+
3D Face Reconstruction	WIDER Face (Hard)	AP	0.564	LDCF+

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17 RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17 Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17 Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17 Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16 Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15 ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08 YOLO-APD: Enhancing YOLOv8 for Robust Pedestrian Detection on Complex Road Geometries2025-07-07