Person Retrieval in Surveillance Video using Height, Color and Gender

Hiren Galiyawala, Kenil Shah, Vandit Gajjar, Mehul S. Raval

2018-09-242018 15th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS) 2019 2Person Retrieval Retrieval

Paper PDF Code(official)

Abstract

A person is commonly described by attributes like height, build, cloth color, cloth type, and gender. Such attributes are known as soft biometrics. They bridge the semantic gap between human description and person retrieval in surveillance video. The paper proposes a deep learning-based linear filtering approach for person retrieval using height, cloth color, and gender. The proposed approach uses Mask R-CNN for pixel-wise person segmentation. It removes background clutter and provides precise boundary around the person. Color and gender models are fine-tuned using AlexNet and the algorithm is tested on SoftBioSearch dataset. It achieves good accuracy for person retrieval using the semantic query in challenging conditions.

Results

Task	Dataset	Metric	Value	Model
Person Retrieval	SoftBioSearch	Average IOU	0.503	SSD
Person Retrieval	SoftBioSearch	Average IOU	0.363	Mask R-CNN and AlexNet
Person Retrieval	SoftBioSearch	Average IOU	0.29	Baseline - AvatarSearch

Related Papers

From Roots to Rewards: Dynamic Tree Reasoning with RL2025-07-17 HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals2025-07-17 A Survey of Context Engineering for Large Language Models2025-07-17 MCoT-RE: Multi-Faceted Chain-of-Thought and Re-Ranking for Training-Free Zero-Shot Composed Image Retrieval2025-07-17 Developing Visual Augmented Q&A System using Scalable Vision Embedding Retrieval & Late Interaction Re-ranker2025-07-16 Language-Guided Contrastive Audio-Visual Masked Autoencoder with Automatically Generated Audio-Visual-Text Triplets from Videos2025-07-16 Context-Aware Search and Retrieval Over Erasure Channels2025-07-16 Seq vs Seq: An Open Suite of Paired Encoders and Decoders2025-07-15