TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Beyond Part Models: Person Retrieval with Refined Part Poo...

Beyond Part Models: Person Retrieval with Refined Part Pooling (and a Strong Convolutional Baseline)

Yifan Sun, Liang Zheng, Yi Yang, Qi Tian, Shengjin Wang

2017-11-26ECCV 2018 9Person RetrievalPerson Re-IdentificationRetrieval
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

Employing part-level features for pedestrian image description offers fine-grained information and has been verified as beneficial for person retrieval in very recent literature. A prerequisite of part discovery is that each part should be well located. Instead of using external cues, e.g., pose estimation, to directly locate parts, this paper lays emphasis on the content consistency within each part. Specifically, we target at learning discriminative part-informed features for person retrieval and make two contributions. (i) A network named Part-based Convolutional Baseline (PCB). Given an image input, it outputs a convolutional descriptor consisting of several part-level features. With a uniform partition strategy, PCB achieves competitive results with the state-of-the-art methods, proving itself as a strong convolutional baseline for person retrieval. (ii) A refined part pooling (RPP) method. Uniform partition inevitably incurs outliers in each part, which are in fact more similar to other parts. RPP re-assigns these outliers to the parts they are closest to, resulting in refined parts with enhanced within-part consistency. Experiment confirms that RPP allows PCB to gain another round of performance boost. For instance, on the Market-1501 dataset, we achieve (77.4+4.2)% mAP and (92.3+1.5)% rank-1 accuracy, surpassing the state of the art by a large margin.

Results

TaskDatasetMetricValueModel
Person Re-IdentificationMarket-1501-C Rank-134.93PCB
Person Re-IdentificationMarket-1501-C mAP12.72PCB
Person Re-IdentificationMarket-1501-C mINP0.41PCB
Person Re-IdentificationMarket-1501Rank-193.8PCB + RPP
Person Re-IdentificationMarket-1501mAP81.6PCB + RPP
Person Re-IdentificationMarket-1501Rank-192.3PCB
Person Re-IdentificationMarket-1501mAP77.4PCB
Person Re-IdentificationUAV-Human Rank-162.19PCB
Person Re-IdentificationUAV-Human Rank-583.9PCB
Person Re-IdentificationUAV-HumanmAP61.05PCB
Person Re-IdentificationDukeMTMC-reIDRank-183.3PCB (RPP)
Person Re-IdentificationDukeMTMC-reIDmAP69.2PCB (RPP)
Person Re-IdentificationDukeMTMC-reIDRank-181.8PCB (UP)
Person Re-IdentificationDukeMTMC-reIDmAP66.1PCB (UP)

Related Papers

Weakly Supervised Visible-Infrared Person Re-Identification via Heterogeneous Expert Collaborative Consistency Learning2025-07-17WhoFi: Deep Person Re-Identification via Wi-Fi Channel Signal Encoding2025-07-17From Roots to Rewards: Dynamic Tree Reasoning with RL2025-07-17HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals2025-07-17A Survey of Context Engineering for Large Language Models2025-07-17MCoT-RE: Multi-Faceted Chain-of-Thought and Re-Ranking for Training-Free Zero-Shot Composed Image Retrieval2025-07-17Developing Visual Augmented Q&A System using Scalable Vision Embedding Retrieval & Late Interaction Re-ranker2025-07-16Language-Guided Contrastive Audio-Visual Masked Autoencoder with Automatically Generated Audio-Visual-Text Triplets from Videos2025-07-16