TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Text-Based Person Search with Limited Data

Text-Based Person Search with Limited Data

Xiao Han, Sen He, Li Zhang, Tao Xiang

2021-10-20Cross-Modal RetrievalBenchmarkingDescriptivePerson SearchTransfer LearningContrastive LearningRetrievalText based Person SearchText based Person Retrieval
PaperPDFCode(official)

Abstract

Text-based person search (TBPS) aims at retrieving a target person from an image gallery with a descriptive text query. Solving such a fine-grained cross-modal retrieval task is challenging, which is further hampered by the lack of large-scale datasets. In this paper, we present a framework with two novel components to handle the problems brought by limited data. Firstly, to fully utilize the existing small-scale benchmarking datasets for more discriminative feature learning, we introduce a cross-modal momentum contrastive learning framework to enrich the training data for a given mini-batch. Secondly, we propose to transfer knowledge learned from existing coarse-grained large-scale datasets containing image-text pairs from drastically different problem domains to compensate for the lack of TBPS training data. A transfer learning method is designed so that useful information can be transferred despite the large domain gap. Armed with these components, our method achieves new state of the art on the CUHK-PEDES dataset with significant improvements over the prior art in terms of Rank-1 and mAP. Our code is available at https://github.com/BrandonHanx/TextReID.

Results

TaskDatasetMetricValueModel
Text based Person RetrievalCUHK-PEDESR@164.08TextReID
Text based Person RetrievalCUHK-PEDESR@1088.19TextReID
Text based Person RetrievalCUHK-PEDESR@581.73TextReID
Text based Person RetrievalCUHK-PEDESmAP60.08TextReID

Related Papers

Visual Place Recognition for Large-Scale UAV Applications2025-07-20RaMen: Multi-Strategy Multi-Modal Learning for Bundle Construction2025-07-18Training Transformers with Enforced Lipschitz Constants2025-07-17Disentangling coincident cell events using deep transfer learning and compressive sensing2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17DiffRhythm+: Controllable and Flexible Full-Length Song Generation with Preference Optimization2025-07-17SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts2025-07-17HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals2025-07-17