TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Learning Semantic-Aligned Feature Representation for Text-...

Learning Semantic-Aligned Feature Representation for Text-based Person Search

Shiping Li, Min Cao, Min Zhang

2021-12-13Person SearchText based Person SearchText based Person Retrieval
PaperPDFCode(official)

Abstract

Text-based person search aims to retrieve images of a certain pedestrian by a textual description. The key challenge of this task is to eliminate the inter-modality gap and achieve the feature alignment across modalities. In this paper, we propose a semantic-aligned embedding method for text-based person search, in which the feature alignment across modalities is achieved by automatically learning the semantic-aligned visual features and textual features. First, we introduce two Transformer-based backbones to encode robust feature representations of the images and texts. Second, we design a semantic-aligned feature aggregation network to adaptively select and aggregate features with the same semantics into part-aware features, which is achieved by a multi-head attention module constrained by a cross-modality part alignment loss and a diversity loss. Experimental results on the CUHK-PEDES and Flickr30K datasets show that our method achieves state-of-the-art performances.

Results

TaskDatasetMetricValueModel
Text based Person RetrievalCUHK-PEDESR@164.13SAF
Text based Person RetrievalCUHK-PEDESR@1088.4SAF
Text based Person RetrievalCUHK-PEDESR@582.62SAF

Related Papers

SA-Person: Text-Based Person Retrieval with Scene-aware Re-ranking2025-05-30Dynamic Uncertainty Learning with Noisy Correspondence for Text-Based Person Search2025-05-10Uncertainty-Aware Prototype Semantic Decoupling for Text-Based Person Search in Full Images2025-05-06CAMeL: Cross-modality Adaptive Meta-Learning for Text-based Person Retrieval2025-04-26UP-Person: Unified Parameter-Efficient Transfer Learning for Text-based Person Retrieval2025-04-14An Empirical Study of Validating Synthetic Data for Text-Based Person Retrieval2025-03-28SeCap: Self-Calibrating and Adaptive Prompts for Cross-view Person Re-Identification in Aerial-Ground Networks2025-03-10Boosting Weak Positives for Text Based Person Search2025-01-29