
Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


From Data Deluge to Data Curation: A Filtering-WoRA Paradigm for Efficient Text-based Person Search

Jintao Sun, Hao Fei, Zhedong Zheng, Gangyi Ding

2024-04-16 | Person Search, Text based Person Search, Text based Person Retrieval

Abstract

In text-based person search, data generation has become a prevailing practice because it addresses privacy concerns and avoids the arduous task of manual annotation. Although the amount of synthesized data can in theory be infinite, an open question remains: how much generated data optimally fuels subsequent model training? We observe that only a subset of the data in these constructed datasets plays a decisive role. We therefore introduce a new Filtering-WoRA paradigm, which combines a filtering algorithm that identifies this crucial data subset with a WoRA (Weighted Low-Rank Adaptation) learning strategy for light fine-tuning. The filtering algorithm uses cross-modality relevance to remove the many coarsely matched synthetic pairs. As the amount of data decreases, fine-tuning the entire model is no longer necessary, so we propose the WoRA learning strategy to efficiently update a minimal portion of the model parameters. WoRA streamlines the learning process, making it more efficient to extract knowledge from fewer yet potent data instances. Extensive experiments validate the efficacy of pretraining, with our model achieving advanced and efficient retrieval performance on challenging real-world benchmarks. Notably, on the CUHK-PEDES dataset, we achieve a competitive mAP of 67.02% while reducing model training time by 19.82%.
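
The two ideas in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives neither the relevance score nor the WoRA formulation, so everything below is an assumption made for illustration. The sketch assumes cosine similarity between precomputed image and text embeddings as the cross-modality relevance, a fixed threshold for filtering, and a generic low-rank update with two scalar weights (alpha on the frozen weight, beta on the low-rank term). The names filter_pairs, adapted_weight, alpha, and beta are hypothetical.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]
Matrix = List[List[float]]


def cosine(u: Vector, v: Vector) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm > 0 else 0.0


def filter_pairs(image_embs: Sequence[Vector], text_embs: Sequence[Vector],
                 threshold: float) -> List[int]:
    """Return indices of image-text pairs whose similarity is at least `threshold`.

    Assumption: the relevance score is cosine similarity of precomputed
    embeddings; the paper's actual scoring model may differ.
    """
    return [i for i, (img, txt) in enumerate(zip(image_embs, text_embs))
            if cosine(img, txt) >= threshold]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain matrix product of a (n x r) and b (r x m)."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def adapted_weight(w0: Matrix, b: Matrix, a: Matrix,
                   alpha: float = 1.0, beta: float = 1.0) -> Matrix:
    """Compute alpha * W0 + beta * (B @ A).

    W0 (d_out x d_in) stays frozen; only B (d_out x r), A (r x d_in) and the
    two scalars would be trained. This is a generic weighted low-rank update
    chosen for illustration, not necessarily the WoRA formulation.
    """
    delta = matmul(b, a)
    return [[alpha * w + beta * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(w0, delta)]


# Toy data: pair 1 is a poor match and falls below the threshold.
images = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
texts = [[0.9, 0.1], [1.0, 0.0], [1.0, 1.0]]
kept = filter_pairs(images, texts, threshold=0.8)

# Rank-1 update of a 2 x 2 identity weight.
w_new = adapted_weight([[1.0, 0.0], [0.0, 1.0]], [[1.0], [2.0]], [[0.5, 0.0]],
                       alpha=1.0, beta=0.5)
```

With rank r, the trainable parameters in this sketch number r * (d_in + d_out) + 2 rather than d_in * d_out, which is the sense in which only a small portion of the model is updated.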

Results

Task                         Dataset     Metric  Value  Model
Text based Person Retrieval  CUHK-PEDES  R@1     76.38  WoRA
Text based Person Retrieval  CUHK-PEDES  R@5     89.72  WoRA
Text based Person Retrieval  CUHK-PEDES  R@10    93.49  WoRA
Text based Person Retrieval  CUHK-PEDES  mAP     67.22  WoRA
Text based Person Retrieval  ICFG-PEDES  R@1     68.35  Filtering-WoRA(Small)
Text based Person Retrieval  ICFG-PEDES  R@5     83.1   Filtering-WoRA(Small)
Text based Person Retrieval  ICFG-PEDES  R@10    87.53  Filtering-WoRA(Small)
Text based Person Retrieval  ICFG-PEDES  mAP     42.6   Filtering-WoRA(Small)
Text based Person Retrieval  RSTPReid    R@1     66.85  Filtering-WoRA(Small)
Text based Person Retrieval  RSTPReid    R@5     86.45  Filtering-WoRA(Small)
Text based Person Retrieval  RSTPReid    R@10    91.1   Filtering-WoRA(Small)
Text based Person Retrieval  RSTPReid    mAP     52.49  Filtering-WoRA(Small)
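
For readers unfamiliar with the metrics in the table, the sketch below shows how R@K and mAP are commonly computed for retrieval. It assumes each text query yields a ranked gallery, represented as a list of booleans marking which ranked images show the queried person. The exact evaluation protocol of these benchmarks is not described on this page, so treat the definitions as the standard ones, not as a reproduction of the reported numbers. The function names are illustrative.

```python
from typing import Sequence


def recall_at_k(ranked_hits: Sequence[Sequence[bool]], k: int) -> float:
    """Fraction of queries with at least one correct gallery item in the top k."""
    return sum(any(hits[:k]) for hits in ranked_hits) / len(ranked_hits)


def average_precision(hits: Sequence[bool]) -> float:
    """Mean of the precision values at each rank where a correct item appears."""
    found = 0
    total = 0.0
    for rank, hit in enumerate(hits, start=1):
        if hit:
            found += 1
            total += found / rank
    return total / found if found else 0.0


def mean_average_precision(ranked_hits: Sequence[Sequence[bool]]) -> float:
    """Average precision averaged over all queries."""
    return sum(average_precision(h) for h in ranked_hits) / len(ranked_hits)


# Two toy queries: True marks a gallery image of the correct person.
rankings = [
    [True, False, True],   # correct matches at ranks 1 and 3
    [False, True, False],  # correct match at rank 2
]
r1 = recall_at_k(rankings, 1)
r2 = recall_at_k(rankings, 2)
m_ap = mean_average_precision(rankings)
```

In this toy case R@1 is 0.5, R@2 is 1.0, and mAP is 2/3. Benchmark tables such as the one above report these values multiplied by 100.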

Related Papers

SA-Person: Text-Based Person Retrieval with Scene-aware Re-ranking (2025-05-30)
Dynamic Uncertainty Learning with Noisy Correspondence for Text-Based Person Search (2025-05-10)
Uncertainty-Aware Prototype Semantic Decoupling for Text-Based Person Search in Full Images (2025-05-06)
CAMeL: Cross-modality Adaptive Meta-Learning for Text-based Person Retrieval (2025-04-26)
UP-Person: Unified Parameter-Efficient Transfer Learning for Text-based Person Retrieval (2025-04-14)
An Empirical Study of Validating Synthetic Data for Text-Based Person Retrieval (2025-03-28)
SeCap: Self-Calibrating and Adaptive Prompts for Cross-view Person Re-Identification in Aerial-Ground Networks (2025-03-10)
Boosting Weak Positives for Text Based Person Search (2025-01-29)