TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Neural Code Search Revisited: Enhancing Code Snippet Retri...

Neural Code Search Revisited: Enhancing Code Snippet Retrieval through Natural Language Intent

Geert Heyman, Tom Van Cutsem

2020-08-27Annotated Code SearchTransfer LearningInformation RetrievalCode SearchRetrievalNatural Language Queries
PaperPDFCode(official)

Abstract

In this work, we propose and study annotated code search: the retrieval of code snippets paired with brief descriptions of their intent using natural language queries. On three benchmark datasets, we investigate how code retrieval systems can be improved by leveraging descriptions to better capture the intents of code snippets. Building on recent progress in transfer learning and natural language processing, we create a domain-specific retrieval model for code annotated with a natural language description. We find that our model yields significantly more relevant search results (with absolute gains up to 20.6% in mean reciprocal rank) compared to state-of-the-art code retrieval methods that do not use descriptions but attempt to compute the intent of snippets solely from unannotated code.

Results

TaskDatasetMetricValueModel
Code SearchPACS-CoNaLaMRR0.351Ensemble:USE-tuned+NCS
Code SearchPACS-CoNaLaMRR0.34USE-tuned
Code SearchPACS-CoNaLaMRR0.181USE
Code SearchPACS-CoNaLaMRR0.167NCS
Code SearchPACS-StaQC-pyMRR0.126Ensemble:USE-tuned+NCS
Code SearchPACS-StaQC-pyMRR0.117USE-tuned
Code SearchPACS-StaQC-pyMRR0.104USE
Code SearchPACS-StaQC-pyMRR0.03NCS
Code SearchPACS-SO-DSMRR0.323Ensemble:USE-tuned+NCS
Code SearchPACS-SO-DSMRR0.304USE-tuned
Code SearchPACS-SO-DSMRR0.244USE
Code SearchPACS-SO-DSMRR0.113NCS

Related Papers

RaMen: Multi-Strategy Multi-Modal Learning for Bundle Construction2025-07-18Disentangling coincident cell events using deep transfer learning and compressive sensing2025-07-17Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17From Roots to Rewards: Dynamic Tree Reasoning with RL2025-07-17HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals2025-07-17A Survey of Context Engineering for Large Language Models2025-07-17MCoT-RE: Multi-Faceted Chain-of-Thought and Re-Ranking for Training-Free Zero-Shot Composed Image Retrieval2025-07-17Best Practices for Large-Scale, Pixel-Wise Crop Mapping and Transfer Learning Workflows2025-07-16