TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Assemble Foundation Models for Automatic Code Summarization

Assemble Foundation Models for Automatic Code Summarization

Jian Gu, Pasquale Salza, Harald C. Gall

2022-01-13Code Documentation GenerationSequence-to-sequence Language ModelingTransfer LearningCode SummarizationSource Code Summarization
PaperPDFCode(official)

Abstract

Automatic code summarization is beneficial to daily software development since it could help reduce the requirement of manual writing. Currently, artificial intelligence is undergoing a paradigm shift. The foundation models pretrained on massive data and finetuned to downstream tasks surpass specially customized models. This trend inspired us to consider reusing foundation models instead of learning from scratch. Thereby, we propose a flexible and robust approach for automatic code summarization, based on neural models. We assemble available foundation models, such as CodeBERT and GPT-2, into a single neural model named AdaMo. Moreover, we utilize Gaussian noise as the simulation of contextual information to optimize the latent representation. Furthermore, we introduce two adaptive schemes from the perspective of knowledge transfer, namely continuous pretraining and intermediate finetuning, and design intermediate stage tasks for general sequence-to-sequence learning. Finally, we evaluate AdaMo against a benchmark dataset for code summarization, by comparing it with state-of-the-art models.

Results

TaskDatasetMetricValueModel
Source Code SummarizationDeepCom-JavaBLEU-445.35AdaMo-noise
Source Code SummarizationDeepCom-JavaBLEU-445.3AdaMo-basic
Source Code SummarizationJava scriptsBLEU-437.64AdaMo-basic
Source Code SummarizationParallelCorpus-PythonBLEU-434.05AdaMo-noise
Source Code SummarizationParallelCorpus-PythonBLEU-433.85AdaMo-basic
Source Code SummarizationCodeSearchNet - PythonBLEU-416.46AdaMo-basic

Related Papers

RaMen: Multi-Strategy Multi-Modal Learning for Bundle Construction2025-07-18Disentangling coincident cell events using deep transfer learning and compressive sensing2025-07-17Best Practices for Large-Scale, Pixel-Wise Crop Mapping and Transfer Learning Workflows2025-07-16Robust-Multi-Task Gradient Boosting2025-07-15Calibrated and Robust Foundation Models for Vision-Language and Medical Image Tasks Under Distribution Shift2025-07-12The Bayesian Approach to Continual Learning: An Overview2025-07-11Contrastive and Transfer Learning for Effective Audio Fingerprinting through a Real-World Evaluation Protocol2025-07-08Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving2025-07-08