TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Semantics-aware BERT for Language Understanding

Semantics-aware BERT for Language Understanding

Zhuosheng Zhang, Yuwei Wu, Hai Zhao, Zuchao Li, Shuailiang Zhang, Xi Zhou, Xiang Zhou

2019-09-05Reading ComprehensionQuestion AnsweringNatural Language InferenceNatural Language UnderstandingWord EmbeddingsSemantic Role LabelingMachine Reading ComprehensionLanguage Modelling
PaperPDFCode(official)

Abstract

The latest work on language representations carefully integrates contextualized features into language model training, which enables a series of success especially in various machine reading comprehension and natural language inference tasks. However, the existing language representation models including ELMo, GPT and BERT only exploit plain context-sensitive features such as character or word embeddings. They rarely consider incorporating structured semantic information which can provide rich semantics for language representation. To promote natural language understanding, we propose to incorporate explicit contextual semantics from pre-trained semantic role labeling, and introduce an improved language representation model, Semantics-aware BERT (SemBERT), which is capable of explicitly absorbing contextual semantics over a BERT backbone. SemBERT keeps the convenient usability of its BERT precursor in a light fine-tuning way without substantial task-specific modifications. Compared with BERT, semantics-aware BERT is as simple in concept but more powerful. It obtains new state-of-the-art or substantially improves results on ten reading comprehension and language inference tasks.

Results

TaskDatasetMetricValueModel
Question AnsweringSQuAD2.0 devEM80.9SemBERT large
Question AnsweringSQuAD2.0 devF183.6SemBERT large
Question AnsweringSQuAD2.0EM86.166SemBERT(ensemble)
Question AnsweringSQuAD2.0F188.886SemBERT(ensemble)
Question AnsweringSQuAD2.0EM86.166SemBERT(ensemble)
Question AnsweringSQuAD2.0F188.886SemBERT(ensemble)
Question AnsweringSQuAD2.0EM86.166SemBERT (ensemble)
Question AnsweringSQuAD2.0F188.886SemBERT (ensemble)
Question AnsweringSQuAD2.0EM84.8SemBERT (single model)
Question AnsweringSQuAD2.0F187.864SemBERT (single model)
Question AnsweringSQuAD2.0EM84.8SemBERT (single model)
Question AnsweringSQuAD2.0F187.864SemBERT (single model)
Natural Language InferenceSNLI% Test Accuracy91.9SemBERT
Natural Language InferenceSNLI% Train Accuracy94.4SemBERT

Related Papers

Visual-Language Model Knowledge Distillation Method for Image Quality Assessment2025-07-21From Roots to Rewards: Dynamic Tree Reasoning with RL2025-07-17Enter the Mind Palace: Reasoning and Planning for Long-term Active Embodied Question Answering2025-07-17Vision-and-Language Training Helps Deploy Taxonomic Knowledge but Does Not Fundamentally Alter It2025-07-17City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning2025-07-17Making Language Model a Hierarchical Classifier and Generator2025-07-17VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations2025-07-17