LexGLUE: A Benchmark Dataset for Legal Language Understanding in English

Ilias Chalkidis, Abhik Jana, Dirk Hartung, Michael Bommarito, Ion Androutsopoulos, Daniel Martin Katz, Nikolaos Aletras

2021-10-03
ACL 2022 5
Tasks: Multi-class Classification, Natural Language Understanding, Open-Ended Question Answering, Multi-Label Classification, Multiple Choice Question Answering (MCQA)

Abstract

Laws and their interpretations, legal arguments and agreements are typically expressed in writing, leading to the production of vast corpora of legal text. Their analysis, which is at the center of legal practice, becomes increasingly elaborate as these collections grow in size. Natural language understanding (NLU) technologies can be a valuable tool to support legal practitioners in these endeavors. Their usefulness, however, largely depends on whether current state-of-the-art models can generalize across various tasks in the legal domain. To answer this currently open question, we introduce the Legal General Language Understanding Evaluation (LexGLUE) benchmark, a collection of datasets for evaluating model performance across a diverse set of legal NLU tasks in a standardized way. We also provide an evaluation and analysis of several generic and legal-oriented models demonstrating that the latter consistently offer performance improvements across multiple tasks.

Results

Task                            Dataset  Metric    Value  Model
Natural Language Understanding  LexGLUE  CaseHOLD  70.7   BERT
Natural Language Understanding  LexGLUE  CaseHOLD  75.1   Legal-BERT
Natural Language Understanding  LexGLUE  CaseHOLD  75.6   CaseLaw-BERT
Natural Language Understanding  LexGLUE  CaseHOLD  70.4   BigBird
Natural Language Understanding  LexGLUE  CaseHOLD  72     Longformer
Natural Language Understanding  LexGLUE  CaseHOLD  71.7   RoBERTa
Natural Language Understanding  LexGLUE  CaseHOLD  72.1   DeBERTa
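
The abstract's claim that legal-oriented models outperform generic ones can be checked against the CaseHOLD rows above with a short comparison. The following is a minimal sketch, not part of the paper: the scores are copied from the results listed here, and the split into "legal-oriented" and "generic" models is an assumption inferred from the model names (Legal-BERT and CaseLaw-BERT treated as legal-oriented). It covers only this one task, so it cannot confirm the claim for the whole benchmark.

```python
# CaseHOLD scores copied from the results listed above.
scores = {
    "BERT": 70.7,
    "Legal-BERT": 75.1,
    "CaseLaw-BERT": 75.6,
    "BigBird": 70.4,
    "Longformer": 72.0,
    "RoBERTa": 71.7,
    "DeBERTa": 72.1,
}

# Assumption: the legal-oriented group is inferred from the model names;
# this page does not label the models explicitly.
LEGAL_MODELS = {"Legal-BERT", "CaseLaw-BERT"}


def split_scores(all_scores, legal_models):
    """Partition the score table into legal-oriented and generic models."""
    legal = {m: s for m, s in all_scores.items() if m in legal_models}
    generic = {m: s for m, s in all_scores.items() if m not in legal_models}
    return legal, generic


def mean(values):
    """Arithmetic mean of an iterable of numbers."""
    values = list(values)
    return sum(values) / len(values)


legal, generic = split_scores(scores, LEGAL_MODELS)

# Compare the weakest legal-oriented model with the strongest generic one:
# a positive margin means every legal model beats every generic model here.
best_generic = max(generic, key=generic.get)
weakest_legal = min(legal, key=legal.get)
margin = legal[weakest_legal] - generic[best_generic]

print(f"mean generic: {mean(generic.values()):.2f}")
print(f"mean legal:   {mean(legal.values()):.2f}")
print(f"{weakest_legal} vs {best_generic}: {margin:+.1f}")
```

On these numbers the weakest legal-oriented model (Legal-BERT) is about 3.0 points ahead of the strongest generic model (DeBERTa), which is consistent with the abstract for CaseHOLD.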
