Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Rethinking the Evaluation of Pre-trained Text-and-Layout Models from an Entity-Centric Perspective

Chong Zhang, Yixi Zhao, Chenshu Yuan, Yi Tu, Ya Guo, Qi Zhang

2024-02-04 | Semantic entity labeling | Entity Linking

Abstract

Recently developed pre-trained text-and-layout models (PTLMs) have shown remarkable success in multiple information extraction tasks on visually-rich documents. However, the prevailing evaluation pipeline may not be sufficiently robust for assessing the information extraction ability of PTLMs, due to inadequate annotations within the benchmarks. Therefore, we set out the standards that an ideal benchmark must meet to evaluate the information extraction ability of PTLMs. We then introduce EC-FUNSD, an entity-centric benchmark designed for the evaluation of semantic entity recognition and entity linking on visually-rich documents. This dataset contains diverse formats of document layouts and annotations of semantic-driven entities and their relations. Moreover, this dataset disentangles the falsely coupled annotation of segment and entity that arises from the block-level annotation of FUNSD. Experimental results demonstrate that state-of-the-art PTLMs exhibit overfitting tendencies on the prevailing benchmarks, as their performance decreases sharply when the dataset bias is removed.

Results

Task | Dataset | Metric | Value | Model
Entity Linking | EC-FUNSD | F1 | 86.18 | GeoLayoutLM
Semantic entity labeling | EC-FUNSD | F1 | 83.62 | GeoLayoutLM
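
For readers unfamiliar with the metric, the following is a minimal sketch of how an entity-level F1 score is commonly computed, where a predicted entity counts as correct only if its label and span both match a gold entity exactly. It is an illustration, not the scoring script used for the results above: the `Entity` tuple layout, the `entity_f1` function name, and the example data are all hypothetical.

```python
from typing import Set, Tuple

# Hypothetical entity representation: (label, start_token, end_token).
Entity = Tuple[str, int, int]


def entity_f1(gold: Set[Entity], pred: Set[Entity]) -> float:
    """Exact-match entity-level F1 between gold and predicted entity sets."""
    if not gold or not pred:
        return 0.0
    true_positives = len(gold & pred)  # entities matching in label and span
    precision = true_positives / len(pred)
    recall = true_positives / len(gold)
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Made-up example: one of two predictions matches one of three gold entities.
gold = {("question", 0, 2), ("answer", 3, 5), ("header", 6, 7)}
pred = {("question", 0, 2), ("answer", 3, 4)}
score = entity_f1(gold, pred)
```

Here precision is 1/2 and recall is 1/3, so the score is 0.4. Under exact matching, the prediction with a span that is off by one token earns no credit.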

Related Papers

LEMONADE: A Large Multilingual Expert-Annotated Abstractive Event Dataset for the Real World (2025-06-01)
Distilling Closed-Source LLM's Knowledge for Locally Stable and Economic Biomedical Entity Linking (2025-05-26)
Evaluation of LLMs on Long-tail Entity Linking in Historical Documents (2025-05-06)
KGMEL: Knowledge Graph-Enhanced Multimodal Entity Linking (2025-04-21)
Cross-Document Contextual Coreference Resolution in Knowledge Graphs (2025-04-08)
Explainable ICD Coding via Entity Linking (2025-03-26)
Entity-aware Cross-lingual Claim Detection for Automated Fact-checking (2025-03-19)
Leveraging Knowledge Graphs and LLMs for Context-Aware Messaging (2025-03-12)