TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/WDC Products: A Multi-Dimensional Entity Matching Benchmark

WDC Products: A Multi-Dimensional Entity Matching Benchmark

Ralph Peeters, Reng Chiz Der, Christian Bizer

2023-01-23Entity ResolutionMulti-class ClassificationData IntegrationContrastive Learning
PaperPDFCode(official)

Abstract

The difficulty of an entity matching task depends on a combination of multiple factors such as the amount of corner-case pairs, the fraction of entities in the test set that have not been seen during training, and the size of the development set. Current entity matching benchmarks usually represent single points in the space along such dimensions or they provide for the evaluation of matching methods along a single dimension, for instance the amount of training data. This paper presents WDC Products, an entity matching benchmark which provides for the systematic evaluation of matching systems along combinations of three dimensions while relying on real-world data. The three dimensions are (i) amount of corner-cases (ii) generalization to unseen entities, and (iii) development set size (training set plus validation set). Generalization to unseen entities is a dimension not covered by any of the existing English-language benchmarks yet but is crucial for evaluating the robustness of entity matching systems. Instead of learning how to match entity pairs, entity matching can also be formulated as a multi-class classification task that requires the matcher to recognize individual entities. WDC Products is the first benchmark that provides a pair-wise and a multi-class formulation of the same tasks. We evaluate WDC Products using several state-of-the-art matching systems, including Ditto, HierGAT, and R-SupCon. The evaluation shows that all matching systems struggle with unseen entities to varying degrees. It also shows that for entity matching contrastive learning is more training data efficient compared to cross-encoders.

Results

TaskDatasetMetricValueModel
Data IntegrationWDC Products-80%cc-seen-medium-multiF1 Micro88.63RoBERTa-SupCon
Data IntegrationWDC Products-80%cc-seen-medium-multiF1 Micro52.03RoBERTa-base
Data IntegrationWDC Products-80%cc-seen-mediumF1 (%)79.99RoBERTa-SupCon
Data IntegrationWDC Products-80%cc-seen-mediumF1 (%)73.93Ditto
Data IntegrationWDC Products-80%cc-seen-mediumF1 (%)72.18RoBERTa-base
Data IntegrationWDC Products-80%cc-seen-mediumF1 (%)71.4HG
Data IntegrationWDC Products-50%cc-unseen-mediumF1 (%)71.14RoBERTa-base
Data IntegrationWDC Products-50%cc-unseen-mediumF1 (%)70.66Ditto
Data IntegrationWDC Products-50%cc-unseen-mediumF1 (%)68.74HG
Data IntegrationWDC Products-50%cc-unseen-mediumF1 (%)57.23RoBERTa-SupCon
Entity ResolutionWDC Products-80%cc-seen-medium-multiF1 Micro88.63RoBERTa-SupCon
Entity ResolutionWDC Products-80%cc-seen-medium-multiF1 Micro52.03RoBERTa-base
Entity ResolutionWDC Products-80%cc-seen-mediumF1 (%)79.99RoBERTa-SupCon
Entity ResolutionWDC Products-80%cc-seen-mediumF1 (%)73.93Ditto
Entity ResolutionWDC Products-80%cc-seen-mediumF1 (%)72.18RoBERTa-base
Entity ResolutionWDC Products-80%cc-seen-mediumF1 (%)71.4HG
Entity ResolutionWDC Products-50%cc-unseen-mediumF1 (%)71.14RoBERTa-base
Entity ResolutionWDC Products-50%cc-unseen-mediumF1 (%)70.66Ditto
Entity ResolutionWDC Products-50%cc-unseen-mediumF1 (%)68.74HG
Entity ResolutionWDC Products-50%cc-unseen-mediumF1 (%)57.23RoBERTa-SupCon

Related Papers

SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts2025-07-17HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals2025-07-17Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16LLM-Driven Dual-Level Multi-Interest Modeling for Recommendation2025-07-15Latent Space Consistency for Sparse-View CT Reconstruction2025-07-15Self-supervised pretraining of vision transformers for animal behavioral analysis and neural encoding2025-07-13