TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Automatic extraction of materials and properties from supe...

Automatic extraction of materials and properties from superconductors scientific literature

Luca Foppiano, Pedro Baptista de Castro, Pedro Ortiz Suarez, Kensei Terashima, Yoshihiko Takano, Masashi Ishii

2022-10-26NER
PaperPDFCode(official)Code

Abstract

The automatic extraction of materials and related properties from the scientific literature is gaining attention in data-driven materials science (Materials Informatics). In this paper, we discuss Grobid-superconductors, our solution for automatically extracting superconductor material names and respective properties from text. Built as a Grobid module, it combines machine learning and heuristic approaches in a multi-step architecture that supports input data as raw text or PDF documents. Using Grobid-superconductors, we built SuperCon2, a database of 40324 materials and properties records from 37700 papers. The material (or sample) information is represented by name, chemical formula, and material class, and is characterized by shape, doping, substitution variables for components, and substrate as adjoined information. The properties include the Tc superconducting critical temperature and, when available, applied pressure with the Tc measurement method.

Results

TaskDatasetMetricValueModel
Open Information ExtractionSuperMatF177.03superconductors-Scibert
Open Information ExtractionSuperMatPrecision73.69superconductors-Scibert
Open Information ExtractionSuperMatRecall80.69superconductors-Scibert
Information ExtractionSuperMatF177.03superconductors-Scibert
Information ExtractionSuperMatPrecision73.69superconductors-Scibert
Information ExtractionSuperMatRecall80.69superconductors-Scibert
Named Entity Recognition (NER)SuperMatF177.03superconductors-Scibert
Named Entity Recognition (NER)SuperMatPrecision73.69superconductors-Scibert
Named Entity Recognition (NER)SuperMatRecall80.69superconductors-Scibert
Event ExtractionSuperMatF177.03superconductors-Scibert
Event ExtractionSuperMatPrecision73.69superconductors-Scibert
Event ExtractionSuperMatRecall80.69superconductors-Scibert

Related Papers

Flippi: End To End GenAI Assistant for E-Commerce2025-07-08Selecting and Merging: Towards Adaptable and Scalable Named Entity Recognition with Large Language Models2025-06-28Improving Named Entity Transcription with Contextual LLM-based Revision2025-06-12Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering2025-06-05Efficient Data Selection for Domain Adaptation of ASR Using Pseudo-Labels and Multi-Stage Filtering2025-06-04EL4NER: Ensemble Learning for Named Entity Recognition via Multiple Small-Parameter Large Language Models2025-05-29Label-Guided In-Context Learning for Named Entity Recognition2025-05-29Named Entity Recognition in Historical Italian: The Case of Giacomo Leopardi's Zibaldone2025-05-26