Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

ColNet: Embedding the Semantics of Web Tables for Column Type Prediction

Jiaoyan Chen, Ernesto Jimenez-Ruiz, Ian Horrocks, Charles Sutton

Date: 2018-11-04
Tasks: Column Type Annotation, Type prediction, Vocal Bursts Type Prediction, Table annotation

Abstract

Automatically annotating column types with knowledge base (KB) concepts is a critical task for gaining a basic understanding of web tables. Current methods rely either on table metadata, such as column names, or on entity correspondences between cells and the KB, and may fail on the growing number of web tables with incomplete metadata. In this paper we propose a neural-network-based column type annotation framework named ColNet, which integrates KB reasoning and lookup with machine learning and can automatically train Convolutional Neural Networks for prediction. The prediction model not only considers the contextual semantics within a cell using word representation, but also embeds the semantics of a column by learning locality features from multiple cells. The method is evaluated with DBpedia and two different web table datasets, T2Dv2 from the general Web and Limaye from Wikipedia pages, and achieves higher performance than the state-of-the-art approaches.
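To make the idea of "learning locality features from multiple cells" more concrete, the following is a minimal sketch in Python with NumPy, not the authors' implementation. It assumes each cell is represented by the average of its word vectors, that a sample of cells is stacked into a matrix, and that convolution kernels slide over adjacent cells before max-pooling. The function names, the embedding size, the hash-based stand-in for pretrained word vectors, and the random (untrained) weights are all hypothetical and chosen only for illustration.

```python
import hashlib

import numpy as np

DIM = 8  # toy embedding size (assumption; pretrained word vectors are much larger)


def word_vector(word: str) -> np.ndarray:
    """Deterministic pseudo-random stand-in for a pretrained word embedding."""
    digest = hashlib.sha256(word.lower().encode("utf-8")).digest()
    seed = int.from_bytes(digest[:4], "big")
    return np.random.default_rng(seed).standard_normal(DIM)


def cell_vector(cell: str) -> np.ndarray:
    """Represent a cell by averaging the vectors of its words."""
    words = cell.split()
    if not words:
        return np.zeros(DIM)
    return np.mean([word_vector(w) for w in words], axis=0)


def column_matrix(cells: list[str]) -> np.ndarray:
    """Stack the cell vectors of a column sample into a (num_cells, DIM) matrix."""
    return np.stack([cell_vector(c) for c in cells])


def conv_features(matrix: np.ndarray, kernels: np.ndarray) -> np.ndarray:
    """Slide each (k, DIM) kernel over k adjacent cells, apply ReLU, then max-pool.

    Requires at least k rows in `matrix`.
    """
    k = kernels.shape[1]
    windows = np.stack([matrix[i:i + k] for i in range(matrix.shape[0] - k + 1)])
    # windows: (num_windows, k, DIM); kernels: (num_filters, k, DIM)
    activations = np.einsum("wkd,fkd->wf", windows, kernels)
    return np.maximum(activations, 0.0).max(axis=0)  # one feature per filter


def type_score(cells: list[str], kernels: np.ndarray,
               weights: np.ndarray, bias: float) -> float:
    """Sigmoid score in [0, 1] for one candidate KB class (binary classifier)."""
    features = conv_features(column_matrix(cells), kernels)
    return float(1.0 / (1.0 + np.exp(-(features @ weights + bias))))


# Untrained random parameters, used here only to show the data flow.
rng = np.random.default_rng(0)
kernels = rng.standard_normal((4, 2, DIM))  # 4 filters spanning 2 adjacent cells
weights = rng.standard_normal(4)
score = type_score(["Oxford", "Edinburgh", "New York"], kernels, weights, 0.0)
```

In this sketch the score is meaningless until the kernels and weights are trained; the abstract indicates that training samples are obtained through KB lookup and reasoning, which is not modeled here.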

Results

Task              Dataset  Metric  Value  Model
Data Integration  T2Dv2    F1 (%)  94.9   ColNet - Ensemble
Table annotation  T2Dv2    F1 (%)  94.9   ColNet - Ensemble

Related Papers

UniPTMs: The First Unified Multi-type PTM Site Prediction Model via Master-Slave Architecture-Based Multi-Stage Fusion Strategy and Hierarchical Contrastive Loss (2025-06-05)
GenEDA: Unleashing Generative Reasoning on Netlist via Multimodal Encoder-Decoder Aligned Foundation Model (2025-04-13)
Enhancing Arabic Automated Essay Scoring with Synthetic Data and Error Injection (2025-03-22)
Evaluating Knowledge Generation and Self-Refinement Strategies for LLM-based Column Type Annotation (2025-03-04)
Language-TPP: Integrating Temporal Point Processes with Language Models for Event Analysis (2025-02-11)
Graph Structure Learning for Tumor Microenvironment with Cell Type Annotation from non-spatial scRNA-seq data (2025-02-04)
Column Property Annotation using Large Language Models (2025-01-01)
Stock Type Prediction Model Based on Hierarchical Graph Neural Network (2024-12-09)