Semantic Annotation of Tabular Data for Machine-to-Machine Interoperability via Neuro-Symbolic Anchoring
Shervin Mehryar, Remzi Celebi
Abstract
In this paper we investigate automated annotation of tabular data using semantic technologies in combination with neural network embedding. Specifically, we propose an anchoring model in which property and cell types from the data embedding space are aligned with ontology relation and entity types. We show that by combining the power of symbolic reasoning, neural embeddings, and loss function design, a significant performance improvement as high as 86% for column property, 82% for column type, and 87% for column qualifier annotations can be achieved based on DBpedia and Wikidata table extractions.
Related Papers
GARG-AML against Smurfing: A Scalable and Interpretable Graph-Based Framework for Anti-Money Laundering2025-06-04Network Alignment2025-04-15Subset-Contrastive Multi-Omics Network Embedding2025-04-15Network Embedding Exploration Tool (NEExT)2025-03-20ASD Classification on Dynamic Brain Connectome using Temporal Random Walk with Transformer-based Dynamic Network Embedding2025-03-16Unifying Structural Proximity and Equivalence for Enhanced Dynamic Network Embedding2025-03-14Evaluating Knowledge Generation and Self-Refinement Strategies for LLM-based Column Type Annotation2025-03-04Unsupervised Attributed Dynamic Network Embedding with Stability Guarantees2025-03-04