TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/CISOL: An Open and Extensible Dataset for Table Structure ...

CISOL: An Open and Extensible Dataset for Table Structure Recognition in the Construction Industry

David Tschirschwitz, Volker Rodehorst

2025-01-26BenchmarkingTable DetectionObject Detection
PaperPDF

Abstract

Reproducibility and replicability are critical pillars of empirical research, particularly in machine learning, where they depend not only on the availability of models, but also on the datasets used to train and evaluate those models. In this paper, we introduce the Construction Industry Steel Ordering List (CISOL) dataset, which was developed with a focus on transparency to ensure reproducibility, replicability, and extensibility. CISOL provides a valuable new research resource and highlights the importance of having diverse datasets, even in niche application domains such as table extraction in civil engineering. CISOL is unique in that it contains real-world civil engineering documents from industry, making it a distinctive contribution to the field. The dataset contains more than 120,000 annotated instances in over 800 document images, positioning it as a medium-sized dataset that provides a robust foundation for Table Structure Recognition (TSR) and Table Detection (TD) tasks. Benchmarking results show that CISOL achieves 67.22 mAP@0.5:0.95:0.05 using the YOLOv8 model, outperforming the TSR-specific TATR model. This highlights the effectiveness of CISOL as a benchmark for advancing TSR, especially in specialized domains.

Results

TaskDatasetMetricValueModel
Object DetectionCISOL - Track B - TSR-onlymAP@0.5:0.95:0.0561.39YOLO v8.1m
Object DetectionCISOL - Track A - TD-TSRmAP@0.5:0.95:0.0567.22YOLO v8.1m
3DCISOL - Track B - TSR-onlymAP@0.5:0.95:0.0561.39YOLO v8.1m
3DCISOL - Track A - TD-TSRmAP@0.5:0.95:0.0567.22YOLO v8.1m
2D ClassificationCISOL - Track B - TSR-onlymAP@0.5:0.95:0.0561.39YOLO v8.1m
2D ClassificationCISOL - Track A - TD-TSRmAP@0.5:0.95:0.0567.22YOLO v8.1m
2D Object DetectionCISOL - Track B - TSR-onlymAP@0.5:0.95:0.0561.39YOLO v8.1m
2D Object DetectionCISOL - Track A - TD-TSRmAP@0.5:0.95:0.0567.22YOLO v8.1m
16kCISOL - Track B - TSR-onlymAP@0.5:0.95:0.0561.39YOLO v8.1m
16kCISOL - Track A - TD-TSRmAP@0.5:0.95:0.0567.22YOLO v8.1m

Related Papers

Visual Place Recognition for Large-Scale UAV Applications2025-07-20Training Transformers with Enforced Lipschitz Constants2025-07-17Disentangling coincident cell events using deep transfer learning and compressive sensing2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17