TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/BERT got a Date: Introducing Transformers to Temporal Tagg...

BERT got a Date: Introducing Transformers to Temporal Tagging

Satya Almasian, Dennis Aumiller, Michael Gertz

2021-09-30Temporal TaggingToken ClassificationRetrievalClassificationLanguage Modelling
PaperPDFCode(official)

Abstract

Temporal expressions in text play a significant role in language understanding and correctly identifying them is fundamental to various retrieval and natural language processing systems. Previous works have slowly shifted from rule-based to neural architectures, capable of tagging expressions with higher accuracy. However, neural models can not yet distinguish between different expression types at the same level as their rule-based counterparts. In this work, we aim to identify the most suitable transformer architecture for joint temporal tagging and type classification, as well as, investigating the effect of semi-supervised training on the performance of these systems. Based on our study of token classification variants and encoder-decoder architectures, we present a transformer encoder-decoder model using the RoBERTa language model as our best performing system. By supplementing training resources with weakly labeled data from rule-based systems, our model surpasses previous works in temporal tagging and type classification, especially on rare classes. Our code and pre-trained experiments are available at: https://github.com/satya77/Transformer_Temporal_Tagger

Results

TaskDatasetMetricValueModel
Information ExtractionTempEval-3 Strict Detection (Pr.)96.37R2R
Information ExtractionTempEval-3 Strict Detection (Re.)96.37R2R
Information ExtractionTempEval-3Relaxed Detection (F1)100R2R
Information ExtractionTempEval-3Relaxed Detection (Pr.)100R2R
Information ExtractionTempEval-3Relaxed Detection (Re.)100R2R
Information ExtractionTempEval-3Strict Detection (F1)96.37R2R
Information ExtractionTempEval-3Type90.43R2R
Information ExtractionTempEval-3 Strict Detection (Pr.)94.11B2B
Information ExtractionTempEval-3 Strict Detection (Re.)81.01B2B
Information ExtractionTempEval-3Relaxed Detection (F1)92.52B2B
Information ExtractionTempEval-3Relaxed Detection (Pr.)100B2B
Information ExtractionTempEval-3Relaxed Detection (Re.)86.09B2B
Information ExtractionTempEval-3Strict Detection (F1)87.07B2B
Information ExtractionTempEval-3Type83.79B2B
Information ExtractionTempEval-3 Strict Detection (Pr.)82.72DateBERT
Information ExtractionTempEval-3 Strict Detection (Re.)85.79DateBERT
Information ExtractionTempEval-3Relaxed Detection (F1)92.6DateBERT
Information ExtractionTempEval-3Relaxed Detection (Pr.)90.95DateBERT
Information ExtractionTempEval-3Relaxed Detection (Re.)94.35DateBERT
Information ExtractionTempEval-3Strict Detection (F1)84.21DateBERT
Information ExtractionTempEval-3Type86.21DateBERT
Information ExtractionTempEval-3 Strict Detection (Pr.)81.83BERT-base
Information ExtractionTempEval-3 Strict Detection (Re.)79.56BERT-base
Information ExtractionTempEval-3Relaxed Detection (F1)90.08BERT-base
Information ExtractionTempEval-3Relaxed Detection (Pr.)91.37BERT-base
Information ExtractionTempEval-3Relaxed Detection (Re.)88.84BERT-base
Information ExtractionTempEval-3Strict Detection (F1)80.67BERT-base
Information ExtractionTempEval-3Type82BERT-base
Temporal ProcessingTempEval-3 Strict Detection (Pr.)96.37R2R
Temporal ProcessingTempEval-3 Strict Detection (Re.)96.37R2R
Temporal ProcessingTempEval-3Relaxed Detection (F1)100R2R
Temporal ProcessingTempEval-3Relaxed Detection (Pr.)100R2R
Temporal ProcessingTempEval-3Relaxed Detection (Re.)100R2R
Temporal ProcessingTempEval-3Strict Detection (F1)96.37R2R
Temporal ProcessingTempEval-3Type90.43R2R
Temporal ProcessingTempEval-3 Strict Detection (Pr.)94.11B2B
Temporal ProcessingTempEval-3 Strict Detection (Re.)81.01B2B
Temporal ProcessingTempEval-3Relaxed Detection (F1)92.52B2B
Temporal ProcessingTempEval-3Relaxed Detection (Pr.)100B2B
Temporal ProcessingTempEval-3Relaxed Detection (Re.)86.09B2B
Temporal ProcessingTempEval-3Strict Detection (F1)87.07B2B
Temporal ProcessingTempEval-3Type83.79B2B
Temporal ProcessingTempEval-3 Strict Detection (Pr.)82.72DateBERT
Temporal ProcessingTempEval-3 Strict Detection (Re.)85.79DateBERT
Temporal ProcessingTempEval-3Relaxed Detection (F1)92.6DateBERT
Temporal ProcessingTempEval-3Relaxed Detection (Pr.)90.95DateBERT
Temporal ProcessingTempEval-3Relaxed Detection (Re.)94.35DateBERT
Temporal ProcessingTempEval-3Strict Detection (F1)84.21DateBERT
Temporal ProcessingTempEval-3Type86.21DateBERT
Temporal ProcessingTempEval-3 Strict Detection (Pr.)81.83BERT-base
Temporal ProcessingTempEval-3 Strict Detection (Re.)79.56BERT-base
Temporal ProcessingTempEval-3Relaxed Detection (F1)90.08BERT-base
Temporal ProcessingTempEval-3Relaxed Detection (Pr.)91.37BERT-base
Temporal ProcessingTempEval-3Relaxed Detection (Re.)88.84BERT-base
Temporal ProcessingTempEval-3Strict Detection (F1)80.67BERT-base
Temporal ProcessingTempEval-3Type82BERT-base
Temporal Information ExtractionTempEval-3 Strict Detection (Pr.)96.37R2R
Temporal Information ExtractionTempEval-3 Strict Detection (Re.)96.37R2R
Temporal Information ExtractionTempEval-3Relaxed Detection (F1)100R2R
Temporal Information ExtractionTempEval-3Relaxed Detection (Pr.)100R2R
Temporal Information ExtractionTempEval-3Relaxed Detection (Re.)100R2R
Temporal Information ExtractionTempEval-3Strict Detection (F1)96.37R2R
Temporal Information ExtractionTempEval-3Type90.43R2R
Temporal Information ExtractionTempEval-3 Strict Detection (Pr.)94.11B2B
Temporal Information ExtractionTempEval-3 Strict Detection (Re.)81.01B2B
Temporal Information ExtractionTempEval-3Relaxed Detection (F1)92.52B2B
Temporal Information ExtractionTempEval-3Relaxed Detection (Pr.)100B2B
Temporal Information ExtractionTempEval-3Relaxed Detection (Re.)86.09B2B
Temporal Information ExtractionTempEval-3Strict Detection (F1)87.07B2B
Temporal Information ExtractionTempEval-3Type83.79B2B
Temporal Information ExtractionTempEval-3 Strict Detection (Pr.)82.72DateBERT
Temporal Information ExtractionTempEval-3 Strict Detection (Re.)85.79DateBERT
Temporal Information ExtractionTempEval-3Relaxed Detection (F1)92.6DateBERT
Temporal Information ExtractionTempEval-3Relaxed Detection (Pr.)90.95DateBERT
Temporal Information ExtractionTempEval-3Relaxed Detection (Re.)94.35DateBERT
Temporal Information ExtractionTempEval-3Strict Detection (F1)84.21DateBERT
Temporal Information ExtractionTempEval-3Type86.21DateBERT
Temporal Information ExtractionTempEval-3 Strict Detection (Pr.)81.83BERT-base
Temporal Information ExtractionTempEval-3 Strict Detection (Re.)79.56BERT-base
Temporal Information ExtractionTempEval-3Relaxed Detection (F1)90.08BERT-base
Temporal Information ExtractionTempEval-3Relaxed Detection (Pr.)91.37BERT-base
Temporal Information ExtractionTempEval-3Relaxed Detection (Re.)88.84BERT-base
Temporal Information ExtractionTempEval-3Strict Detection (F1)80.67BERT-base
Temporal Information ExtractionTempEval-3Type82BERT-base

Related Papers

Visual-Language Model Knowledge Distillation Method for Image Quality Assessment2025-07-21From Roots to Rewards: Dynamic Tree Reasoning with RL2025-07-17HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals2025-07-17A Survey of Context Engineering for Large Language Models2025-07-17MCoT-RE: Multi-Faceted Chain-of-Thought and Re-Ranking for Training-Free Zero-Shot Composed Image Retrieval2025-07-17Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Making Language Model a Hierarchical Classifier and Generator2025-07-17VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17