Satya Almasian, Dennis Aumiller, Michael Gertz
Temporal expressions in text play a significant role in language understanding and correctly identifying them is fundamental to various retrieval and natural language processing systems. Previous works have slowly shifted from rule-based to neural architectures, capable of tagging expressions with higher accuracy. However, neural models can not yet distinguish between different expression types at the same level as their rule-based counterparts. In this work, we aim to identify the most suitable transformer architecture for joint temporal tagging and type classification, as well as, investigating the effect of semi-supervised training on the performance of these systems. Based on our study of token classification variants and encoder-decoder architectures, we present a transformer encoder-decoder model using the RoBERTa language model as our best performing system. By supplementing training resources with weakly labeled data from rule-based systems, our model surpasses previous works in temporal tagging and type classification, especially on rare classes. Our code and pre-trained experiments are available at: https://github.com/satya77/Transformer_Temporal_Tagger
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Information Extraction | TempEval-3 | Strict Detection (Pr.) | 96.37 | R2R |
| Information Extraction | TempEval-3 | Strict Detection (Re.) | 96.37 | R2R |
| Information Extraction | TempEval-3 | Relaxed Detection (F1) | 100 | R2R |
| Information Extraction | TempEval-3 | Relaxed Detection (Pr.) | 100 | R2R |
| Information Extraction | TempEval-3 | Relaxed Detection (Re.) | 100 | R2R |
| Information Extraction | TempEval-3 | Strict Detection (F1) | 96.37 | R2R |
| Information Extraction | TempEval-3 | Type | 90.43 | R2R |
| Information Extraction | TempEval-3 | Strict Detection (Pr.) | 94.11 | B2B |
| Information Extraction | TempEval-3 | Strict Detection (Re.) | 81.01 | B2B |
| Information Extraction | TempEval-3 | Relaxed Detection (F1) | 92.52 | B2B |
| Information Extraction | TempEval-3 | Relaxed Detection (Pr.) | 100 | B2B |
| Information Extraction | TempEval-3 | Relaxed Detection (Re.) | 86.09 | B2B |
| Information Extraction | TempEval-3 | Strict Detection (F1) | 87.07 | B2B |
| Information Extraction | TempEval-3 | Type | 83.79 | B2B |
| Information Extraction | TempEval-3 | Strict Detection (Pr.) | 82.72 | DateBERT |
| Information Extraction | TempEval-3 | Strict Detection (Re.) | 85.79 | DateBERT |
| Information Extraction | TempEval-3 | Relaxed Detection (F1) | 92.6 | DateBERT |
| Information Extraction | TempEval-3 | Relaxed Detection (Pr.) | 90.95 | DateBERT |
| Information Extraction | TempEval-3 | Relaxed Detection (Re.) | 94.35 | DateBERT |
| Information Extraction | TempEval-3 | Strict Detection (F1) | 84.21 | DateBERT |
| Information Extraction | TempEval-3 | Type | 86.21 | DateBERT |
| Information Extraction | TempEval-3 | Strict Detection (Pr.) | 81.83 | BERT-base |
| Information Extraction | TempEval-3 | Strict Detection (Re.) | 79.56 | BERT-base |
| Information Extraction | TempEval-3 | Relaxed Detection (F1) | 90.08 | BERT-base |
| Information Extraction | TempEval-3 | Relaxed Detection (Pr.) | 91.37 | BERT-base |
| Information Extraction | TempEval-3 | Relaxed Detection (Re.) | 88.84 | BERT-base |
| Information Extraction | TempEval-3 | Strict Detection (F1) | 80.67 | BERT-base |
| Information Extraction | TempEval-3 | Type | 82 | BERT-base |
| Temporal Processing | TempEval-3 | Strict Detection (Pr.) | 96.37 | R2R |
| Temporal Processing | TempEval-3 | Strict Detection (Re.) | 96.37 | R2R |
| Temporal Processing | TempEval-3 | Relaxed Detection (F1) | 100 | R2R |
| Temporal Processing | TempEval-3 | Relaxed Detection (Pr.) | 100 | R2R |
| Temporal Processing | TempEval-3 | Relaxed Detection (Re.) | 100 | R2R |
| Temporal Processing | TempEval-3 | Strict Detection (F1) | 96.37 | R2R |
| Temporal Processing | TempEval-3 | Type | 90.43 | R2R |
| Temporal Processing | TempEval-3 | Strict Detection (Pr.) | 94.11 | B2B |
| Temporal Processing | TempEval-3 | Strict Detection (Re.) | 81.01 | B2B |
| Temporal Processing | TempEval-3 | Relaxed Detection (F1) | 92.52 | B2B |
| Temporal Processing | TempEval-3 | Relaxed Detection (Pr.) | 100 | B2B |
| Temporal Processing | TempEval-3 | Relaxed Detection (Re.) | 86.09 | B2B |
| Temporal Processing | TempEval-3 | Strict Detection (F1) | 87.07 | B2B |
| Temporal Processing | TempEval-3 | Type | 83.79 | B2B |
| Temporal Processing | TempEval-3 | Strict Detection (Pr.) | 82.72 | DateBERT |
| Temporal Processing | TempEval-3 | Strict Detection (Re.) | 85.79 | DateBERT |
| Temporal Processing | TempEval-3 | Relaxed Detection (F1) | 92.6 | DateBERT |
| Temporal Processing | TempEval-3 | Relaxed Detection (Pr.) | 90.95 | DateBERT |
| Temporal Processing | TempEval-3 | Relaxed Detection (Re.) | 94.35 | DateBERT |
| Temporal Processing | TempEval-3 | Strict Detection (F1) | 84.21 | DateBERT |
| Temporal Processing | TempEval-3 | Type | 86.21 | DateBERT |
| Temporal Processing | TempEval-3 | Strict Detection (Pr.) | 81.83 | BERT-base |
| Temporal Processing | TempEval-3 | Strict Detection (Re.) | 79.56 | BERT-base |
| Temporal Processing | TempEval-3 | Relaxed Detection (F1) | 90.08 | BERT-base |
| Temporal Processing | TempEval-3 | Relaxed Detection (Pr.) | 91.37 | BERT-base |
| Temporal Processing | TempEval-3 | Relaxed Detection (Re.) | 88.84 | BERT-base |
| Temporal Processing | TempEval-3 | Strict Detection (F1) | 80.67 | BERT-base |
| Temporal Processing | TempEval-3 | Type | 82 | BERT-base |
| Temporal Information Extraction | TempEval-3 | Strict Detection (Pr.) | 96.37 | R2R |
| Temporal Information Extraction | TempEval-3 | Strict Detection (Re.) | 96.37 | R2R |
| Temporal Information Extraction | TempEval-3 | Relaxed Detection (F1) | 100 | R2R |
| Temporal Information Extraction | TempEval-3 | Relaxed Detection (Pr.) | 100 | R2R |
| Temporal Information Extraction | TempEval-3 | Relaxed Detection (Re.) | 100 | R2R |
| Temporal Information Extraction | TempEval-3 | Strict Detection (F1) | 96.37 | R2R |
| Temporal Information Extraction | TempEval-3 | Type | 90.43 | R2R |
| Temporal Information Extraction | TempEval-3 | Strict Detection (Pr.) | 94.11 | B2B |
| Temporal Information Extraction | TempEval-3 | Strict Detection (Re.) | 81.01 | B2B |
| Temporal Information Extraction | TempEval-3 | Relaxed Detection (F1) | 92.52 | B2B |
| Temporal Information Extraction | TempEval-3 | Relaxed Detection (Pr.) | 100 | B2B |
| Temporal Information Extraction | TempEval-3 | Relaxed Detection (Re.) | 86.09 | B2B |
| Temporal Information Extraction | TempEval-3 | Strict Detection (F1) | 87.07 | B2B |
| Temporal Information Extraction | TempEval-3 | Type | 83.79 | B2B |
| Temporal Information Extraction | TempEval-3 | Strict Detection (Pr.) | 82.72 | DateBERT |
| Temporal Information Extraction | TempEval-3 | Strict Detection (Re.) | 85.79 | DateBERT |
| Temporal Information Extraction | TempEval-3 | Relaxed Detection (F1) | 92.6 | DateBERT |
| Temporal Information Extraction | TempEval-3 | Relaxed Detection (Pr.) | 90.95 | DateBERT |
| Temporal Information Extraction | TempEval-3 | Relaxed Detection (Re.) | 94.35 | DateBERT |
| Temporal Information Extraction | TempEval-3 | Strict Detection (F1) | 84.21 | DateBERT |
| Temporal Information Extraction | TempEval-3 | Type | 86.21 | DateBERT |
| Temporal Information Extraction | TempEval-3 | Strict Detection (Pr.) | 81.83 | BERT-base |
| Temporal Information Extraction | TempEval-3 | Strict Detection (Re.) | 79.56 | BERT-base |
| Temporal Information Extraction | TempEval-3 | Relaxed Detection (F1) | 90.08 | BERT-base |
| Temporal Information Extraction | TempEval-3 | Relaxed Detection (Pr.) | 91.37 | BERT-base |
| Temporal Information Extraction | TempEval-3 | Relaxed Detection (Re.) | 88.84 | BERT-base |
| Temporal Information Extraction | TempEval-3 | Strict Detection (F1) | 80.67 | BERT-base |
| Temporal Information Extraction | TempEval-3 | Type | 82 | BERT-base |