End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF

Xuezhe Ma, Eduard Hovy

2016-03-04ACL 2016 8Feature Engineering POS Part-Of-Speech Tagging Named Entity Recognition Named Entity Recognition (NER)POS Tagging

Paper PDF Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code

Abstract

State-of-the-art sequence labeling systems traditionally require large amounts of task-specific knowledge in the form of hand-crafted features and data pre-processing. In this paper, we introduce a novel neutral network architecture that benefits from both word- and character-level representations automatically, by using combination of bidirectional LSTM, CNN and CRF. Our system is truly end-to-end, requiring no feature engineering or data pre-processing, thus making it applicable to a wide range of sequence labeling tasks. We evaluate our system on two data sets for two sequence labeling tasks --- Penn Treebank WSJ corpus for part-of-speech (POS) tagging and CoNLL 2003 corpus for named entity recognition (NER). We obtain state-of-the-art performance on both the two data --- 97.55\% accuracy for POS tagging and 91.21\% F1 for NER.

Results

Task	Dataset	Metric	Value	Model
Part-Of-Speech Tagging	Penn Treebank	Accuracy	97.55	BLSTM-CNN-CRF
Named Entity Recognition (NER)	CoNLL 2003 (English)	F1	91.21	BLSTM-CNN-CRF
Named Entity Recognition (NER)	CoNLL++	F1	91.87	BiLSTM-CNN-CRF

Related Papers

Flippi: End To End GenAI Assistant for E-Commerce2025-07-08 Advancing Magnetic Materials Discovery -- A structure-based machine learning approach for magnetic ordering and magnetic moment prediction2025-07-02 Prompt Mechanisms in Medical Imaging: A Comprehensive Survey2025-06-28 Selecting and Merging: Towards Adaptable and Scalable Named Entity Recognition with Large Language Models2025-06-28 Quantum Reinforcement Learning Trading Agent for Sector Rotation in the Taiwan Stock Market2025-06-26 Temporal-Aware Graph Attention Network for Cryptocurrency Transaction Fraud Detection2025-06-26 Tabular Feature Discovery With Reasoning Type Exploration2025-06-25 A Deep Learning Approach to Identify Rock Bolts in Complex 3D Point Clouds of Underground Mines Captured Using Mobile Laser Scanners2025-06-25