Logits-Constrained Framework with RoBERTa for Ancient Chinese NER

Wenjie Hua, Shenghan Xu

2025-05-05named-entity-recognition Named Entity Recognition Chinese Named Entity Recognition NER Model Selection Named Entity Recognition (NER)

Paper PDF

Abstract

This paper presents a Logits-Constrained (LC) framework for Ancient Chinese Named Entity Recognition (NER), evaluated on the EvaHan 2025 benchmark. Our two-stage model integrates GujiRoBERTa for contextual encoding and a differentiable decoding mechanism to enforce valid BMES label transitions. Experiments demonstrate that LC improves performance over traditional CRF and BiLSTM-based approaches, especially in high-label or large-data settings. We also propose a model selection criterion balancing label complexity and dataset size, providing practical guidance for real-world Ancient Chinese NLP tasks.

Related Papers

Flippi: End To End GenAI Assistant for E-Commerce2025-07-08 Topic Modeling and Link-Prediction for Material Property Discovery2025-07-08 Advanced Financial Reasoning at Scale: A Comprehensive Evaluation of Large Language Models on CFA Level III2025-06-29 Selecting and Merging: Towards Adaptable and Scalable Named Entity Recognition with Large Language Models2025-06-28 mTSBench: Benchmarking Multivariate Time Series Anomaly Detection and Model Selection at Scale2025-06-26 The use of cross validation in the analysis of designed experiments2025-06-17 Leveraging Predictive Equivalence in Decision Trees2025-06-17 Evaluating Generalization and Representation Stability in Small LMs via Prompting, Fine-Tuning and Out-of-Distribution Prompts2025-06-16