Logits-Constrained Framework with RoBERTa for Ancient Chinese NER
Wenjie Hua, Shenghan Xu
Abstract
This paper presents a Logits-Constrained (LC) framework for Ancient Chinese Named Entity Recognition (NER), evaluated on the EvaHan 2025 benchmark. Our two-stage model integrates GujiRoBERTa for contextual encoding and a differentiable decoding mechanism to enforce valid BMES label transitions. Experiments demonstrate that LC improves performance over traditional CRF and BiLSTM-based approaches, especially in high-label or large-data settings. We also propose a model selection criterion balancing label complexity and dataset size, providing practical guidance for real-world Ancient Chinese NLP tasks.
Related Papers
Flippi: End To End GenAI Assistant for E-Commerce2025-07-08Topic Modeling and Link-Prediction for Material Property Discovery2025-07-08Advanced Financial Reasoning at Scale: A Comprehensive Evaluation of Large Language Models on CFA Level III2025-06-29Selecting and Merging: Towards Adaptable and Scalable Named Entity Recognition with Large Language Models2025-06-28mTSBench: Benchmarking Multivariate Time Series Anomaly Detection and Model Selection at Scale2025-06-26The use of cross validation in the analysis of designed experiments2025-06-17Leveraging Predictive Equivalence in Decision Trees2025-06-17Evaluating Generalization and Representation Stability in Small LMs via Prompting, Fine-Tuning and Out-of-Distribution Prompts2025-06-16