TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Glyce: Glyph-vectors for Chinese Character Representations

Glyce: Glyph-vectors for Chinese Character Representations

Yuxian Meng, Wei Wu, Fei Wang, Xiaoya Li, Ping Nie, Fan Yin, Muyu Li, Qinghong Han, Xiaofei Sun, Jiwei Li

2019-01-29NeurIPS 2019 12Text ClassificationChinese Word SegmentationMachine TranslationImage ClassificationSentiment AnalysisPOSPart-Of-Speech TaggingChinese Sentence Pair ClassificationChinese Semantic Role LabelingSentence-Pair ClassificationChinese Named Entity RecognitionNERSemantic Textual SimilarityMulti-Task LearningDocument Classificationtext-classificationSemantic Role LabelingGeneral ClassificationClassificationSentence ClassificationDependency ParsingLanguage Modelling
PaperPDFCode(official)Code

Abstract

It is intuitive that NLP tasks for logographic languages like Chinese should benefit from the use of the glyph information in those languages. However, due to the lack of rich pictographic evidence in glyphs and the weak generalization ability of standard computer vision models on character data, an effective way to utilize the glyph information remains to be found. In this paper, we address this gap by presenting Glyce, the glyph-vectors for Chinese character representations. We make three major innovations: (1) We use historical Chinese scripts (e.g., bronzeware script, seal script, traditional Chinese, etc) to enrich the pictographic evidence in characters; (2) We design CNN structures (called tianzege-CNN) tailored to Chinese character image processing; and (3) We use image-classification as an auxiliary task in a multi-task learning setup to increase the model's ability to generalize. We show that glyph-based models are able to consistently outperform word/char ID-based models in a wide range of Chinese NLP tasks. We are able to set new state-of-the-art results for a variety of Chinese NLP tasks, including tagging (NER, CWS, POS), sentence pair classification, single sentence classification tasks, dependency parsing, and semantic role labeling. For example, the proposed model achieves an F1 score of 80.6 on the OntoNotes dataset of NER, +1.5 over BERT; it achieves an almost perfect accuracy of 99.8\% on the Fudan corpus for text classification. Code found at https://github.com/ShannonAI/glyce.

Results

TaskDatasetMetricValueModel
ChineseCITYUF197.9Glyce + BERT
ChineseCITYUPrecision97.9Glyce + BERT
ChineseCITYURecall98Glyce + BERT
ChineseASF196.7Glyce + BERT
ChineseASPrecision96.6Glyce + BERT
ChineseASRecall96.8Glyce + BERT
ChinesePKUF196.7Glyce + BERT
ChinesePKUPrecision97.1Glyce + BERT
ChinesePKURecall96.4Glyce + BERT
ChineseMSRF198.3Glyce + BERT
ChineseMSRPrecision98.2Glyce + BERT
ChineseMSRRecall98.3Glyce + BERT
Named Entity Recognition (NER)Weibo NERF167.6Glyce + BERT
Named Entity Recognition (NER)Weibo NERPrecision67.68Glyce + BERT
Named Entity Recognition (NER)Weibo NERRecall67.71Glyce + BERT
Named Entity Recognition (NER)MSRAF195.54Glyce + BERT
Named Entity Recognition (NER)MSRAPrecision95.57Glyce + BERT
Named Entity Recognition (NER)MSRARecall95.51Glyce + BERT
Named Entity Recognition (NER)Resume NERF196.54Glyce + BERT
Named Entity Recognition (NER)Resume NERPrecision96.62Glyce + BERT
Named Entity Recognition (NER)Resume NERRecall96.48Glyce + BERT
Named Entity Recognition (NER)OntoNotes 4F180.62Glyce + BERT
Named Entity Recognition (NER)OntoNotes 4Precision81.87Glyce + BERT
Named Entity Recognition (NER)OntoNotes 4Recall81.4Glyce + BERT

Related Papers

Visual-Language Model Knowledge Distillation Method for Image Quality Assessment2025-07-21Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18Making Language Model a Hierarchical Classifier and Generator2025-07-17Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17AdaptiSent: Context-Aware Adaptive Attention for Multimodal Aspect-Based Sentiment Analysis2025-07-17