TorontoCL at CMCL 2021 Shared Task: RoBERTa with Multi-Stage Fine-Tuning for Eye-Tracking Prediction

Bai Li, Frank Rudzicz

2021-04-15NAACL (CMCL) 2021 6regression

Abstract

Eye movement data during reading is a useful source of information for understanding language comprehension processes. In this paper, we describe our submission to the CMCL 2021 shared task on predicting human reading patterns. Our model uses RoBERTa with a regression layer to predict 5 eye-tracking features. We train the model in two stages: we first fine-tune on the Provo corpus (another eye-tracking dataset), then fine-tune on the task data. We compare different Transformer models and apply ensembling methods to improve the performance. Our final submission achieves a MAE score of 3.929, ranking 3rd place out of 13 teams that participated in this shared task.

Related Papers

Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression2025-07-20 Neural Network-Guided Symbolic Regression for Interpretable Descriptor Discovery in Perovskite Catalysts2025-07-16 Imbalanced Regression Pipeline Recommendation2025-07-16 Second-Order Bounds for [0,1]-Valued Regression via Betting Loss2025-07-16 Sparse Regression Codes exploit Multi-User Diversity without CSI2025-07-15 Bradley-Terry and Multi-Objective Reward Modeling Are Complementary2025-07-10 Active Learning for Manifold Gaussian Process Regression2025-06-26 A Survey of Predictive Maintenance Methods: An Analysis of Prognostics via Classification and Regression2025-06-25