TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Astock: A New Dataset and Automated Stock Trading based on...

Astock: A New Dataset and Automated Stock Trading based on Stock-specific News Analyzing Model

Jinan Zou, Haiyao Cao, Lingqiao Liu, YuHao Lin, Ehsan Abbasnejad, Javen Qinfeng Shi

2022-06-14Text-Based Stock PredictionStock Market PredictionSelf-Supervised LearningDecision MakingNews ClassificationStock Price PredictionSemantic Role LabelingStock PredictionOut-of-Distribution GeneralizationStock Trend Prediction
PaperPDFCode(official)

Abstract

Natural Language Processing(NLP) demonstrates a great potential to support financial decision-making by analyzing the text from social media or news outlets. In this work, we build a platform to study the NLP-aided stock auto-trading algorithms systematically. In contrast to the previous work, our platform is characterized by three features: (1) We provide financial news for each specific stock. (2) We provide various stock factors for each stock. (3) We evaluate performance from more financial-relevant metrics. Such a design allows us to develop and evaluate NLP-aided stock auto-trading algorithms in a more realistic setting. In addition to designing an evaluation platform and dataset collection, we also made a technical contribution by proposing a system to automatically learn a good feature representation from various input information. The key to our algorithm is a method called semantic role labeling Pooling (SRLP), which leverages Semantic Role Labeling (SRL) to create a compact representation of each news paragraph. Based on SRLP, we further incorporate other stock factors to make the final prediction. In addition, we propose a self-supervised learning strategy based on SRLP to enhance the out-of-distribution generalization performance of our system. Through our experimental study, we show that the proposed method achieves better performance and outperforms all the baselines' annualized rate of return as well as the maximum drawdown of the CSI300 index and XIN9 index on real trading. Our Astock dataset and code are available at https://github.com/JinanZou/Astock.

Results

TaskDatasetMetricValueModel
Stock Market PredictionAstockAccuray66.89SRL
Stock Market PredictionAstockF1-score66.92SRL
Stock Market PredictionAstockPrecision66.92SRL
Stock Market PredictionAstockRecall66.95SRL
Stock Market PredictionAstock1-166.89SRLP
Stock Trend PredictionAstockAccuray66.89SRL
Stock Trend PredictionAstockF1-score66.92SRL
Stock Trend PredictionAstockPrecision66.92SRL
Stock Trend PredictionAstockRecall66.95SRL
Stock Trend PredictionAstock1-166.89SRLP

Related Papers

Graph-Structured Data Analysis of Component Failure in Autonomous Cargo Ships Based on Feature Fusion2025-07-18Tri-Learn Graph Fusion Network for Attributed Graph Clustering2025-07-18A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys2025-07-17Higher-Order Pattern Unification Modulo Similarity Relations2025-07-17Exploiting Constraint Reasoning to Build Graphical Explanations for Mixed-Integer Linear Programming2025-07-17Acting and Planning with Hierarchical Operational Models on a Mobile Robot: A Study with RAE+UPOM2025-07-15CogDDN: A Cognitive Demand-Driven Navigation with Decision Optimization and Dual-Process Thinking2025-07-15Detección y Cuantificación de Erosión Fluvial con Visión Artificial2025-07-15