TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/HoTPP Benchmark: Are We Good at the Long Horizon Events Fo...

HoTPP Benchmark: Are We Good at the Long Horizon Events Forecasting?

Ivan Karpukhin, Foma Shipilov, Andrey Savchenko

2024-06-20BenchmarkingTime Series ForecastingPoint Processes
PaperPDFCode(official)Code

Abstract

Accurately forecasting multiple future events within a given time horizon is crucial for finance, retail, social networks, and healthcare applications. Event timing and labels are typically modeled using Marked Temporal Point Processes (MTPP), with evaluations often focused on next-event prediction quality. While some studies have extended evaluations to a fixed number of future events, we demonstrate that this approach leads to inaccuracies in handling false positives and false negatives. To address these issues, we propose a novel evaluation method inspired by object detection techniques from computer vision. Specifically, we introduce Temporal mean Average Precision (T-mAP), a temporal variant of mAP, which overcomes the limitations of existing long-horizon evaluation metrics. Our extensive experiments demonstrate that models with strong next-event prediction accuracy can yield poor long-horizon forecasts and vice versa, indicating that specialized methods are needed for each task. To support further research, we release HoTPP, the first benchmark designed explicitly for evaluating long-horizon MTPP predictions. HoTPP includes large-scale datasets with up to 43 million events and provides optimized procedures for both autoregressive and parallel inference, paving the way for future advancements in the field.

Results

TaskDatasetMetricValueModel
Point ProcessesAmazon MTPPAccuracy (%)11.06NHP
Point ProcessesAmazon MTPPMAE0.449NHP
Point ProcessesAmazon MTPPOTD9.02NHP
Point ProcessesAmazon MTPPT-mAP26.29NHP
Point ProcessesAmazon MTPPAccuracy (%)35.73IFTPP
Point ProcessesAmazon MTPPMAE0.242IFTPP
Point ProcessesAmazon MTPPOTD6.52IFTPP
Point ProcessesAmazon MTPPT-mAP22.56IFTPP
Point ProcessesAmazon MTPPAccuracy (%)35.76RMTPP
Point ProcessesAmazon MTPPMAE0.294RMTPP
Point ProcessesAmazon MTPPOTD6.57RMTPP
Point ProcessesAmazon MTPPT-mAP20.06RMTPP
Point ProcessesStackOverflow MTPPAccuracy (%)45.43RMTPP
Point ProcessesStackOverflow MTPPMAE0.701RMTPP
Point ProcessesStackOverflow MTPPOTD13.17RMTPP
Point ProcessesStackOverflow MTPPT-mAP12.72RMTPP
Point ProcessesStackOverflow MTPPAccuracy (%)45.41IFTPP
Point ProcessesStackOverflow MTPPMAE0.641IFTPP
Point ProcessesStackOverflow MTPPOTD13.64IFTPP
Point ProcessesStackOverflow MTPPT-mAP8.31IFTPP
Point ProcessesRetweet MTPPAccuracy (%)59.95ODE-RNN
Point ProcessesRetweet MTPPMAE18.38ODE-RNN
Point ProcessesRetweet MTPPOTD165.3ODE-RNN
Point ProcessesRetweet MTPPT-mAP48.81ODE-RNN
Point ProcessesRetweet MTPPAccuracy (%)60.08NHP
Point ProcessesRetweet MTPPMAE18.42NHP
Point ProcessesRetweet MTPPOTD165.8NHP
Point ProcessesRetweet MTPPT-mAP45.07NHP
Point ProcessesRetweet MTPPAccuracy (%)60.07RMTPP
Point ProcessesRetweet MTPPMAE18.45RMTPP
Point ProcessesRetweet MTPPOTD166.7RMTPP
Point ProcessesRetweet MTPPT-mAP44.74RMTPP
Point ProcessesRetweet MTPPAccuracy (%)59.95IFTPP
Point ProcessesRetweet MTPPMAE18.27IFTPP
Point ProcessesRetweet MTPPOTD172.7IFTPP
Point ProcessesRetweet MTPPT-mAP31.75IFTPP
Point ProcessesRetweet MTPPAccuracy (%)60.03AttNHP
Point ProcessesRetweet MTPPMAE18.39AttNHP
Point ProcessesRetweet MTPPOTD171.6AttNHP
Point ProcessesRetweet MTPPT-mAP25.85AttNHP
Point ProcessesAgeGroup Transactions MTPPAccuracy (%)34.15RMTPP
Point ProcessesAgeGroup Transactions MTPPMAE0.749RMTPP
Point ProcessesAgeGroup Transactions MTPPOTD6.88RMTPP
Point ProcessesAgeGroup Transactions MTPPT-mAP6.69RMTPP
Point ProcessesAgeGroup Transactions MTPPAccuracy (%)34.08IFTPP
Point ProcessesAgeGroup Transactions MTPPMAE0.693IFTPP
Point ProcessesAgeGroup Transactions MTPPOTD6.9IFTPP
Point ProcessesAgeGroup Transactions MTPPT-mAP5.88IFTPP
Point ProcessesAgeGroup Transactions MTPPAccuracy (%)35.43NHP
Point ProcessesAgeGroup Transactions MTPPMAE0.696NHP
Point ProcessesAgeGroup Transactions MTPPOTD6.97NHP
Point ProcessesAgeGroup Transactions MTPPT-mAP5.61NHP
Point ProcessesAgeGroup Transactions MTPPAccuracy (%)35.6ODE-RNN
Point ProcessesAgeGroup Transactions MTPPMAE0.695ODE-RNN
Point ProcessesAgeGroup Transactions MTPPOTD6.97ODE-RNN
Point ProcessesAgeGroup Transactions MTPPT-mAP5.52ODE-RNN

Related Papers

Visual Place Recognition for Large-Scale UAV Applications2025-07-20Training Transformers with Enforced Lipschitz Constants2025-07-17Disentangling coincident cell events using deep transfer learning and compressive sensing2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17The Power of Architecture: Deep Dive into Transformer Architectures for Long-Term Time Series Forecasting2025-07-17DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition2025-07-16DCR: Quantifying Data Contamination in LLMs Evaluation2025-07-15A Multi-View High-Resolution Foot-Ankle Complex Point Cloud Dataset During Gait for Occlusion-Robust 3D Completion2025-07-15