TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Deep Sketched Output Kernel Regression for Structured Pred...

Deep Sketched Output Kernel Regression for Structured Prediction

Tamim El Ahmad, Junjie Yang, Pierre Laforgue, Florence d'Alché-Buc

2024-06-13Cross-Modal RetrievalStructured PredictionregressionPrediction
PaperPDFCode(official)

Abstract

By leveraging the kernel trick in the output space, kernel-induced losses provide a principled way to define structured output prediction tasks for a wide variety of output modalities. In particular, they have been successfully used in the context of surrogate non-parametric regression, where the kernel trick is typically exploited in the input space as well. However, when inputs are images or texts, more expressive models such as deep neural networks seem more suited than non-parametric methods. In this work, we tackle the question of how to train neural networks to solve structured output prediction tasks, while still benefiting from the versatility and relevance of kernel-induced losses. We design a novel family of deep neural architectures, whose last layer predicts in a data-dependent finite-dimensional subspace of the infinite-dimensional output feature space deriving from the kernel-induced loss. This subspace is chosen as the span of the eigenfunctions of a randomly-approximated version of the empirical kernel covariance operator. Interestingly, this approach unlocks the use of gradient descent algorithms (and consequently of any neural architecture) for structured prediction. Experiments on synthetic tasks as well as real-world supervised graph prediction problems show the relevance of our method.

Results

TaskDatasetMetricValueModel
Image Retrieval with Multi-Modal QueryChEBI-20Hits@151DSOKR
Image Retrieval with Multi-Modal QueryChEBI-20Hits@1088.2DSOKR
Image Retrieval with Multi-Modal QueryChEBI-20Mean Rank76.43DSOKR
Image Retrieval with Multi-Modal QueryChEBI-20Test MRR64.2DSOKR
Cross-Modal Information RetrievalChEBI-20Hits@151DSOKR
Cross-Modal Information RetrievalChEBI-20Hits@1088.2DSOKR
Cross-Modal Information RetrievalChEBI-20Mean Rank76.43DSOKR
Cross-Modal Information RetrievalChEBI-20Test MRR64.2DSOKR
Cross-Modal RetrievalChEBI-20Hits@151DSOKR
Cross-Modal RetrievalChEBI-20Hits@1088.2DSOKR
Cross-Modal RetrievalChEBI-20Mean Rank76.43DSOKR
Cross-Modal RetrievalChEBI-20Test MRR64.2DSOKR

Related Papers

Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction2025-07-21Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression2025-07-20Neural Network-Guided Symbolic Regression for Interpretable Descriptor Discovery in Perovskite Catalysts2025-07-16Imbalanced Regression Pipeline Recommendation2025-07-16Second-Order Bounds for [0,1]-Valued Regression via Betting Loss2025-07-16Sparse Regression Codes exploit Multi-User Diversity without CSI2025-07-15Generative Click-through Rate Prediction with Applications to Search Advertising2025-07-15Conformation-Aware Structure Prediction of Antigen-Recognizing Immune Proteins2025-07-11