TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Fidelity Isn't Accuracy: When Linearly Decodable Functions...

Fidelity Isn't Accuracy: When Linearly Decodable Functions Fail to Match the Ground Truth

Jackson Eshbaugh

2025-06-13regressionDiagnostic
PaperPDFCode(official)

Abstract

Neural networks excel as function approximators, but their complexity often obscures the nature of the functions they learn. In this work, we propose the linearity score $\lambda(f)$, a simple and interpretable diagnostic that quantifies how well a regression network's output can be mimicked by a linear model. Defined as the $R^2$ between the network's predictions and those of a trained linear surrogate, $\lambda(f)$ offers insight into the linear decodability of the learned function. We evaluate this framework on both synthetic ($y = x \sin(x) + \epsilon$) and real-world datasets (Medical Insurance, Concrete, California Housing), using dataset-specific networks and surrogates. Our findings show that while high $\lambda(f)$ scores indicate strong linear alignment, they do not necessarily imply predictive accuracy with respect to the ground truth. This underscores both the promise and the limitations of using linear surrogates to understand nonlinear model behavior, particularly in high-stakes regression tasks.

Results

TaskDatasetMetricValueModel
regressionConcrete Compressive StrengthR2 Score0.8588Neural Network
regressionConcrete Compressive Strengthlambda0.6659Neural Network
regressionConcrete Compressive StrengthR2 Score0.5944Baseline Regression
regressionConcrete Compressive StrengthR2 Score0.5821Mimic / Surrogate
regressionMedical Cost Personal DatasetR2 Score0.8673Neural Network
regressionMedical Cost Personal Datasetlambda0.9186Neural Network
regressionMedical Cost Personal DatasetR2 Score0.7836Baseline Regression
regressionMedical Cost Personal DatasetR2 Score0.7835Mimic / Surrogate
regressionSynthetic: y = x * sin xR2 Score0.9755Neural Network
regressionSynthetic: y = x * sin xlambda-0.0105Neural Network
regressionSynthetic: y = x * sin xR2 Score-0.008Baseline Regression
regressionSynthetic: y = x * sin xR2 Score-0.0137Mimic / Surrogate
regressionCalifornia Housing PricesR2 Score0.7908Neural Network
regressionCalifornia Housing Priceslambda0.6968Neural Network
regressionCalifornia Housing PricesR2 Score0.5758Baseline Regression
regressionCalifornia Housing PricesR2 Score0.5658Mimic / Surrogate

Related Papers

Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression2025-07-20Smart fault detection in satellite electrical power system2025-07-18Demographic-aware fine-grained classification of pediatric wrist fractures2025-07-17Neural Network-Guided Symbolic Regression for Interpretable Descriptor Discovery in Perovskite Catalysts2025-07-16Imbalanced Regression Pipeline Recommendation2025-07-16Second-Order Bounds for [0,1]-Valued Regression via Betting Loss2025-07-16Trustworthy Tree-based Machine Learning by $MoS_2$ Flash-based Analog CAM with Inherent Soft Boundaries2025-07-16Sparse Regression Codes exploit Multi-User Diversity without CSI2025-07-15