TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Learning local discrete features in explainable-by-design ...

Learning local discrete features in explainable-by-design convolutional neural networks

Pantelis I. Kaplanoglou, Konstantinos Diamantaras

2024-10-31Image ClassificationExplainable Artificial Intelligence (XAI)Interpretable Machine Learning
PaperPDFCode(official)

Abstract

Our proposed framework attempts to break the trade-off between performance and explainability by introducing an explainable-by-design convolutional neural network (CNN) based on the lateral inhibition mechanism. The ExplaiNet model consists of the predictor, that is a high-accuracy CNN with residual or dense skip connections, and the explainer probabilistic graph that expresses the spatial interactions of the network neurons. The value on each graph node is a local discrete feature (LDF) vector, a patch descriptor that represents the indices of antagonistic neurons ordered by the strength of their activations, which are learned with gradient descent. Using LDFs as sequences we can increase the conciseness of explanations by repurposing EXTREME, an EM-based sequence motif discovery method that is typically used in molecular biology. Having a discrete feature motif matrix for each one of intermediate image representations, instead of a continuous activation tensor, allows us to leverage the inherent explainability of Bayesian networks. By collecting observations and directly calculating probabilities, we can explain causal relationships between motifs of adjacent levels and attribute the model's output to global motifs. Moreover, experiments on various tiny image benchmark datasets confirm that our predictor ensures the same level of performance as the baseline architecture for a given count of parameters and/or layers. Our novel method shows promise to exceed this performance while providing an additional stream of explanations. In the solved MNIST classification task, it reaches a comparable to the state-of-the-art performance for single models, using standard training setup and 0.75 million parameters.

Results

TaskDatasetMetricValueModel
Image ClassificationFashion-MNISTAccuracy93.45R-ExplaiNet-26
Image ClassificationFashion-MNISTPercentage error6.55R-ExplaiNet-26
Image ClassificationFashion-MNISTTrainable Parameters892362R-ExplaiNet-26
Image ClassificationOracle-MNISTAccuracy96.93R-ExplaiNet-26
Image ClassificationOracle-MNISTTrainable Parameters892362R-ExplaiNet-26
Image ClassificationCIFAR-10Percentage correct94.15R-ExplaiNet-26
Image ClassificationKuzushiji-MNISTAccuracy98.78R-ExplaiNet-26
Image ClassificationKuzushiji-MNISTError1.22R-ExplaiNet-26
Image ClassificationKuzushiji-MNISTTrainable Parameters892362R-ExplaiNet-26
Image ClassificationMNISTAccuracy99.8R-ExplaiNet-22 (single model)
Image ClassificationMNISTPercentage error0.2R-ExplaiNet-22 (single model)
Image ClassificationMNISTTrainable Parameters743882R-ExplaiNet-22 (single model)

Related Papers

NeuroXAI: Adaptive, robust, explainable surrogate framework for determination of channel importance in EEG application2025-09-12Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17Hashed Watermark as a Filter: Defeating Forging and Overwriting Attacks in Weight-based Neural Network Watermarking2025-07-15Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks2025-07-14