TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Deep Polynomial Neural Networks

Deep Polynomial Neural Networks

Grigorios Chrysos, Stylianos Moschoglou, Giorgos Bouritsas, Jiankang Deng, Yannis Panagakis, Stefanos Zafeiriou

2020-06-20Face RecognitionImage ClassificationRepresentation LearningFace VerificationImage GenerationFace IdentificationConditional Image Generation
PaperPDFCodeCodeCodeCode(official)Code

Abstract

Deep Convolutional Neural Networks (DCNNs) are currently the method of choice both for generative, as well as for discriminative learning in computer vision and machine learning. The success of DCNNs can be attributed to the careful selection of their building blocks (e.g., residual blocks, rectifiers, sophisticated normalization schemes, to mention but a few). In this paper, we propose $\Pi$-Nets, a new class of function approximators based on polynomial expansions. $\Pi$-Nets are polynomial neural networks, i.e., the output is a high-order polynomial of the input. The unknown parameters, which are naturally represented by high-order tensors, are estimated through a collective tensor factorization with factors sharing. We introduce three tensor decompositions that significantly reduce the number of parameters and show how they can be efficiently implemented by hierarchical neural networks. We empirically demonstrate that $\Pi$-Nets are very expressive and they even produce good results without the use of non-linear activation functions in a large battery of tasks and signals, i.e., images, graphs, and audio. When used in conjunction with activation functions, $\Pi$-Nets produce state-of-the-art results in three challenging tasks, i.e. image generation, face verification and 3D mesh representation learning. The source code is available at \url{https://github.com/grigorisg9gr/polynomial_nets}.

Results

TaskDatasetMetricValueModel
Facial Recognition and ModellingAgeDB-30Accuracy0.98467Prodpoly
Facial Recognition and ModellingLFWAccuracy0.99833Prodpoly
Facial Recognition and ModellingCALFWAccuracy0.96233Prodpoly
Image GenerationCIFAR-10FID16.79ProdPoly
Image GenerationCIFAR-10FID40.45ProdPoly no activation functions
Image GenerationCIFAR-10FID36.77ProdPoly no activation functions
Image GenerationCIFAR-10Inception score7.5ProdPoly no activation functions
Image ClassificationCIFAR-10Percentage correct94.9Prodpoly
Face ReconstructionAgeDB-30Accuracy0.98467Prodpoly
Face ReconstructionLFWAccuracy0.99833Prodpoly
Face ReconstructionCALFWAccuracy0.96233Prodpoly
Face RecognitionAgeDB-30Accuracy0.98467Prodpoly
Face RecognitionLFWAccuracy0.99833Prodpoly
Face RecognitionCALFWAccuracy0.96233Prodpoly
3DAgeDB-30Accuracy0.98467Prodpoly
3DLFWAccuracy0.99833Prodpoly
3DCALFWAccuracy0.96233Prodpoly
3D Face ModellingAgeDB-30Accuracy0.98467Prodpoly
3D Face ModellingLFWAccuracy0.99833Prodpoly
3D Face ModellingCALFWAccuracy0.96233Prodpoly
3D Face ReconstructionAgeDB-30Accuracy0.98467Prodpoly
3D Face ReconstructionLFWAccuracy0.99833Prodpoly
3D Face ReconstructionCALFWAccuracy0.96233Prodpoly
Conditional Image GenerationCIFAR-10FID36.77ProdPoly no activation functions
Conditional Image GenerationCIFAR-10Inception score7.5ProdPoly no activation functions

Related Papers

ProxyFusion: Face Feature Aggregation Through Sparse Experts2025-09-24Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper2025-07-20Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17