TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods/WaveNet

WaveNet

AudioIntroduced 2000171 papers
Source Paper

Description

WaveNet is an audio generative model based on the PixelCNN architecture. In order to deal with long-range temporal dependencies needed for raw audio generation, architectures are developed based on dilated causal convolutions, which exhibit very large receptive fields.

The joint probability of a waveform x⃗={x1,…,xT}\vec{x} = \{ x_1, \dots, x_T \}x={x1​,…,xT​} is factorised as a product of conditional probabilities as follows:

p(x⃗)=∏t=1Tp(xt∣x1,…,xt−1)p\left(\vec{x}\right) = \prod_{t=1}^{T} p\left(x_t \mid x_1, \dots ,x_{t-1}\right)p(x)=∏t=1T​p(xt​∣x1​,…,xt−1​)

Each audio sample xtx_txt​ is therefore conditioned on the samples at all previous timesteps.

Papers Using This Method

Aliasing Reduction in Neural Amp Modeling by Smoothing Activations2025-05-07WaveNet-Volterra Neural Networks for Active Noise Control: A Fully Causal Approach2025-04-06An Ensemble Framework for Probabilistic Short-Term Load Forecasting Based on BiTCN and Deep Attention Networks2025-02-25Explore the Use of Time Series Foundation Model for Car-Following Behavior Analysis2025-01-13Autoregressive Speech Synthesis with Next-Distribution Prediction2024-12-22SeagrassFinder: Deep Learning for Eelgrass Detection and Coverage Estimation in the Wild2024-12-20Synthetic Time Series Data Generation for Healthcare Applications: A PCG Case Study2024-12-17Deep Learning-Based Approach for Identification and Compensation of Nonlinear Distortions in Parametric Array Loudspeakers2024-12-02Islanding Detection for Active Distribution Networks Using WaveNet+UNet Classifier2024-10-17RF Challenge: The Data-Driven Radio Frequency Signal Separation Challenge2024-09-13InstructSing: High-Fidelity Singing Voice Generation via Instructing Yourself2024-09-10Leveraging WaveNet for Dynamic Listening Head Modeling from Speech2024-09-08Training Universal Vocoders with Feature Smoothing-Based Augmentation Methods for High-Quality TTS Systems2024-09-04Synthesizing Audio from Silent Video using Sequence to Sequence Modeling2024-04-25Foundational GPT Model for MEG2024-04-14A Novel Approach to WaveNet Architecture for RF Signal Separation with Learnable Dilation and Data Augmentation2024-02-08Forecasting VIX using Bayesian Deep Learning2024-01-30An overview of text-to-speech systems and media applications2023-10-22Energy-Based Models For Speech Synthesis2023-10-19WaveNet: Wave-Aware Image Enhancement2023-10-10