Tasks SotA Datasets Papers Methods Submit About

Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable Benchmarks All SotA Datasets Papers Methods

Community

Submit Results About

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods/WaveNet

WaveNet

AudioIntroduced 2000171 papers

Description

WaveNet is an audio generative model based on the PixelCNN architecture. In order to deal with long-range temporal dependencies needed for raw audio generation, architectures are developed based on dilated causal convolutions, which exhibit very large receptive fields.

The joint probability of a waveform $\vec{x} = \{ x_1, \dots, x_T \}$ is factorised as a product of conditional probabilities as follows:

$p\left(\vec{x}\right) = \prod_{t=1}^{T} p\left(x_t \mid x_1, \dots ,x_{t-1}\right)$

Each audio sample $x_t$ is therefore conditioned on the samples at all previous timesteps.

Papers Using This Method

Aliasing Reduction in Neural Amp Modeling by Smoothing Activations2025-05-07 WaveNet-Volterra Neural Networks for Active Noise Control: A Fully Causal Approach2025-04-06 An Ensemble Framework for Probabilistic Short-Term Load Forecasting Based on BiTCN and Deep Attention Networks2025-02-25 Explore the Use of Time Series Foundation Model for Car-Following Behavior Analysis2025-01-13 Autoregressive Speech Synthesis with Next-Distribution Prediction2024-12-22 SeagrassFinder: Deep Learning for Eelgrass Detection and Coverage Estimation in the Wild2024-12-20 Synthetic Time Series Data Generation for Healthcare Applications: A PCG Case Study2024-12-17 Deep Learning-Based Approach for Identification and Compensation of Nonlinear Distortions in Parametric Array Loudspeakers2024-12-02 Islanding Detection for Active Distribution Networks Using WaveNet+UNet Classifier2024-10-17 RF Challenge: The Data-Driven Radio Frequency Signal Separation Challenge2024-09-13 InstructSing: High-Fidelity Singing Voice Generation via Instructing Yourself2024-09-10 Leveraging WaveNet for Dynamic Listening Head Modeling from Speech2024-09-08 Training Universal Vocoders with Feature Smoothing-Based Augmentation Methods for High-Quality TTS Systems2024-09-04 Synthesizing Audio from Silent Video using Sequence to Sequence Modeling2024-04-25 Foundational GPT Model for MEG2024-04-14 A Novel Approach to WaveNet Architecture for RF Signal Separation with Learnable Dilation and Data Augmentation2024-02-08 Forecasting VIX using Bayesian Deep Learning2024-01-30 An overview of text-to-speech systems and media applications2023-10-22 Energy-Based Models For Speech Synthesis2023-10-19 WaveNet: Wave-Aware Image Enhancement2023-10-10