Description
WaveGrad DBlocks are used to downsample the temporal dimension of noisy waveform in WaveGrad. They are similar to UBlocks except that only one residual block is included. The dilation factors are 1, 2, 4 in the main branch. Orthogonal initialization is used.
Papers Using This Method
GLA-Grad: A Griffin-Lim Extended Waveform Generation Diffusion Model2024-02-09BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis2022-03-25InferGrad: Improving Diffusion Models for Vocoder by Considering Inference in Training2022-02-08Quasi-Taylor Samplers for Diffusion Generative Models based on Ideal Derivatives2021-12-26VocBench: A Neural Vocoder Benchmark for Speech Synthesis2021-12-06WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis2021-06-17WaveGrad: Estimating Gradients for Waveform Generation2020-09-02