Tasks SotA Datasets Papers Methods Submit About

Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable Benchmarks All SotA Datasets Papers Methods

Community

Submit Results About

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods/Phase Shuffle

Phase Shuffle

AudioIntroduced 200030 papers

Description

Phase Shuffle is a technique for removing pitched noise artifacts that come from using transposed convolutions in audio generation models. Phase shuffle is an operation with hyperparameter $n$ . It randomly perturbs the phase of each layer’s activations by − $n$ to $n$ samples before input to the next layer.

In the original application in WaveGAN, the authors only apply phase shuffle to the discriminator, as the latent vector already provides the generator a mechanism to manipulate the phase of a resultant waveform. Intuitively speaking, phase shuffle makes the discriminator’s job more challenging by requiring invariance to the phase of the input waveform.

Papers Using This Method

NAIST Simultaneous Speech Translation System for IWSLT 20242024-06-30 (Un)paired signal-to-signal translation with 1D conditional GANs2024-03-05 The Effects of Signal-to-Noise Ratio on Generative Adversarial Networks Applied to Marine Bioacoustic Data2023-12-22 Framewise WaveGAN: High Speed Adversarial Vocoder in Time Domain with Very Low Computational Complexity2022-12-08 HiFi-WaveGAN: Generative Adversarial Network with Auxiliary Spectrogram-Phase Loss for High-Fidelity Singing Voice Generation2022-10-23 WaveGAN: Frequency-aware GAN for High-Fidelity Few-shot Image Generation2022-07-15 WOLONet: Wave Outlooker for Efficient and High Fidelity Speech Synthesis2022-06-20 NatiQ: An End-to-end Text-to-Speech System for Arabic2022-06-15 Unified Source-Filter GAN with Harmonic-plus-Noise Source Excitation Generation2022-05-12 MSR-NV: Neural Vocoder Using Multiple Sampling Rates2021-09-28 StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion2021-07-21 Digital Einstein Experience: Fast Text-to-Speech for Conversational AI2021-07-21 Interpreting intermediate convolutional layers of generative CNNs trained on waveforms2021-04-19 Unified Source-Filter GAN: Unified Source-filter Network Based On Factorization of Quasi-Periodic Parallel WaveGAN2021-04-10 Adversarial Attacks and Defenses for Speech Recognition Systems2021-03-31 Improve GAN-based Neural Vocoder using Pointwise Relativistic LeastSquare GAN2021-03-26 LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation2021-02-22 Study of Pre-processing Defenses against Adversarial Attacks on State-of-the-art Speaker Recognition Systems2021-01-22 Synthesising Realistic Calcium Imaging Data of Neuronal Populations Using GAN2021-01-01 StyleMelGAN: An Efficient High-Fidelity Adversarial Vocoder with Temporal Adaptive Normalization2020-11-03