Description
Bridge-net is an audio model block used in the ClariNet text-to-speech architecture. Bridge-net maps frame-level hidden representation to sample-level through several convolution blocks and transposed convolution layers interleaved with softsign non-linearities.
Papers Using This Method
Clarinet: A Music Retrieval System2022-10-23Learning from a Complementary-label Source Domain: Theory and Algorithms2020-08-04Clarinet: A One-step Approach Towards Budget-friendly Unsupervised Domain Adaptation2020-07-29Multi-Speaker End-to-End Speech Synthesis2019-07-09Non-Autoregressive Neural Text-to-Speech2019-05-21Neural source-filter waveform models for statistical parametric speech synthesis2019-04-27ClariNet: Parallel Wave Generation in End-to-End Text-to-Speech2018-07-19