Abstract: The quality of raw audio waveform generated by a vocoder could affect various audio generative tasks. In recent years, the dominance of source-filter vocoders was greatly challenged by ...
Despite significant advances in neural vocoders using diffusion models and their variants, these methods, unfortunately, inherently suffer from a performance-inference dilemma, which stems from the ...
This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. Feel free to check my ...
Abstract: A new neural network architecture is proposed that can be used to convert Mel spectrograms into an audio signal. The architecture is designed from the ground up to be run on a mobile device, ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results