Abstract: The quality of the raw audio waveform generated by a vocoder affects various audio generative tasks. In recent years, the dominance of source-filter vocoders has been greatly challenged by ...
Abstract: A new neural network architecture is proposed that can be used to convert Mel spectrograms into an audio signal. The architecture is designed from the ground up to be run on a mobile device, ...
This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. This was my master's ...