inconnu11's Projects
Audio style transfer with shallow random parameters CNN. Result: https://soundcloud.com/mazzzystar/sets/speech-conversion-sample
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Reformer, the efficient Transformer, in Pytorch
A python package to analyze and compare voices with deep learning
An elegant \LaTeX\ résumé template
The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"
This repository contains PyTorch implementation of 4 different models for classification of emotions of the speech.
Towards hot directions in industrial speech recognition
Unsupervised Speech Decomposition Via Triple Information Bottleneck
Voice Conversion Tool Kit
Matlab implementation of sift(opensift) algorithm.
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
DeepMind's Tacotron-2 Tensorflow implementation
Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
Trax — your path to advanced deep learning
A Trojan proxy written in golang. An unidentifiable mechanism that helps you bypass GFW. Golang实现的Trojan代理,支持多个平台,无依赖
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
Uniform Manifold Approximation and Projection
Re-implementation the code used in Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder
Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.
Voice Conversion using Tacotron.
Voice Conversion Challenge 2020 CycleVAE baseline system
Voice Converter Using CycleGAN and Non-Parallel Data
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Chainer implementation of Deepmind's WaveNet