shaun95
Name: shaun
Type: User
Bio: all-curious about neural synthesizers
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav) system, supporting a family of SOTA unsupervised duration models. This project grows with the research community, aiming to achieve the ultimate E2E-TTS.
A Non-Autoregressive Transformer-based TTS, supporting a family of SOTA transformers with supervised and unsupervised duration models. This project grows with the research community, aiming to achieve the ultimate TTS.
PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
[NeurIPS'22] CoNT: Contrastive Neural Text Generation
speech self-supervised representations
An implementation of the Contrastive Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
A technical report on convolution arithmetic in the context of deep learning
Code release for ConvNeXt model
This repo contains conv-tasnet for basis-melgan. For the basis-melgan code, please refer to FastVocoder.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Official code for Cotatron @ INTERSPEECH 2020
An invertible and differentiable implementation of the Constant-Q Transform (CQT).
bandwidth extension
CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)
Tacotron 2 - PyTorch implementation with faster-than-realtime inference, modified to enable cross-lingual voice cloning.
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
Matrix Multiplication on GPU using Shared Memory considering Coalescing and Bank Conflicts
Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech
Voice Conversion by CycleGAN (voice cloning / voice conversion): CycleGAN-VC3
Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in PyTorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch
[NeurIPS 2020] Differentiable Augmentation for Data-Efficient GAN Training
This repository contains the code for the main part of my master's thesis in Data Science & Engineering at Politecnico di Torino
Datasets for turn-taking research
A PyTorch implementation of DC-TTS