hcwu1993 Goto Github PK
Type: User
Type: User
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Awesome Knowledge Distillation
TTS
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
DCGAN LSGAN WGAN-GP DRAGAN Tensorflow 2
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
A collection of links and notes on forced alignment tools
All audio extracted from Genshin Impact, music, voicelines and everything else
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Begining of github
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Inference Llama 2 in one file of pure C
This is now the official location of the Merlin project.
Command line utility for forced alignment using Kaldi
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
Recipes for using Python's pandas library
Inference and training library for high-quality TTS models.
RNN-based generative models for speech.
Taming Transformers for High-Resolution Image Synthesis
Computation using data flow graphs for scalable machine learning
TensorFlow Tutorial and Examples for beginners
A TensorFlow implementation of DeepMind's WaveNet paper
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis
WaveNet vocoder
python 微信《跳一跳》辅助
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.