dongsig Goto Github PK
Name: dyang
Type: User
Company: Tencent
Bio: Speech
Location: Shanghai
Name: dyang
Type: User
Company: Tencent
Bio: Speech
Location: Shanghai
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
A simple example of one-dimensional Gaussian mixture model
Google AI Research
2nd place solution for ID R&D Voice Antispoofing Challenge
Instantaneous pitch estimation based on RAPT framework (EUSIPCO-2012)
Neural network density models for speech separation.
using microphone
code for KalmanNet
bring keras-models to production with tensorflow-serving and nodejs + docker :pizza:
An End-to-End Architecture for Keyword Spotting and Voice Activity Detection
Keyword Spotting for detecting a word in an audio file
A KWS model trained on SpeechCommands dataset, written in PyTorch.
digital signal processing library for software-defined radios
A multiclass multiple instance learning method. Only need to know whether any instance of a class exists or not in a sample. Demonstration on multiple object detection and localization.
Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset
An AI for Music Generation
Tensorflow implementation of the models used in "End-to-end learning for music audio tagging at scale"
Music auto-tagging models and trained weights in keras/theano
ncnn is a high-performance neural network inference framework optimized for the mobile platform
simple dnn based vad
NNAEC-Neural Network based Acoustic Echo Cancellation
Audio processing by using pytorch 1D convolution network
speaker verification tool
Modern audio compression for the internet.
Easy-to-use Speech Toolkit including SOTA ASR pipeline, influential TTS with text frontend and End-to-End Speech Simultaneous Translation.
A multi-channel neural network audio classifier using Keras
(Under construct) Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech
percepnet implemented using Keras, still need to be optimized and tuned.
PPG-Based Voice Conversion
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.