runngezhang Goto Github PK
Type: User
Type: User
Voice Activity Detector Module Port From WebRTC
Voice activity detection (VAD) library and Go bindings based on WebRTC's VAD engine
This is the Wrapper Library for WebRTC Voice Engine. Including Acoustic Echo Cancellation (AEC), Noise Suppression (NS), VAD (Voice Active Detection) and so on.
Example code for Weight Normalization, from "Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks"
Production First and Production Ready End-to-End Speech Recognition Toolkit
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.
Production First and Production Ready End-to-End Text-to-Speech Toolkit
Wiener Filer based Speech Enhancement(deep neural networks, LSTM)
Wrapped Gaussian Mixture Model for angular clustering
Anime Scene Search by Image
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
Robust Speech Recognition via Large-Scale Weak Supervision
Port of OpenAI's Whisper model in C/C++
95.8% and 80% on CIFAR-10 and CIFAR-100
Best CIFAR-10, CIFAR-100 results with wide-residual networks using PyTorch
Generating multi-channel wind noise based on the Corcos model
Multi-layer Recurrent Neural Networks (LSTM, RNN) for word-level language models in Python using TensorFlow.
Multiple version of word2vec. https://code.google.com/p/word2vec/
Automatically exported from code.google.com/p/word2vec
Segmentation of a text-line into words.
A high-quality speech analysis, manipulation and synthesis system
wrassp is a wrapper for R around Michel Scheffers's libassp (Advanced Speech Signal Processor). The libassp library aims at providing functionality for handling speech signal files in most common audio formats and for performing analyses common in phonetic science/speech science. This includes the calculation of formants, fundamental frequency, root mean square, auto correlation, a variety of spectral analyses, zero crossing rate, filtering etc. This wrapper provides R with a large subset of libassp's signal processing functions and provides them to the user in a (hopefully) user-friendly manner. The wrassp package is used by the EMU Speech Database Management System (EMU-SDMS) to perform signal processing routines.
My paper, note and anything in text.
Weighted Spatial Covariance Matrix Estimation for MUSIC based TDOA Estimation of Speech Source
Convert WSJ sphere format to waveform and do data simulation.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.