zhaoforever Goto Github PK
Type: User
Type: User
ManyEars Sound Source Localization, Tracking and Separation
A PyTorch implementation of Meta-TasNet from "Meta-learning Extractors for Music Source Separation
Machine Learning Sound Classifier
Companion webpage to the book "Mathematics For Machine Learning"
model compression based on pytorch (1、quantization: 16/8/4/2 bits(dorefa/Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference)、ternary/binary value(twn/bnn/xnor-net);2、 pruning: normal、regular and group convolutional channel pruning;3、 group convolution structure;4、batch-normalization folding for quantization)
The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.
Inspired by the convolutional recurrent neural network(CRNN) and inception, we propose a multiscale time-frequency convolutional recurrent neural network (MTF-CRNN) for audio event detection. Our goal is to improve audio event detection performance and recognize target audio events that have different lengths and accompany the complex audio background. We exploit multi-groups of parallel and serial convolutional kernels to learn high-level shift invariant features from the time and frequency domains of acoustic samples. A two-layer bi-direction gated recurrent unit) based on the recurrent neural network is used to capture the temporal context from the extracted high-level features. The proposed method is evaluated on the DCASE2017 challenge dataset. Compared to other methods, the MTF-CRNN achieves one of the best test performances for a single model without pre-training and without using a multi-model ensemble approach.
A tensorflow implementation of my paper Combining beamforming and deep neural networks for multi-channel speech extraction
Separating singing voice from music based on deep neural networks in Tensorflow
Pronounced as "musician", musicnn is a set of pre-trained deep convolutional neural networks for music audio tagging.
机器人视觉 无人驾驶 视觉SLAM ORB LSD SVO DSO 深度学习目标检测yolov3 行为检测 opencv PCL 双目视觉
Different implementations of "Weighted Prediction Error" for speech dereverberation
ncnn is a high-performance neural network inference framework optimized for the mobile platform
NISQA - Non-Intrusive Speech Quality Assessment
A Simple DNN-IRM estimator for speech enhancement
Audio processing by using pytorch 1D convolution network
《神经网络与深度学习》 Neural Network and Deep Learning
A higher-level Neural Network library for microcontrollers.
Implements python programs to train and test a Recurrent Neural Network with Tensorflow
A simple audio source separation library built in python
An open-source speech separation and enhancement library
The open Master Hearing Aid (openMHA)
A github repo of the openSMILE feature extraction tool.
Paderwasn is a collection of methods for acoustic signal processing in wireless acoustic sensor networks (WASNs).
A multi-channel neural network audio classifier using Keras
A Python library for adding effects to audio.
(Work In Progress) Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.