GithubHelp home page GithubHelp logo

zhaoforever's Projects

manyears icon manyears

ManyEars Sound Source Localization, Tracking and Separation

meta-tasnet icon meta-tasnet

A PyTorch implementation of Meta-TasNet from "Meta-learning Extractors for Music Source Separation

model-compression icon model-compression

model compression based on pytorch (1、quantization: 16/8/4/2 bits(dorefa/Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference)、ternary/binary value(twn/bnn/xnor-net);2、 pruning: normal、regular and group convolutional channel pruning;3、 group convolution structure;4、batch-normalization folding for quantization)

ms-snsd icon ms-snsd

The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.

mtf-crnn icon mtf-crnn

Inspired by the convolutional recurrent neural network(CRNN) and inception, we propose a multiscale time-frequency convolutional recurrent neural network (MTF-CRNN) for audio event detection. Our goal is to improve audio event detection performance and recognize target audio events that have different lengths and accompany the complex audio background. We exploit multi-groups of parallel and serial convolutional kernels to learn high-level shift invariant features from the time and frequency domains of acoustic samples. A two-layer bi-direction gated recurrent unit) based on the recurrent neural network is used to capture the temporal context from the extracted high-level features. The proposed method is evaluated on the DCASE2017 challenge dataset. Compared to other methods, the MTF-CRNN achieves one of the best test performances for a single model without pre-training and without using a multi-model ensemble approach.

musicnn icon musicnn

Pronounced as "musician", musicnn is a set of pre-trained deep convolutional neural networks for music audio tagging.

mvision icon mvision

机器人视觉 无人驾驶 视觉SLAM ORB LSD SVO DSO 深度学习目标检测yolov3 行为检测 opencv PCL 双目视觉

nara_wpe icon nara_wpe

Different implementations of "Weighted Prediction Error" for speech dereverberation

ncnn icon ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

nisqa icon nisqa

NISQA - Non-Intrusive Speech Quality Assessment

nn-irm icon nn-irm

A Simple DNN-IRM estimator for speech enhancement

nnaudio icon nnaudio

Audio processing by using pytorch 1D convolution network

nndl.github.io icon nndl.github.io

《神经网络与深度学习》 Neural Network and Deep Learning

nnom icon nnom

A higher-level Neural Network library for microcontrollers.

nussl icon nussl

A simple audio source separation library built in python

onssen icon onssen

An open-source speech separation and enhancement library

openmha icon openmha

The open Master Hearing Aid (openMHA)

opensmile icon opensmile

A github repo of the openSMILE feature extraction tool.

paderwasn icon paderwasn

Paderwasn is a collection of methods for acoustic signal processing in wireless acoustic sensor networks (WASNs).

panotti icon panotti

A multi-channel neural network audio classifier using Keras

percepnet icon percepnet

(Work In Progress) Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.