zhaoforever Goto Github PK
Type: User
Type: User
Sound event localization and detection of overlapping sources in three dimensions using convolutional recurrent neural network
Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)
Tools for Speech Enhancement integrated with Kaldi
SincNet is a neural architecture for efficiently processing raw audio samples.
Room acoustic simulator with a SOFA file loader.
Returns latest research results by crawling arxiv papers and summarizing abstracts. Helps you stay afloat with so many new papers everyday.
In this project is presented a simple method to train an MLP neural network for audio signals. The trained model can be exported on a Raspberry Pi (2 or superior suggested) to classify audio signal registered with USB microphone
This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: specialized decision surface (SDS) and disentangled feature (DF) for weakly-supervised learning and guided learning (GL) for semi-supervised learning. We're so glad if you're interested in using it for research purpose or DCASE participation.
DCASE 2018 TASK5
Signal Processing Tools for MATLAB
A Spatial Audio Framework written in C. Functions are included for computing VBAP gain tables, Ambisonics encoding/decoding, spherical array processing etc.
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
A neural network for end-to-end speech denoising
tensorflow训练语音增强脚本
speech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN
A PyTorch-based Speech Toolkit
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
:speech_balloon: SpeechPy - A Library for Speech Processing and Recognition
Deezer source separation library including pretrained models.
This repository contains code examples for the Stanford's course: TensorFlow for Deep Learning Research.
Pytorch: Channel-wise subband input for better voice and accompaniment separation
SubSpectralNet - Using Sub-Spectrogram based Convolutional Neural Networks for Acoustic Scene Classification, accepted in ICASSP 2019
Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.
We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new method for separating a mixed audio sequence, in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, while maintaining the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.
Portable base library for C programmers, designed for performance and simplicity.
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
Pytorch implementation of Tacotron
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.