zhaoforever Goto Github PK

followers: 7.0 following: 4.0 repos: 197.0 gists: 0.0

Type: User

zhaoforever's Projects

seld-net

Sound event localization and detection of overlapping sources in three dimensions using convolutional recurrent neural network

sepstereo_eccv2020

Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)

setk

Tools for Speech Enhancement integrated with Kaldi

sincnet

SincNet is a neural architecture for efficiently processing raw audio samples.

sofamyroom

Room acoustic simulator with a SOFA file loader.

sotawhat

Returns latest research results by crawling arxiv papers and summarizing abstracts. Helps you stay afloat with so many new papers everyday.

sound-classification-on-raspberry-pi-with-tensorflow

In this project is presented a simple method to train an MLP neural network for audio signals. The trained model can be exported on a Raspberry Pi (2 or superior suggested) to classify audio signal registered with USB microphone

sound_event_detection

This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: specialized decision surface (SDS) and disentangled feature (DF) for weakly-supervised learning and guided learning (GL) for semi-supervised learning. We're so glad if you're interested in using it for research purpose or DCASE participation.

sound_event_detection_dcase2017_task4

sounds_detection

DCASE 2018 TASK5

soundscapecityclassification

soundzone_tools

Signal Processing Tools for MATLAB

spatial_audio_framework

A Spatial Audio Framework written in C. Functions are included for computing VBAP gain tables, Ambisonics encoding/decoding, spherical array processing etc.

specaugment

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

speech-denoising-wavenet

A neural network for end-to-end speech denoising

speech-enhancement

tensorflow训练语音增强脚本

speech-enhancement-wgan

speech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN

speechbrain

A PyTorch-based Speech Toolkit

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

speechpy

:speech_balloon: SpeechPy - A Library for Speech Processing and Recognition

spleeter

Deezer source separation library including pretrained models.

stanford-tensorflow-tutorials

This repository contains code examples for the Stanford's course: TensorFlow for Deep Learning Research.

subband-music-separation

Pytorch: Channel-wise subband input for better voice and accompaniment separation

subspectralnet

SubSpectralNet - Using Sub-Spectrogram based Convolutional Neural Networks for Acoustic Scene Classification, accepted in ICASSP 2019

sudo_rm_rf

Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.

svoice

We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new method for separating a mixed audio sequence, in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, while maintaining the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.

zhaoforever Goto Github PK

zhaoforever's Projects

Recommend Projects

Recommend Topics

Recommend Org

Jobs