GithubHelp home page GithubHelp logo

zhaoforever's Projects

seld-net icon seld-net

Sound event localization and detection of overlapping sources in three dimensions using convolutional recurrent neural network

sepstereo_eccv2020 icon sepstereo_eccv2020

Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)

setk icon setk

Tools for Speech Enhancement integrated with Kaldi

sincnet icon sincnet

SincNet is a neural architecture for efficiently processing raw audio samples.

sofamyroom icon sofamyroom

Room acoustic simulator with a SOFA file loader.

sotawhat icon sotawhat

Returns latest research results by crawling arxiv papers and summarizing abstracts. Helps you stay afloat with so many new papers everyday.

sound-classification-on-raspberry-pi-with-tensorflow icon sound-classification-on-raspberry-pi-with-tensorflow

In this project is presented a simple method to train an MLP neural network for audio signals. The trained model can be exported on a Raspberry Pi (2 or superior suggested) to classify audio signal registered with USB microphone

sound_event_detection icon sound_event_detection

This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: specialized decision surface (SDS) and disentangled feature (DF) for weakly-supervised learning and guided learning (GL) for semi-supervised learning. We're so glad if you're interested in using it for research purpose or DCASE participation.

spatial_audio_framework icon spatial_audio_framework

A Spatial Audio Framework written in C. Functions are included for computing VBAP gain tables, Ambisonics encoding/decoding, spherical array processing etc.

specaugment icon specaugment

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

speechmetrics icon speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

speechpy icon speechpy

:speech_balloon: SpeechPy - A Library for Speech Processing and Recognition

spleeter icon spleeter

Deezer source separation library including pretrained models.

subspectralnet icon subspectralnet

SubSpectralNet - Using Sub-Spectrogram based Convolutional Neural Networks for Acoustic Scene Classification, accepted in ICASSP 2019

sudo_rm_rf icon sudo_rm_rf

Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.

svoice icon svoice

We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new method for separating a mixed audio sequence, in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, while maintaining the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.

sx icon sx

Portable base library for C programmers, designed for performance and simplicity.

tacotron icon tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model

tacotron-1 icon tacotron-1

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.