A multiclass multiple-instance learning method. It only needs to know whether any instance of a class exists in a sample. Demonstrated on multiple object detection and localization.

multi-task-speech-classification

Multi-task speech classification of the accent and gender of an English speaker on Mozilla's Common Voice dataset

musegan

An AI for Music Generation

music-audio-tagging-at-scale-models

TensorFlow implementation of the models used in "End-to-end learning for music audio tagging at scale"

music-auto_tagging-keras

Music auto-tagging models and trained weights in Keras/Theano

ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

nn-vad

Simple DNN-based VAD (voice activity detection)

nnaec-neuralnetworkbasedacousticechocancellation

NNAEC: Neural Network-based Acoustic Echo Cancellation

nnaudio

Audio processing using PyTorch 1D convolutional networks

open-speaker-verification

Speaker verification tool

opus

Modern audio compression for the internet.

paddlespeech

Easy-to-use Speech Toolkit including SOTA ASR pipeline, influential TTS with text frontend and End-to-End Speech Simultaneous Translation.

panotti

A multi-channel neural network audio classifier using Keras

percepnet

(Under construction) Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech

percepnet-keras

PercepNet implemented using Keras; still needs to be optimized and tuned.

ppg-vc

PPG-Based Voice Conversion

dyang's Projects