GithubHelp home page GithubHelp logo

xuridongsheng7142's Projects

open-speech-corpora icon open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

personalvad icon personalvad

An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.

pyannote-audio icon pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pyaudioanalysis icon pyaudioanalysis

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

resemblyzer icon resemblyzer

A python package to analyze and compare voices with deep learning

spleeter icon spleeter

Deezer source separation library including pretrained models.

tnpy icon tnpy

a text analyzing (match, rewrite, extract) engine (python edition)

transferlearning icon transferlearning

Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习

transformers icon transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

uis-rnn icon uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

vbx icon vbx

Variational Bayes HMM over x-vectors diarization

vi_g2p icon vi_g2p

grapheme-to-phoneme method, converts any Vietnamese word from grapheme-based into a phoneme-based pronunciation that integrates tone information. It is usefull to create a lexicon for deverloping a Vi LVCSR system.

voxpopuli icon voxpopuli

A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation

wekws icon wekws

Production First and Production Ready End-to-End Keyword Spotting Toolkit

wenet icon wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

whisper icon whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.