cenwurong Goto Github PK

followers: 0.0 following: 1.0 repos: 30.0 gists: 0.0

Type: User

cenwurong's Projects

amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

bottleneck

Fast NumPy array functions written in C

bp

BP（Back Propagation）by matlab

chinesenlpcorpus

中文自然语言处理数据集，平时做做实验的材料。欢迎补充提交合并。

cloudcompare

CloudCompare main repository

ctc_beam_search_lm

CTC+Beam_Search+kenlm 是用于以汉字为声学模型建模单元的解码系统

dccrn

implementation of "DCCRN-Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement" by pytorch

ecapa-tdnn

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

factorized-tdnn

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

fastdiff

PyTorch Implementation of FastDiff (IJCAI'22)

gpt2-chinese

Chinese version of GPT2 training code, using BERT tokenizer.

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

kaldi-ndk-feature

kaldi_nnet3

Simple, yet fast, Python scripts to read Kaldi NNet3 models and compute bottleneck features

llama-factory

Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)

masr

中文语音识别系列，读者可以借助它快速训练属于自己的中文语音识别模型，或直接使用预训练模型测试效果。

netron

Visualizer for neural network, deep learning, and machine learning models

nlp-interview-notes

该仓库主要记录 NLP 算法工程师相关的面试题

nonparaseq2seqvc_code

Implementation code of non-parallel sequence-to-sequence VC

python_kaldi_features

python codes to extract MFCC and FBANK speech features for Kaldi

pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

pytorch-tacotron

text to speach model

pytsmod

An open-source Python library for audio time-scale modification.

snoring_net

snoring detection project using LNN(linger&thinker)

vall-e-x

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

wekws

Production First and Production Ready End-to-End Keyword Spotting Toolkit

wespeaker

Research and Production Oriented Speaker Recognition Toolkit

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

whisper-finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment

whisperfusion

WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.

cenwurong Goto Github PK

cenwurong's Projects

Recommend Projects

Recommend Topics

Recommend Org

Jobs