hyzhan Goto Github PK

followers: 9 following: 41 repos: 63 gists: 0

Type: User

Location: Guangzhou

hyzhan's Projects

auraloss

Collection of audio-focused loss functions in PyTorch

caffe

Caffe: a fast open framework for deep learning.

chinese_conversation_sentiment

A Chinese sentiment dataset that may be useful for sentiment analysis.

cnn_graph

Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering

cosyvoice

Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.

deep-voice-conversion

Deep neural networks for voice conversion (voice style transfer) in TensorFlow

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model working on the raw waveform that runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform, which further improve model performance and its generalization abilities.

fcn.tensorflow

TensorFlow implementation of Fully Convolutional Networks for Semantic Segmentation (http://fcn.berkeleyvision.org)

flowavenet

A PyTorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"

forced-alignment-tools

A collection of links and notes on forced alignment tools

frauddetection

Examples and Tutorials related to fraud detection with machine learning and deep learning

g2pm

A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset

gcn

Implementation of Graph Convolutional Networks in TensorFlow

gpt-sovits

One minute of voice data can be enough to train a good TTS model! (few-shot voice cloning)

gpt2-chinese

Chinese version of GPT2 training code, using BERT tokenizer.

grafx

GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch

hyzhan.github.io

GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
