Name: Chenxu Hu
Type: User
Company: Tsinghua University
Bio: I am a Ph.D. student in Computer Science at IIIS, Tsinghua University. I am especially interested in multi-modal machine learning and audio & speech processing.
Location: Beijing, China
Blog: https://huchenxucs.github.io/
Chenxu Hu's Projects
The official repository of "ChatDB: Augmenting LLMs with Databases as Their Symbolic Memory".
CS231n self-learning
My implementations of cs231n 2017
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
DEECAMP summary
This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for building the prosody encoder.
Generative Flow based Sequence-to-Sequence Toolkit written in Python.
the first try / new ideas
Chenxu Hu's personal page
Minimal, single page, smooth-scrolling theme for Hugo static site generator.
This repository contains the code for our CVPR 2020 paper titled "Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis"
Project for Android camp of ByteDance
Code for prefix beam search tutorial by @labodk
Code for "PVNet: Pixel-wise Voting Network for 6DoF Pose Estimation" CVPR 2019 oral
Collection of generative models implemented in PyTorch.
Short term following the second semester of sophomore year
A simple bash script for switching between installed versions of CUDA.
Out of time: automated lip sync in the wild
My config for Ubuntu 16.04
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
WaveRNN Vocoder + TTS
Production First and Production Ready End-to-End Speech Recognition Toolkit
Zhejiang University course guide sharing project