yuchen005
Name: Yuchen Hu
Type: User
Company: Nanyang Technological University
Bio: Ph.D. student at NTU; research focuses on large language models (LLMs), speech processing, and multimodal learning.
Location: Singapore
Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"
Code for paper "GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators"
Code for paper "Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition"
Code for paper "Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition"
Single-blind supplementary materials for NeurIPS 2023 submission
Code for paper "MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition"
Code for paper "Noise-aware Speech Enhancement using Diffusion Probabilistic Model"
This is a public repository for RATS Channel-A Speech Data, a paid noisy speech dataset distributed by LDC. Here we release its Log-Mel Fbank features and several raw waveform listening samples.
Code for paper "Large Language Models are Efficient Learners of Noise-Robust Speech Recognition"
Code for paper "Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models"
Code for paper "Unsupervised Noise adaptation using Data Simulation"
Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"
Code for paper "Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition"
AcadHomepage: A Modern and Responsive Academic Personal Homepage