Peng Zhang's Projects

av-se

Deep-Learning-Based Audio-Visual Speech Enhancement and Separation

avobjects

Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"

awesome-speech-enhancement

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

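coder2gwy

The internet's first guide for programmers preparing for the civil service exam, jointly presented by three former big-tech programmers who have already moved into government jobs.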

deepxi

Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.

dlib

A toolkit for making real world machine learning and data analysis applications in C++

fast_bss_eval

A fast implementation of bss_eval metrics for blind source separation

fullsubnet

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

gpurir

Python library for Room Impulse Response (RIR) simulation with GPU acceleration

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

libfacedetection

An open source library for face detection in images. The face detection speed can reach 1000 FPS.

lipnet-pytorch

A state-of-the-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxiv.org/abs/1611.01599)

lora

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

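ml-nlp

This project covers the concepts and code implementations commonly tested in Machine Learning, Deep Learning, and NLP interviews, which are also the theoretical foundations every algorithm engineer needs to know.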

mtadam

MTAdam: Automatic Balancing of Multiple Training Loss Terms

mtfaa-net

Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement

open_flamingo

An open-source framework for training large multimodal models.

parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch

pedalboard

🎛 🔊 A Python library for adding effects to audio.
