Pariente Manuel's Projects
A Flexible and Powerful Parameter Server for large-scale machine learning
simple audio I/O for pytorch
:link: Some useful websites for programmers.
Build the Linux Kernel and Modules on board the NVIDIA Jetson Nano Developer Kit
🤗 Fast, efficient, open-access datasets and evaluation metrics in PyTorch, TensorFlow, NumPy and Pandas
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
This repo contains the scripts, models and required files for the Interspeech 2020 Deep Noise Suppression (DNS) Challenge. We are open sourcing clean speech and noise files as well. Participants of this challenge will use the scripts from this repo to create data to train their noise suppressors. They will compare their method with our baseline noise suppressor and report the results.
End-to-End Speech Processing Toolkit
We'll see what this becomes
Header-only library for using Keras models in C++.
This is the code for the "How to Deploy a Keras Model to Production" by Siraj Raval on Youtube
Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric Learning"
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Common things to do for Jetson Nano DTK
Deep Learning for humans
An open source dataset for source separation
Python library for audio and music analysis
Markdown - you can mark up titles, lists, tables, etc., in a much cleaner, readable and accurate way if you do it with HTML.
✨ Build a beautiful and simple website in literally minutes.
Starter kit for getting started in the Music Demixing Challenge.
First repository
A resampling function based on placing the filter's first null on Nyquist (Null-on-Nyquist Resample)
🐦 Opytimizer is a Python library consisting of meta-heuristic optimization techniques.
Collection of EM algorithms for blind source separation of audio signals
Probabilistic reasoning and statistical analysis in TensorFlow
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, speaker embedding
Python implementation of the Short Term Objective Intelligibility measure
Tensors and Dynamic neural networks in Python with strong GPU acceleration