Name: Sravani Dandu
Type: User
Company: ML Researcher at Comcast Labs
Bio: I am an ML researcher and engineer at Comcast Labs with a focus on NLP, speech processing, and human-machine interfaces.
Location: San Francisco, California
Sravani Dandu's Projects
Audio Editor
Awesome Object Detection based on handong1587 github: https://handong1587.github.io/deep_learning/2015/10/09/object-detection.html
A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.
Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning ecosystem.
:mortar_board: Path to a free self-taught education in Computer Science!
Programming assignments and quizzes from all courses within the GANs specialization offered by deeplearning.ai
DeepFaceLab is the leading software for creating deepfakes.
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques, applied directly to the raw waveform, that further improve the model's performance and generalization.
A set of examples around PyTorch in Vision, Text, Reinforcement Learning, etc.
:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
Kubernetes environment for running go figure apps
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Machine Learning Interviews from FAAG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc.
System design patterns for machine learning
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Description-FAQ of the process
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
Text to Speech with PyTorch (English and Mongolian)
Geometric Deep Learning Extension Library for PyTorch
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Materials for StyleGAN2 Training class
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
DeepMind's Tacotron-2 TensorFlow implementation
Official PyTorch implementation of TTS Style Transfer
WaveRNN Vocoder + TTS