qianqq Goto Github PK
Name: vad
Type: User
Name: vad
Type: User
This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.
Pytorch code for End-to-End Audiovisual Speech Recognition
End-to-End Speech Processing Toolkit
:fire: 2D and 3D Face alignment library build using pytorch
Feature Map Inversion to visualize what feature a filter extract from input image in CNNs
A PyTorch implementation of the FFTNet: a Real-Time Speaker-Dependent Neural Vocoder
Affine transformation virtual 3D object using a finger gesture-based interactive system in the virtual environment.
A Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"
Fuzzy String Matching in Python
We present a framework for training multi-modal deep learning models on unlabelled video data by forcing the network to learn invariances to transformations applied to both the audio and video streams.
The TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).
This repository contains the files used for our Interspeech 2017 paper.
Mirror of GlottHMM
Source Codes of HetSANN in the AAAI'20 paper: An Attention-based Graph Nerual Network for Heterogeneous Structural Learning.
A deep learning project to tell a story with an image or a video.
PyTorch implementation of "InstaGAN: Instance-aware Image Translation" (ICLR 2019)
Extract xvector and ivector under kaldi
Code for the paper: Audio to Score Matching by Combining Phonetic and Duration Information
This is now the official location of the Kaldi project.
Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction procedure
Kaldi code for doing DNN with tensorflow
Deep Learning library for Python. Convnets, recurrent neural networks, and more. Runs on Theano and TensorFlow.
Keras implementations of Generative Adversarial Networks.
Simple Generative Adversarial Networks for MNIST data with Keras.
Knowledge-based Semantic Role Labeling
Go engine with no human-provided knowledge, modeled after the AlphaGo Zero paper.
Official PyTorch implementation of paper Leveraging Unimodal Self Supervised Learning for Multimodal Audio-Visual Speech Recognition
This is the repository containing codes for our CVPR, 2020 paper titled "Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis"
A Keras implementation of LipNet
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.