hadryan Goto Github PK
Type: User
Inspired by the convolutional recurrent neural network (CRNN) and Inception, we propose a multiscale time-frequency convolutional recurrent neural network (MTF-CRNN) for audio event detection. Our goal is to improve audio event detection performance and to recognize target audio events that have different lengths and occur against complex audio backgrounds. We exploit multiple groups of parallel and serial convolutional kernels to learn high-level shift-invariant features from the time and frequency domains of acoustic samples. A two-layer bidirectional gated recurrent unit (GRU), based on the recurrent neural network, is used to capture the temporal context from the extracted high-level features. The proposed method is evaluated on the DCASE2017 challenge dataset. Compared to other methods, the MTF-CRNN achieves one of the best test performances for a single model without pre-training and without using a multi-model ensemble approach.
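The idea of parallel kernels at several scales can be illustrated with a small pure-Python sketch. It is not the MTF-CRNN itself: the function names `conv1d_same` and `multiscale_features`, the kernel sizes, and the use of fixed averaging kernels in place of learned weights are all assumptions made for illustration. Each kernel size yields one feature channel of the same length as the input, which is what allows the channels to be stacked and passed on to a recurrent layer.

```python
from typing import List, Sequence


def conv1d_same(signal: Sequence[float], kernel: Sequence[float]) -> List[float]:
    """Cross-correlate `signal` with an odd-length `kernel`.

    The signal is zero-padded on both sides so the output has the same
    length as the input ("same" padding).
    """
    if len(kernel) % 2 == 0:
        raise ValueError("kernel length must be odd for symmetric padding")
    half = len(kernel) // 2
    padded = [0.0] * half + list(signal) + [0.0] * half
    return [
        sum(k * padded[i + j] for j, k in enumerate(kernel))
        for i in range(len(signal))
    ]


def multiscale_features(
    signal: Sequence[float], kernel_sizes: Sequence[int] = (1, 3, 5)
) -> List[List[float]]:
    """Return one feature channel per kernel size.

    Assumption: averaging kernels stand in for learned weights, so each
    channel is a moving average over a different window length.
    """
    return [
        conv1d_same(signal, [1.0 / size] * size)
        for size in kernel_sizes
    ]


# A single impulse is spread over a wider window by the larger kernels.
features = multiscale_features([0.0, 0.0, 3.0, 0.0, 0.0])
for size, channel in zip((1, 3, 5), features):
    print(size, [round(v, 3) for v in channel])
```

In a trained network the kernel weights would be learned, and the convolutions would run over both the time and frequency axes of a spectrogram rather than over a 1D list.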
Metadata, scripts and baselines for the MTG-Jamendo dataset
A simple notebook demonstrating prompt-based music generation via Mubert API
Backbone.js related codes
Our task is to recognize whether an image of a handwritten digit and a recording of a spoken digit refer to the same number or to different numbers. There are two inputs, the image of the written digit and the MFCC features of the spoken digit, and one output, a boolean array stating whether each sound and image pair matches. We chose a multi-model approach, using an LSTM for the audio features and a CNN for the image data. The outputs of both models are concatenated at the end and a binary loss function is applied.
multi_task_NLP is a utility toolkit enabling NLP developers to easily train and infer a single model for multiple tasks.
Multiavatar Generator in PHP
Euclidean Distance Method + Manhattan Distance Method + Minkowski Distance Method + Chebyshev Distance Method + Canberra Distance Method + Bray-Curtis Distance Method + Kullback-Leibler Distance Method + Jensen-Shannon Distance Method
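For reference, the eight measures named above can be sketched in a few lines of standard-library Python. This is a minimal illustration using the textbook definitions, not the repository's code; the function names are hypothetical. Two conventions are assumed: natural logarithms are used, and the Jensen-Shannon distance is taken as the square root of the Jensen-Shannon divergence. The Kullback-Leibler and Jensen-Shannon functions expect probability vectors that sum to 1.

```python
import math
from typing import Sequence

Vector = Sequence[float]


def minkowski(x: Vector, y: Vector, p: float) -> float:
    """Minkowski distance of order p (p=1 Manhattan, p=2 Euclidean)."""
    return sum(abs(a - b) ** p for a, b in zip(x, y)) ** (1.0 / p)


def euclidean(x: Vector, y: Vector) -> float:
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def manhattan(x: Vector, y: Vector) -> float:
    return sum(abs(a - b) for a, b in zip(x, y))


def chebyshev(x: Vector, y: Vector) -> float:
    """Largest absolute coordinate difference (limit of Minkowski as p grows)."""
    return max(abs(a - b) for a, b in zip(x, y))


def canberra(x: Vector, y: Vector) -> float:
    """Sum of |a - b| / (|a| + |b|); terms where both values are 0 are skipped."""
    return sum(
        abs(a - b) / (abs(a) + abs(b))
        for a, b in zip(x, y)
        if a != 0 or b != 0
    )


def bray_curtis(x: Vector, y: Vector) -> float:
    """Sum of |a - b| divided by sum of |a + b|."""
    return sum(abs(a - b) for a, b in zip(x, y)) / sum(
        abs(a + b) for a, b in zip(x, y)
    )


def kullback_leibler(p: Vector, q: Vector) -> float:
    """KL divergence D(p || q); asymmetric, so not a true metric.

    Assumes q > 0 wherever p > 0. Terms with p == 0 contribute nothing.
    """
    return sum(a * math.log(a / b) for a, b in zip(p, q) if a > 0)


def jensen_shannon(p: Vector, q: Vector) -> float:
    """Jensen-Shannon distance: square root of the JS divergence."""
    m = [(a + b) / 2 for a, b in zip(p, q)]
    divergence = 0.5 * kullback_leibler(p, m) + 0.5 * kullback_leibler(q, m)
    return math.sqrt(divergence)


x, y = [1.0, 2.0, 3.0], [4.0, 6.0, 3.0]
print(euclidean(x, y), manhattan(x, y), chebyshev(x, y))
```

Note that the Kullback-Leibler divergence is not symmetric, so `kullback_leibler(p, q)` and `kullback_leibler(q, p)` generally differ; the Jensen-Shannon distance symmetrizes it by comparing both inputs to their midpoint.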
The multilingual variant of GLM, a general language model trained with autoregressive blank infilling objective
Extracting visual features from "Music Videos" for Video Recommender
TensorFlow implementation of "Multimodal Speech Emotion Recognition using Audio and Text," IEEE SLT-18
A Python Library for Multimodal Analysis of Movies and Content-based Movie Recommendation
Python examples for the course "Multimodal Information Processing & Analysis" of the MSc in Data Science in NCSR Demokritos
The project tries to solve a speaker diarization problem using audio features, face recognition and video feature extraction from face image, mouth tracking.
🚀 Easily turn common PHP/Python/JS... scripts into daemons with multi-process execution
Code for the paper "Multi-Task Learning of Object State Changes from Uncurated Videos"
Easily create complex recyclerview adapters in android
A JavaScript library that abstracts audio-playing functionality of HTML5, Songbird, and SoundManager 2 for use in music playlists
simple javascript display for mumble, using D3.js (WIP)
music recommender system
Evaluation of the classification performance (Speech, Music, and Noise) of 1D (WaveNet) and 2D (MobileNet) CNN and RNN (GRU) on the MUSAN corpus.
will recommend you songs based on the vocals/mood/vibe
Download sheet music (MSCZ, PDF, MusicXML, MIDI, MP3, download individual parts as PDF) from musescore.com for free, no login or Musescore Pro required | Installation: https://msdl.librescore.org/install.user.js
Music player based on Auro
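Cross-platform Electron music player; searches NetEase Cloud Music, QQ Music, and Xiami Music; supports login via QQ, Weibo, and GitHub, with cloud playlists; supports one-click import of playlists from music platforms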
Emotion Based music player
Automatic alignment detection of a song’s music and lyric emotions
Supplementary material for IJCNN paper "Musical Artist Classification with Convolutional Recurrent Neural Networks"