hadryan's Projects

mtf-crnn

Inspired by the convolutional recurrent neural network (CRNN) and Inception, we propose a multiscale time-frequency convolutional recurrent neural network (MTF-CRNN) for audio event detection. Our goal is to improve audio event detection performance and to recognize target audio events that have different lengths and occur against complex audio backgrounds. We use multiple groups of parallel and serial convolutional kernels to learn high-level shift-invariant features from the time and frequency domains of acoustic samples. A two-layer bidirectional gated recurrent unit (GRU), a type of recurrent neural network, captures the temporal context from the extracted high-level features. The proposed method is evaluated on the DCASE2017 challenge dataset. Compared to other methods, the MTF-CRNN achieves one of the best test performances for a single model without pre-training and without a multi-model ensemble.

multi-model-approach-for-speak-and-text-image-association-prediction

Our task is to recognize whether an image of a handwritten digit and a recording of a spoken digit refer to the same number or to different numbers. There are two inputs, an image of a written number and the MFCC features of a spoken number, and one output, a boolean array stating whether each sound and image pair matches. We use a multi-model approach, with an LSTM for the audio features and a CNN for the image data. The outputs of the two models are concatenated at the end, and a binary loss function is applied.
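The fusion step described above (concatenate the two model outputs, then apply a binary loss) can be sketched in plain Python. This is only an illustration under assumptions: the LSTM and CNN encoders are left out and replaced by ready-made embedding lists, and the names `fuse_and_score`, `binary_cross_entropy`, `weights`, and `bias` are hypothetical, not taken from the repository.

```python
import math


def sigmoid(z):
    """Logistic function mapping a real-valued logit to (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def fuse_and_score(audio_emb, image_emb, weights, bias):
    """Concatenate the two embeddings and apply one linear layer + sigmoid.

    audio_emb / image_emb stand in for the LSTM and CNN outputs
    (assumed here; the real encoders are not shown).
    """
    fused = list(audio_emb) + list(image_emb)  # late fusion by concatenation
    if len(fused) != len(weights):
        raise ValueError("weights must match the fused embedding length")
    logit = sum(w * f for w, f in zip(weights, fused)) + bias
    return sigmoid(logit)  # probability that sound and image match


def binary_cross_entropy(prob, label, eps=1e-12):
    """Binary loss for one example; label is 1 (match) or 0 (no match)."""
    prob = min(max(prob, eps), 1.0 - eps)  # clamp to avoid log(0)
    return -(label * math.log(prob) + (1 - label) * math.log(1.0 - prob))
```

In a real model the weights and bias would be trained jointly with both encoders; here they are plain inputs so the fusion and loss can be inspected in isolation.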

multi-task-nlp

multi_task_NLP is a utility toolkit enabling NLP developers to easily train and infer a single model for multiple tasks.

multidistance-recommendation-calculation-

Euclidean Distance Method + Manhattan Distance Method + Minkowski Distance Method + Chebyshev Distance Method + Canberra Distance Method + Bray-Curtis Distance Method + Kullback-Leibler Distance Method + Jensen-Shannon Distance Method
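For reference, the measures listed above can be sketched with the standard textbook definitions. This is a minimal illustration, not the repository's code: the function names are hypothetical, inputs are assumed to be equal-length numeric sequences, and the last two functions assume probability distributions (with `q` nonzero wherever `p` is nonzero). Kullback-Leibler is strictly a divergence rather than a metric, and the Jensen-Shannon distance is taken here as the square root of the Jensen-Shannon divergence with natural logarithms.

```python
import math


def minkowski(x, y, p):
    """Minkowski distance of order p (p=1 Manhattan, p=2 Euclidean)."""
    return sum(abs(a - b) ** p for a, b in zip(x, y)) ** (1.0 / p)


def euclidean(x, y):
    return minkowski(x, y, 2)


def manhattan(x, y):
    return sum(abs(a - b) for a, b in zip(x, y))


def chebyshev(x, y):
    """Largest absolute coordinate difference."""
    return max(abs(a - b) for a, b in zip(x, y))


def canberra(x, y):
    """Sum of |a - b| / (|a| + |b|); terms with a zero denominator are skipped."""
    total = 0.0
    for a, b in zip(x, y):
        denom = abs(a) + abs(b)
        if denom:
            total += abs(a - b) / denom
    return total


def bray_curtis(x, y):
    """Sum of |a - b| divided by sum of |a + b|."""
    num = sum(abs(a - b) for a, b in zip(x, y))
    den = sum(abs(a + b) for a, b in zip(x, y))
    return num / den if den else 0.0


def kl_divergence(p, q):
    """KL(p || q) with natural log; terms where p is 0 contribute nothing."""
    return sum(a * math.log(a / b) for a, b in zip(p, q) if a > 0)


def jensen_shannon_distance(p, q):
    """Square root of the Jensen-Shannon divergence between p and q."""
    m = [(a + b) / 2.0 for a, b in zip(p, q)]
    return math.sqrt(0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m))
```

In a recommender setting, any of these could be used to rank items by how close their feature vectors are to a user's profile vector; which measure suits best depends on the scale and sparsity of the features.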

multilingual-glm

The multilingual variant of GLM, a general language model trained with an autoregressive blank infilling objective

multimodalanalysis

Python examples for the course "Multimodal Information Processing & Analysis" of the MSc in Data Science in NCSR Demokritos

multimodalanalysis_speakerdiarization

The project tries to solve a speaker diarization problem using audio features, face recognition, and video features extracted from face images and mouth tracking.

multiprocess

🚀 Easily turn common PHP/Python/JS scripts into daemons and run them as multiple processes

mumbl

A JavaScript library that abstracts audio-playing functionality of HTML5, Songbird, and SoundManager 2 for use in music playlists

mure

music recommender system

musan_investigation_cnn_rnn

Evaluation of the classification performance (Speech, Music, and Noise) of 1D (WaveNet) and 2D (MobileNet) CNN and RNN (GRU) on the MUSAN corpus.

muse

Recommends songs based on vocals, mood, and vibe

musescore-downloader

Download sheet music (MSCZ, PDF, MusicXML, MIDI, MP3, and individual parts as PDF) from musescore.com for free, no login or Musescore Pro required | Installation: https://msdl.librescore.org/install.user.js

music-1

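A cross-platform Electron music player; searches NetEase Cloud Music, QQ Music, and Xiami Music; supports login via QQ, Weibo, and GitHub, with cloud playlists; supports one-click import of playlists from music platforms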
