
templeblock / 1maxcnn


This project is forked from pquochuy/1maxcnn


1-max pooling CNN for robust audio event recognition

MATLAB 45.18% Objective-C 5.04% Python 17.31% Shell 32.47%

1maxcnn's Introduction

This is the experimental setup for the published paper: Huy Phan, Lars Hertel, Marco Maass, and Alfred Mertins. Audio Event Recognition with 1-Max Pooling Convolutional Neural Networks. In Proceedings of the 17th Annual Conference of the International Speech Communication Association (INTERSPEECH), pp. 3653-3657, 2016.

The setup is similar to that in

I. McLoughlin, H.-M. Zhang, Z.-P. Xie, Y. Song, W. Xiao, Robust Sound Event Classification using Deep Neural Networks, IEEE Trans. Audio, Speech and Language Processing, Jan 2015

(which can be found here http://www.lintech.org/machine_hearing/index.html)

The implementation of 1-max pooling CNN is based on the implementation of Denny Britz:

http://www.wildml.com/2015/12/implementing-a-cnn-for-text-classification-in-tensorflow/

Although an implementation with variable-length inputs is possible, we zero-padded each input to the maximum length to simplify the implementation. This zero padding does not significantly affect the 1-max pooling.
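The following is a minimal sketch, not the code in this repository, of why zero padding tends to leave the 1-max pooled value unchanged. It assumes a single 1-D filter, a ReLU activation, and a non-positive bias; the signal and kernel values are made up for illustration. Under these assumptions, any window that lies entirely in the padded region produces 0 after ReLU, so it cannot exceed the maximum found in the real signal. Windows that straddle the boundary see a truncated signal and could in principle differ, which is why the effect is small rather than strictly zero.

```python
def conv1d_relu(signal, kernel, bias=0.0):
    """Valid 1-D cross-correlation over time, followed by ReLU."""
    width = len(kernel)
    feature_map = []
    for start in range(len(signal) - width + 1):
        window = signal[start:start + width]
        activation = sum(x * k for x, k in zip(window, kernel)) + bias
        feature_map.append(max(0.0, activation))
    return feature_map


def one_max_pool(feature_map):
    """1-max pooling: keep only the largest activation over time."""
    return max(feature_map)


def pad_to_length(signal, length):
    """Append zeros so that the signal has the requested length."""
    return signal + [0.0] * (length - len(signal))


# Hypothetical input and filter, chosen only for illustration.
signal = [0.2, 1.0, 0.5, 0.1]
kernel = [1.0, 0.5]

padded = pad_to_length(signal, 8)

pooled_original = one_max_pool(conv1d_relu(signal, kernel))
pooled_padded = one_max_pool(conv1d_relu(padded, kernel))

print(pooled_original, pooled_padded)
```

Both calls return the same pooled value here because the strongest response comes from a window inside the original signal, and the padded windows contribute only zeros after ReLU.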

The experiments can be run as follows:

On MATLAB: (for SIF feature extraction)

create_database;
extract_sif;
extract_sif_plus;
export_data_tensorflow;
export_data_tensorflow_plus;
export_data_multicondition_tensorflow;
export_data_multicondition_tensorflow_plus;

On TensorFlow: (for CNN training and evaluation)

bash run.sh
