Audio Event Net

This README shows how to run AENet, which classifies audio events. Paper: https://arxiv.org/abs/1701.00599

Requirements

How to use

Prerequisites

  • Install all requirements. (This repository includes "HCopy", but you will probably need to compile it on your own machine.)
  • Define the path where you want to keep your models with export AENET_DATA_DIR='YOUR_DATA_PATH'
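
As a quick sanity check before running anything, the environment variable can be verified from Python. This is a minimal sketch, not part of the repository: the helper name `resolve_data_dir` is hypothetical, and creating the directory when it is missing is an assumption about what is convenient, not documented AENet behaviour.

```python
import os
from pathlib import Path


def resolve_data_dir(env=None):
    """Return the directory named by AENET_DATA_DIR (hypothetical helper).

    Raises RuntimeError if the variable is unset or empty. The directory is
    created if it does not exist yet; that is an assumption of this sketch.
    """
    env = os.environ if env is None else env
    value = env.get("AENET_DATA_DIR")
    if not value:
        raise RuntimeError(
            "AENET_DATA_DIR is not set; export it before running AENet."
        )
    path = Path(value).expanduser()
    path.mkdir(parents=True, exist_ok=True)
    return path


if __name__ == "__main__":
    print("Models will be kept in:", resolve_data_dir())
```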

Run example

Run python run_sample.py to test the installation and see how to use the code.

  • This sample program reads wave files and stores corresponding AENet features in "ae_feat".
  • For testing your installation, the sample program also displays the error between your extraction and a reference.
  • If you get a high error, it might be due to the installation of "HCopy".
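
The README does not say which error measure run_sample.py reports. Purely as an illustration of this kind of check, the sketch below computes a mean absolute difference between an extracted feature matrix and a reference; the function name and the choice of metric are assumptions, and the matrices are plain nested lists so the example needs no third-party packages.

```python
def mean_abs_error(features, reference):
    """Mean absolute difference between two equally shaped 2-D lists.

    Illustrative only: the metric actually used by run_sample.py may differ.
    """
    if len(features) != len(reference):
        raise ValueError("feature matrices have different numbers of rows")
    total, count = 0.0, 0
    for row, ref_row in zip(features, reference):
        if len(row) != len(ref_row):
            raise ValueError("feature rows have different lengths")
        for value, ref_value in zip(row, ref_row):
            total += abs(value - ref_value)
            count += 1
    if count == 0:
        raise ValueError("feature matrices are empty")
    return total / count
```

An error close to zero would suggest that the local extraction matches the reference; a large value would point back at the "HCopy" build.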

Install as a package

  • Make sure you have set the data directory: AENET_DATA_DIR
  • Download the model files by running bin/download.sh
  • Install the package with python setup.py install
  • Now you can use it via import aenet
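
To confirm that the installation step worked without actually loading the models, one option is to ask Python whether the package can be found. This sketch uses only the standard library; `is_installed` is a hypothetical helper name, and the check says nothing about whether the model files were downloaded.

```python
import importlib.util


def is_installed(package_name):
    """Return True if a top-level package can be located on the import path."""
    return importlib.util.find_spec(package_name) is not None


if __name__ == "__main__":
    if is_installed("aenet"):
        print("aenet is importable")
    else:
        print("aenet was not found; re-run: python setup.py install")
```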

Supported format

Currently, only the wave file format with a 16 kHz sampling rate, 16-bit samples, and a single (mono) channel is supported. If you would like to extract AENet features from audio files in other formats, please convert them first.
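
Whether a given file meets these constraints can be checked up front with Python's standard `wave` module. The sketch below is not part of aenet; `is_supported_wav` is a hypothetical helper that only inspects the WAV header.

```python
import wave


def is_supported_wav(path):
    """Return True if the file is a 16 kHz, 16-bit, mono PCM WAV file."""
    try:
        with wave.open(str(path), "rb") as wav:
            return (
                wav.getframerate() == 16000    # 16 kHz sampling rate
                and wav.getsampwidth() == 2    # 2 bytes = 16 bits per sample
                and wav.getnchannels() == 1    # mono
            )
    except (wave.Error, EOFError):
        # Not a PCM WAV file that the wave module can parse.
        return False
```

Files for which this returns False would need to be converted before feature extraction.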

For convenience the class aenet.AENet contains the function write_wav which writes the audio stream of a video in the correct format using moviepy.

Known problems

Sometimes, the function write_wav fails, due to a moviepy error. See Zulko/moviepy#377
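
If write_wav keeps failing, one possible workaround is to bypass moviepy and call the ffmpeg command-line tool directly. This is a sketch of an alternative, not how write_wav is implemented: it assumes ffmpeg is installed and on the PATH, and the helper names are hypothetical. The flags request mono (`-ac 1`), 16 kHz (`-ar 16000`), 16-bit PCM (`-acodec pcm_s16le`) output with the video stream dropped (`-vn`).

```python
import subprocess


def build_ffmpeg_command(src, dst):
    """Build an ffmpeg command that writes 16 kHz, 16-bit, mono WAV audio."""
    return [
        "ffmpeg", "-y",          # overwrite the output file if it exists
        "-i", str(src),          # input video or audio file
        "-vn",                   # ignore any video stream
        "-ac", "1",              # one audio channel (mono)
        "-ar", "16000",          # 16 kHz sampling rate
        "-acodec", "pcm_s16le",  # 16-bit little-endian PCM
        str(dst),
    ]


def extract_audio(src, dst):
    """Run ffmpeg; raises CalledProcessError if the conversion fails."""
    subprocess.run(build_ffmpeg_command(src, dst), check=True)
```

For example, `extract_audio("clip.mp4", "clip.wav")` would produce a file in the supported format, provided ffmpeg can decode the input.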

Reference

If you end up using this code or the pre-trained network, we ask you to cite the following paper:

Naoya Takahashi, Michael Gygli, and Luc Van Gool, "AENet: Learning Deep Audio Features for Video Analysis", arXiv preprint arXiv:1701.00599, 2017.
