cafefars / audioneex

This project forked from a-gram/audioneex

General purpose, real-time audio recognition engine

License: Mozilla Public License 2.0

audioneex's Introduction

Audioneex is an audio content recognition engine specifically designed for real-time applications. It is general purpose, based on content-agnostic algorithms and runs on all kinds of machines, from big servers to mobile and embedded devices.

Features

  • Compact fingerprints - 1 hr of audio encoded in less than 1 MB
  • Fast identification - only a few seconds to perform a recognition
  • Content-agnostic - recognizes audio of different kinds
  • Cross-platform - runs anywhere there is a modern C++ compiler (C++11 or later)
  • IoT & Mobile-ready - runs well on small devices for on-device ACR
  • Database-neutral - can be used with any database (requires drivers)

Documentation

The official documentation can be found here

Quick start

The engine needs the following dependencies:

  • Boost
  • FFTSS
  • Tokyo Cabinet | Couchbase
  • TagLib (optional)
  • FFmpeg (optional)

After compiling and installing the dependencies, open a shell and issue the following commands to start the build process:

$ git clone https://github.com/a-gram/audioneex.git
$ cd audioneex
$ ./build WITH_EXAMPLES=ON

This will build the library along with the demo programs within the _build directory. Please refer to the documentation for more details.

Demo app

If you want to see the engine in action and play around with it straight away, there is a demo app for Windows that can be downloaded from here

License

This code is released under the Mozilla Public License 2.0.

In a nutshell:

  • It can be used freely in commercial projects without publishing proprietary code.
  • Any modifications to the source code must be made available, under the same license.
  • If distributed under any form (source or binary) all copyright notices must be retained (for binaries, the notices can be put in the docs or "about" box).

References

For the more curious, it is an implementation of the methods described in this paper. If you are including this work in your research, please use the following BibTeX citation

@misc{agramaglia2014-acr,
   author =       "Alberto Gramaglia",
   title =        "A Binary Auditory Words Model for Audio Content Identification",
   howpublished = "\url{https://github.com/a-gram/audioneex}",
   year =         "2014"
}

or in plain text

A. Gramaglia (2014). "A Binary Auditory Words Model for Audio Content Identification". Audioneex.com.
