GithubHelp home page GithubHelp logo

hfink / matchbox Goto Github PK

View Code? Open in Web Editor NEW
86.0 86.0 25.0 14.24 MB

Mel-Frequency-Cepstral-Coefficients and Dynamic-Time-Warping for iOS/OSX

Home Page: http://hfink.eu/archive-2012/matchbox/index.html

C++ 95.77% C 2.94% MATLAB 0.14% Shell 0.01% Perl 0.01% Objective-C 1.14%

matchbox's People

Contributors

hfink avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

matchbox's Issues

Usage in simple humming recognition?

Hi, thanks for sharing your code. I took a dive into the code to see if the project could be configured to work as a simple humming recognition library (which I intend to share to the community also).

Just wondering if I could ask from the expert whether your code could easily be converted? I'm imagining this scenario:

  1. You have pre-recorded a 4 second tune that someone has hummed. So you have that stored locally on the device.
  2. You use the record feature on the app to record a user's attempt to mimic the hum.
  3. You score the user's closeness to the original.

Based on your experience and approach, is this possible? If the original hum was from a man, would a women's hum be markedly different in terms of the min distance score?

MFCCProcessor returns inf and NaN values

I'm using your MFCC processor in a custom project and some songs return inf and NaN float values for some frames.

More information:
I am using a custom audio file loader and run MFCC processor in a single run - all audio samples are processed in a single buffer, not buffer by buffer when the samples are read. Only a single channel float format is sent to MFCC processor, values between -1.0 and 1.0 of course.

In some cases the songs I use to test first few frames return NaN (the rest of them are fine apparently). That causes the mean operation to place NaN values across whole vector.

Also the AVAsset reader apparently does some preprocessing itself (mixing, normalizing or something), do you have any information regarding that? Because audio loading system I use is built on Extended Audio Services and sample values are different than sample values obtained from AVAssetReader. This leads to MFCC be quite different if they are calculated using AVAsset reader source or Extended Audio Services (I am actually using TheAmazingAudioEngine audio file loading operation).

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.