GithubHelp home page GithubHelp logo

pranaychuramani21 / urban-sound-classification Goto Github PK

View Code? Open in Web Editor NEW
1.0 1.0 0.0 1005 KB

This is an audio classification project that classifies various sounds using their unique spectrogram into classes such as Dog Barking, Sirens, Street Music etc using Neural Networks

Jupyter Notebook 100.00%
keras librosa mfcc neural-networks numpy pandas scipy tensorflow

urban-sound-classification's Introduction

Urban Sound Classification

This is an audio classification project that classifies various sounds using their unique spectrogram into multiple classes such as Dog Barking, Sirens, Street Music etc using Neural Networks

Description

  • The UrbanSound8k dataset contains 8732 labelled sound excerpts (<=4s) of urban sounds from 10 classes: air_conditioner, car_horn, children_playing, dog_bark, drilling, enginge_idling, gun_shot, jackhammer, siren, and street_music. The classes are drawn from the urban sound taxonomy.

  • 8732 audio files of urban sounds (see description above) in WAV format. The sampling rate, bit depth, and a number of channels are the same as those of the original file uploaded to Freesound (and hence may vary from file to file). The UrbanSound8k dataset used for model training can be downloaded from the following link: https://urbansounddataset.weebly.com/

Librosa

Librosa is a python package for music and audio analysis. It provides the building blocks necessary to create music information retrieval systems. It is used for data pre-processing and feature extraction. Features used:

MFCC
  • The Mel-Frequency Cepstral Coefficients (MFCC) summarises the frequency distribution across the window size, so it is possible to analyze both the frequency and time characteristics of the sound. These audio representations will allow us to identify features for classification. It will try to convert the audio into some kind of feature based on the frequency and time characteristics which will help us to do the classification.

  • MFCC does nothing but extract patterns based on the frequency and time characteristics. This will uniquely able to identify that particular audio signal like in which class it actually belongs because this audio signal will be later used in deep learning Techniques.

Conclusion

  • The Testing Accuracy comes out to be: 78.82%

Technology Used

  • Artificial Neural Networks
  • Librosa
  • Scipy
  • Mel-Frequency Cepstral Coefficients(MFCC)
  • TensorFlow
  • keras
  • NumPy
  • pandas

urban-sound-classification's People

Contributors

pranaychuramani21 avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.