GithubHelp home page GithubHelp logo

tik-team's Introduction

tik-team

Global AI Hub Project - UrbanSounds8K


In this project, how to perform Urban Sound Classification with deep learning is described. First, the general perspective of the project is presented, and then, (Data Set), the tools used and the results of the project are examined. All the codes used for this project are included in this section.

Overview of the sound classification project

Sound classification with deep learning and automatically is a growing field with numerous real-world applications. While much research has been done on audio files such as speech or music, work on ambient sound is relatively rare.

Likewise, seeing recent advances in image classification where Convolutional Neural Networks are used to classify images with high accuracy raises the question of whether these methods can be used in other fields such as classification. Sound is also used or not. There are many real-world applications for what is taught in this material, which are described below.

  1. Content-based multimedia indexing and retrieval
  2. Helping the deaf to do daily activities
  3. Use in smart home applications such as 360 degree security and security features
  4. Industrial uses such as predictive maintenance

What is meant by audio data?

You are always in contact with sound, directly or indirectly. Your brain is constantly processing and understanding sound information and giving you information about the environment. A simple example would be the conversations you have with people on a daily basis. This talk is discussed by another person to continue the discussion. Even when you are thinking in a quiet environment, you tend to pick up very subtle sounds like rustling leaves or the sound of rain. This is the extent of your connection with sound.

And can you somehow get the floating sounds around you to do something useful? Yes of course! Devices have been developed to help capture these sounds and can display them in a computer-readable format. Examples of these formats are:

WAV (Waveform Audio File) format MP3 (MPEG-1 Audio Layer 3) format WMA (Windows Media Audio) format

Audio processing applications

๐Ÿ“Œ Although we said that audio data can be useful for analysis, what are the potential applications of audio processing? Here we introduce some of them:

  1. Indexing of music collections according to their audio characteristics
  2. Music suggestion for radio channels
  3. A similar search for audio files
  4. Speech processing and synthesis - artificial voice generation for conversational agents

Data collection

๐Ÿ“Œ For this problem, a dataset called Urbansound8K has been used. This dataset contains 8732 selected audio files (less than 4 seconds) of urban sounds from 10 classes, which are:

  1. Air conditioner
  2. Car horn
  3. child's play
  4. dog's bark
  5. drilling
  6. The sound of the car engine at rest
  7. gun fire
  8. Jackhammer (hand drill)
  9. Siren
  10. Street music

Download the dataset from the following address:

https://urbansounddataset.weebly.com/urbansound8k.html

and using the UrbanSound8K.csv file in the data folder

tik-team's People

Contributors

emrejilta avatar tohid-yousefi avatar yigityld avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.