Light

caomw / scene-classification Goto Github PK

View Code? Open in Web Editor NEW

This project forked from gmichaeljaison/scene-classification

0.0 1.0 0.0 38.45 MB

Scene classification using visual bag of words approach and Spatial Pyramid matching

MATLAB 100.00%

scene-classification's Introduction

Scene classification

Scene classification using visual bag of words approach and Spatial Pyramid matching

Dataset: SUN database

the dataset contains 1600 images from various scene categories like "airport", "bedroom", and "dessert"

Filter bank: 20

(1 - 5) Guassian filters with sigma values - 1, 2, 4, 8, sqrt(2)*8.
(6 - 10) Laplacian of Guassian filter with same scale.
(11 - 15) x-derivative of Guassian fillter with same scale.
(16 - 20) y-derivative of Guassian fillter with same scale.

Classifier used to determine the dictionary of visual words: K-Means classification (k = 100)

Algorithm:

Training:

Build visual word dictionary
1. Apply 20 filters on each image and construct visual words using random alpha pixels in each image (alpha used = 50).
2. Use k-means classifier to create k visual words out of it (k = 100).
3. Save the vocabulary in dictionary.mat file.
Building recognition system
1. Using the dictionary, iterate through every test image and find the nearest visual word for each pixel. This is referred as 'wordMap'.
2. Construct a histogram of wordMap for every test image.
3. To increase the context in the image, the approach uses "Spatial Pyramid Matching" technique and gives high priority to nearby pixels. This is achieved by building the histogram for every patch in the pyramid layer.

Testing:

Construct the wordMap histogram for the test image in the similar way.
Compare the histogram with all training image histograms, and find the nearest match.
The category of the matched image is the result.

scene-classification's People

Contributors

Watchers

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.

Jobs