GithubHelp home page GithubHelp logo

sagarvegad / video-classification-cnn-and-lstm- Goto Github PK

View Code? Open in Web Editor NEW
272.0 10.0 116.0 13 KB

To classify video into various classes using keras library with tensorflow as back-end.

Python 100.00%
video cnn-model lstm deep-neural-networks video-classification keras-models python

video-classification-cnn-and-lstm-'s People

Contributors

sagarvegad avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

video-classification-cnn-and-lstm-'s Issues

data

Hello, can you tell us about the data directory structure in the flow_from_directory() method?
thank you so much!

training problems

Hello

I have some questions regarding the pipeline.
I've collected the images from Youtube and extracted features using VGG16, stored them into .npy files.

  • When i start training CNN_LSTM the on epoch 5 the acc reaches 95%, but the validation is very low, so after epoch 5 basically no changes.
  • How big the dataset should be? For example i've trained the jet class for 800 images. how many did you've used?
  • There 2 training networks. So should I first train CNN with bottleneck and then train CNN_RNN?

queries

  1. code train_CNN_RNN.py what do numbers 56021 and 3974 indicate ? Is it number of images?

for x,y in train_generator:
if batch == (56021/batch_size):

for x,y in validation_generator:
if batch == (3974/batch_size):

  1. When I execute code train_CNN_RNN.py I receive error
    File "train_CNN_RNN.py", line 53, in extract_features_and_store
    if x_generator==None:
    ValueError: The truth value of an array with more than one element is ambiguous. Use a.any() or a.all()

About the Input

Hi, I am new to LSTM, and I think there are some details in your codes I find it hard to understand.

First, I don't understand why we should set the shuffle=True, after all, we will feed the frame sequence into LSTM later, and I guess the sequence should be arrange by the time order, so it seems unreasonable to shuffle the image before training as it will break the order.

Also, the batch size is set as 128, I am wondering if you mean a batch of images belongs to the same video? After training those batches on vgg, the result batches are sent to LSTM, so I guess a batch of images are seen as frame sequence from the same video, am I right?

Please help me, thank you!

data

can you please share your data ?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.