GithubHelp home page GithubHelp logo

amoussawi / caffe Goto Github PK

View Code? Open in Web Editor NEW

This project forked from weiliu89/caffe

18.0 18.0 3.0 65.73 MB

Single Shot Multibox Detector on Caltech pedestrian dataset

License: Other

CMake 1.69% Makefile 0.43% Shell 0.33% C++ 58.95% Cuda 4.34% MATLAB 14.07% M 0.01% Python 6.38% Protocol Buffer 1.32% C 0.56% HTML 11.90% CSS 0.02%
caffe caltech-pedestrian-dataset deep-learning pedestrian-detection single-shot-detection single-shot-multibox-detector ssd

caffe's People

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

caffe's Issues

Question about ETH & TUD-Brussel dataset and hard negative mining

Hi @amoussawi , I have two questions in my SSD training on Caltech.

  1. Have you used hard negative mining during training?

  2. In your paper, you have mentioned the ETH and TUD-Brussels dataset results additional 4000 images, but I could only find 2313 images with pedestrian annotations.
    (ETH setup1 ~ set00 V000 999
    set01 V001 451
    set02 V002 354,
    TUD ~ 507)

Estimating the number training iterations

Hi Abdallah,
Thank you for the great work and sharing your code.

I have few questions about training settings. How did you know that after 20K iterations the model starts to overfit ? and how many training images did you have, and the number of epochs this 20K represents ?

Thank you.

Questions about aspect_ratio and speed

Hi
Thx for your excellent excellent excellent job! I'm a new learner and it benefit me a lot.

My question is: i wonder which of these two aspect_ratios configuration work better. How much better ? and How about the distinction of time cost between them?

(1) in ssd_caltech_512.py
331 aspect_ratio_per_layer = [0.15,0.3,0.41,0.6,0.75,0.9]
332 aspect_ratios = [aspect_ratio_per_layer]*len(mbox_source_layers)
(2) in ssd_caltech_512_ft.py
334 aspect_ratio_per_layer = [0.41]
335 aspect_ratios = [aspect_ratio_per_layer]*len(mbox_source_layers)

Another question is : i can only run the SSD512 model with the speed about 12 FPS (Geforce GTX Titan X), how can i achieve 24 FPS like u?

Thx again again again :) @amoussawi

A few questions for training and testing on Caltech

Hi @amoussawi
Thanks for sharing the code. I'm trying to apply SSD on Caltech too, so I have a few questions here:
1、How do you deal with annotations like 'person?' or 'people' in your trainval set? Also, did you use all ground truth annotations for training, regardless their heights and occlusions?
2、Since the detection output and evaluation code is for VOC, how do you evaluate your results in Caltech style?
3、Have you tried training only with caltech trainset? I'm interested in how much the performance will boost using extra data(ETH,...) for training.

Error running pre-trained model with original SSD

Hi,

First of all, thank you for sharing your code!

I have a question/issue: I'm trying to run the provided pre-trained model with the original SSD code, but I'm getting a couple of errors. First, the field "extra_ar" in the PriorBox layers of the deploy.prototxt cannot be read: Error parsing text-format caffe.NetParameter: 1010:13: Message type "caffe.PriorBoxParameter" has no field named "extra_ar"

If such fields are removed, then another error appears: F0627 07:17:17.537052 12113 detection_output_layer.cpp:164] Check failed: num_priors_ * num_loc_classes_ * 4 == bottom[0]->channels() (163836 vs. 54612) Number of priors must match number of location predictions.

For what I've seen, your have modified certain things in the SSD architecture, such as the aspect ratio in the layers. However, I though such modifications were self-contained, i.e. they only affected definitions and scripts, but that the resulting deploy.prototxt and .caffemodel were compatible with mainstream SSD. Is that the case, or is your branch needed to run the model for inference? Any tips or explanations about the error are welcomed!

Thanks!

Cannot creat lmdb in the 4th step in Preparation

Issue summary

Hi, first I want to thank you for the effort of doing Caltech on SSD. Here is my question:
I do step 4 in the Preparation step(I didn't do step 3 since I don't need evaluation yet), and I put the downloaded Caltech file in /home/data/caltech_code/. But when I execute ./data/caltech/creat_list.sh, it shows these two messages:

Create list for caltech trainval...
ls: cannot access '/home/paris/caffe/data/caltech/trainval/images//': No such file or directory
ls: cannot access '/home/paris/caffe/data/caltech/trainval/images//': No such file or directory
and ...
Create list for caltech test...
ls: cannot access '/home/paris/caffe/data/caltech/test/images//': No such file or directory
ls: cannot access '/home/paris/caffe/data/caltech/test/images//': No such file or directory

I'm don't know where I've done wrong or do I miss any step?

where to get the essay?

Hi, very glad to see your excellent work. I'd like to know where I can get the full text of your essay:An FPGA-Accelerated Design for Deep Learning Pedestrian Detection in Self-Driving Vehicles. Please help. Thx.

what is the detection_eval after 20k iterations?

@amoussawi Hi, thanks for your work. You mentioned that the miss rate is 11.8% after 20k iterations.

  1. And I'd like to know what is the detection_eval that the network gives out on test set after 20k iterations?
  2. I get the detecion_eval = 0.545697 after 8k iterations with fine-tuning. I think it's a little bit low. Is it normal or maybe something is wrong? What's your opinion?
    Thanks.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.