robert-junwang / pelee Goto Github PK

Pelee: A Real-Time Object Detection System on Mobile Devices

License: Apache License 2.0

Python 13.94% Jupyter Notebook 86.06%

pelee's Issues

does it costs 6min per 50iters during training?

the bactch size is 32(*4=128) in 4 gpus，i also use (build/tools/caffe time) while the cost time is shorter about 1minute 27seconds. i am confused with this, can anyone help me? thanks

pelee speed

Hi ，the Pelee test time in 1080Ti is what？

The paper points:
To compensate for the negative impact on accuracy caused by this change, we use a shallow and wide network structure. We also add a 1x1 convolution layer after the last dense block to get the stronger representational abilities

This design, In the code, it doesn't seem to show up.

BatchNorm

why don't you use BatchNorm layer, does it hurt the performance?

C or C++ implementation

Hello,
Is there c or c++ implementation for PeleeNet ??
Thanks

Any plan to add a detection program?

Hi guys, thank you for your work and contribution. I would like to know is there any plan to add a simple cpp code or python script that only performs the inference process on a single image (just like ssd_detect.py)?

MXNet model

Can you share a pre trained model in MXNet. I want to try this network. Thanks

./build/tools/caffe: not found

Hello, thank you for your code!
When I run the train_voc.py, I meet a problem:

args: Namespace(arch='pelee', batch_size=32, image_size=304, kernel_size=1, lr=0.005, posfix='', run_soon=True, step_value=[80000, 100000, 120000], weight_decay=0.0005, weights='models/peleenet_inet_acc7243.caffemodel')
jobs/pelee/VOC0712/SSD_304x304/pelee_SSD_304x304.sh: 2: jobs/pelee/VOC0712/SSD_304x304/pelee_SSD_304x304.sh: ./build/tools/caffe: not found

How can I solve it? Appreciate it you can give me some advice !!!Thank u!!!

compare with yolov3-tiny

Did anyone compare Pelee with yolov3-tiny ? about speed and accuracy

Hi i want to test pelee + ssd using webcam

Hi i am studying object detection. i read ur paper and now i am trying to test pelee + ssd using webcam
i changed ssd_pascal_webcam.py source.
i removed VGGNETBODY and used pelee

but it seems it doesnt detect properly
i am not sure i am doing properly..
i used pelee_voc pretrained model

thanks

Accuracy is much low of conv_bn_merged model

I downloaded your merged_model(.deployprototxt & .caffemodel) and evaluated VOC2007 test set.

the accuracy of is 0.003.

Did you also check out this results?

SSD+MobileNet： FPS

How do you measure FPS？

Cannot match performance on COCO

Hi @Robert-JunWang ,

Thanks for the contribution.
I downloaded and tested your trained model using COCO API, but I could just get 21.8% mAP instead of 22.4% mAP reported in the paper. The following is the detail results:

Average Precision  (AP) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.218
 Average Precision  (AP) @[ IoU=0.50      | area=   all | maxDets=100 ] = 0.376
 Average Precision  (AP) @[ IoU=0.75      | area=   all | maxDets=100 ] = 0.219
 Average Precision  (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.034
 Average Precision  (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.209
 Average Precision  (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.400
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=  1 ] = 0.212
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets= 10 ] = 0.309
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.327
 Average Recall     (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.072
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.345
 Average Recall     (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.562

Could you provide your evaluation code to get the target evaluation performance? Thank you!

detection_eval= 0.000040

Hello
When i try
Evaluate the model:
cd $CAFFE_ROOT
python examples/pelee/eval_voc.py

i used pretrained model

depthwise convolution to speed up

Does pelee net use depthwise comvolution, like mobilenet?
If not, can it be modified to use it to speed up?

Thanks

Network structure can not be visualized.

python/draw_net.py models/pelee/VOC0712/SSD_304x304/train.prototxt peleeNet.png

The figure of this execution result is plain.
Is the prototxt correct?

19x19 features are fed twice

According to dot file I obtained for the deploy_merged.prototxt the 19x19 feature maps are fed to confidence and location extraction layers twice. Is it intentional or a copy/paste error?

deploy_merged.pdf

Could you please share the project of peleenet in iOS?

Hi @Robert-JunWang ! Sorry to disturb you. I noticed that you have implement peleenet in iOS using coreML tools. I want to build a iOS app to detect things using peleenet. Could you please share that project for us? Thank you very much!

trian/test.prototxt

@Robert-JunWang

I found that in the final prediction, stage4_tb/ext/pm2/res was used to make two predictions, except that the min_size and max_size settings in the "PriorBox" were different, all the others are same

Why do you make two predictions separately instead of merging them? What are the advantages of these two predictions?

test accuracy

Hello, I'm testing on the voc using the trained model you provided. The result of the pelee_SSD_304x304_iter_112000.caffemodel model map is 0.701925, and the map given in your thesis is 70.9. The same pelee_304x304_voc_coco_iter2k_7637.caffemodel model test map for 0.710299, your result is 76.4. I don't know if there is a problem with my test or what other skills? Does anyone have the same problem? Thank you.

can you provide a test python script?

Thank you for your work!
Can you please provide a python script to detect a single image in test ?

is there 14x14 pooling in pelee net?

Hello, I read your code, and I reproduce the code with pytorch, then I found that 14x14 pooling on the top of innerproduct, am I wrong? And why the last transition layer do not execute the pooing operation? Is this 'big' pooling work well?

pytorch-peleenet

hi,Robert!
i have reproduced the peleenet in pytorch, hope it is widely used by others.
pytorch-peleenet

Can you share pelee classification newwork on imagenet?

Hi @Robert-JunWang, I want to train pelee classification network, I am confused about solver.prototxt, Can you provide your train file including train.prototxt, test.prototxt and solver.prototxt?
Thank you in advance!

Paper Clarification

@Robert-JunWang Hi

The paper points:
Residual prediction block make it possible for us to apply 1x1 convolutional kernels to predict the category scores and box offsets

Why can use 1*1 here?

The idea of the Residual Prediction Block comes from the paper: Residual features and unified Prediction network for single stage detection

in the code it provids, it is still 3*3.

try TensorRT on pelee

has anyone tried tensorRT on pelee?
how is the performance?
could you share your experience with us?

where is the pretrained model ''models/peleenetv2_inet_288_7243.caffemodel" in train_voc.py line 242?

Why I run peleeNet to detect video and cost 78.5ms per image while SSD cost only 40.5ms per image

First of all, thank you for your work and contribution.
I trained the peleeNet detection network and it behaves pretty good. The model_size of peleeNet is only 20.4M while SSD is 89.1M. However the time of detecting cost 78.5ms per image on my server，by contrast, SSD cost only 40.5ms.
I feel very strange about this phenomenon.

when train in windows,the error:AttributeError: 'LayerParameter' object has no attribute 'min_size'

when i train_voc.py in windows,the error:AttributeError: 'LayerParameter' object has no attribute 'min_size'

Pretrained model does not seem to give good detection_eval

First of all many thanks for providing the code, greatly appreciated!!

I have run your evaluation code on your pre-trained model, i.e.

python examples/pelee/eval_voc.py --weights=models/pelee/peleenet_inet_acc7243.caffemodel

which is giving to my surprise a loss value of loss=29.92 and detection_eval = 0.002 on the Pascal VOC validation data.

I have then retrained the model for 120,000 iterations and I am obtaining a loss value of loss=1.81 and detection_eval = 0.705, so fairly close to what you have published.

Would you maybe so kind to look once more at the pre-trained model at https://drive.google.com/file/d/1OBzEnD5VEB_q_B8YkLx-i3PMHVO-wagk/view?usp=sharing? Does that model give you good validation?

Is there something wrong with pycaffe?

Hi, @Robert-JunWang
Thanks for your great job~ It's really fast~
I trained peleeNet with my own data and got fantastic mAp in test dataset. But when it comes to pycaffe, and I tried to detect one image and check detection results, I got no results. I found 'detection_output' blob only contained [0, -1, -1, -1, -1, -1, -1]. I'm sure that the code is OK.
Is there something in peleeNet unsupported by pycaffe?
Thanks ~~~

Which pretrained model files(.prototxt & .caffemodel) to use for Object detection?

The link provided for PeeleNet pretrained model(peleenet_inet_acc7243.caffemodel) is of the classification model.Where can I find the pretrained .caffemodel for Object detection?
Also which .prototxt files are supposed to be used for Object detection??

Can't start training

I cloned Pelee and created a soft link to $CAFFE_ROOT/examples.
But when I run : python examples/pelee/train_voc.py there is an error : No such file or directory
Please help me. Thanks!

Train on voc07+12+coco dataset mAP problem

Hi, I use your upload coco model, its mAP is 38.3(IOU = 0.5).
Then I finetune the model on voc0712 dataset, I set some parameters below:

I follow upload voc0712+coco model's min_size and max_size, ex: (21.28, 45.6)、(45.6, 100.32)、(100.32、155.04)、(155.04, 209.76)、(209.76, 264.48)、(264.48, 319.2)
I rename all confidence layers, preserve all loc layers.
I set train batch size 20 and test batch size 4 (due to my GPU memory cost)
Other hyper-parameters(which in solver.prototxt), I keep it equal to yours.

I get a 73% mAP model after training 200k iterations, 3 percentage point lower than yours.
Can you give me some advice?

How about peleeNet training from scratch?

I think peleenet is similar to DSOD, how about peleeNet training from scratch, does it work???
I appreciate it if you can give me some advices!!!
Thank u!!!

How do I get testing accuracy?

When i run python examples/pelee/eval_voc.py --weights=models/pelee/VOC0712/SSD_304x304/pelee_304x304_acc7094.caffemodel,it runs and gives this.
.
And the only results i get are in /data/VOCdevkit/results/VOC2007/SSD_304x304/Main/ directory in the form of this.

How do i get overall test set accuracy in terms of map, loss etc?

train_voc.py about mAP

您好，请问我在voc0712上训练出来的mAP=69.9，和实际70.9相差了一个百分点，pre_caffemodel用的是peleenet_inet_acc7243.caffemodel,不知道是为什么？

no ”step“ parameter in PriorBox layer？

Thanks the authors' work and contribution. I find there no “step” parameter in PeleeNet which is different from original SSD network，WHY？

MAX v.s. AVE Pooling

Hi Robert,

I noticed that you have commented out line 80 in peleenet.py file, which is the max pooling layer replacing the original average pooling layer. Could you share your experience on the performance difference between the two options? Thank you!

how to implement pelee on IOS device?

Are there some tools that I can used for implementing pelee on iphone??

Error when changing the number of classes

Hi,

@Robert-JunWang When I change the num_classes in the train_voc.py to 2 for my custom object detection I am gettting error as follows:

F0628 20:45:42.785300 15254 multibox_loss_layer.cpp:139] Check failed: num_priors_ * loc_classes_ * 4 == bottom[0]->channels() (6856 vs. 17600) Number of priors must match number of location predictions.

Do I have to change the mbox_loss params to match the priors? As far as original SSD is concerned there was no problem when I changed the num_classes to 2.
Thanks!

Small object detection

@Robert-JunWang Hello, thanks for your great work! I want to use this network to detect the small objects in a drone dataset. The images are about 1400px1080px, and the objects are only 50px50px. The training mbox_loss maintains at 4 or 5 after 20000 iterations, the detection accuracy is only 30%. Is there any suggestions to modify the network or is there any tricks during training phase? Please help me, thank you~

fps of Pelee on TX1

At first, thank you for the good work.

Including the pre and post processings, I got 14 fps detection rate on Nvidia TX1 board. I didn't utilize tensorRT engine. Pretty good results but still want to get your comments on it. Is it reasonable with respect to fps results given for the iphones?

training on custom data

Hi,
Thanks for this new way of CNN object detection.
I have to train this on my custom data. Its not clear from the documentation. what are the steps?

Also how to use this trained model in c++ code?

training script does not work in CPU mode?

python examples/pelee/train_voc.py
Because I have not installed the cuDNN drivers yet, only run the training available in CPU mode.
It shows does not work.
Even afterwards, I did comment some lines of GPU mode related and also make change as below etc:
gpus = # "0"
num_gpus = 0 # len(gpulist)

Show the CORE dump output:

I0505 13:59:09.810305 15159 net.cpp:157] Top shape: 32 16 10 10 (51200)
I0505 13:59:09.810309 15159 net.cpp:165] Memory required for data: 5256052864
I0505 13:59:09.810313 15159 layer_factory.hpp:77] Creating layer stage4_4/branch2a
I0505 13:59:09.810322 15159 net.cpp:100] Creating Layer stage4_4/branch2a
I0505 13:59:09.810328 15159 net.cpp:434] stage4_4/branch2a <- stage4_3/concat_stage4_3/concat_0_split_1
I0505 13:59:09.810335 15159 net.cpp:408] stage4_4/branch2a -> stage4_4/branch2a
@ (nil) (unknown)
Aborted (core dumped)

@Robert-JunWang , Could you help me to figure it out? Thanks.

Bounding box sizes too large

Robert many thanks for your great work!

I am having trouble understanding why I am getting larger than expected bounding boxes for Pelee detections.

The heights and widths are not as closely cropped when compared to mobilenet-SSD implementations. I have read that you trained the model with pytorch, could the conv padding be a problem? Or is there something else I have missed?

Many Thanks,

Simon

I am using the following python script to for my test:

net_file= 'pelee.prototxt'  
caffe_model='pelee_304x304_acc7637.caffemodel' 
test_dir = "images"

if not os.path.exists(caffe_model):
    print("caffemodel does not exist")
    exit()
net = caffe.Net(net_file,caffe_model,caffe.TEST)  

CLASSES = ('background',
           'aeroplane', 'bicycle', 'bird', 'boat',
           'bottle', 'bus', 'car', 'cat', 'chair',
           'cow', 'diningtable', 'dog', 'horse',
           'motorbike', 'person', 'pottedplant',
           'sheep', 'sofa', 'train', 'tvmonitor')

def preprocess(src):
    img = cv2.resize(src, (304,304))
    img_mean = np.array([103.94, 116.78, 123.68], dtype=np.float32)
    img = img.astype(np.float32, copy=True) - img_mean
    img = img * 0.017
    return img

def postprocess(img, out):   
    h = img.shape[0]
    w = img.shape[1]
    box = out['detection_out'][0,0,:,3:7] * np.array([w, h, w, h])
    cls = out['detection_out'][0,0,:,1]
    conf = out['detection_out'][0,0,:,2]
    return (box.astype(np.int32), conf, cls)

def detect(imgfile, thresh):
    origimg = cv2.imread(imgfile)
    img = preprocess(origimg)
    img = img.astype(np.float32)
    img = img.transpose((2, 0, 1))

    net.blobs['data'].data[...] = img
    out = net.forward()  
    box, conf, cls = postprocess(origimg, out)
    for i in range(len(box)):
       if conf[i] > thresh :
          p1 = (box[i][0], box[i][1])
          p2 = (box[i][2], box[i][3])
          cv2.rectangle(origimg, p1, p2, (0,255,0))
          p3 = (max(p1[0], 15), max(p1[1], 15))
          title = "%s:%.2f" % (CLASSES[int(cls[i])], conf[i])
          cv2.putText(origimg, title, p3, cv2.FONT_ITALIC, 0.6, (0, 255, 0), 1)
    cv2.imshow("Pelee", origimg)
 
    k = cv2.waitKey(0) & 0xff
    if k == 27 : return False
    return True

for f in os.listdir(test_dir):
    if detect(test_dir + "/" + f, 0.2) == False:
       break

I test your model with the ssd/example/ssd/object_detect.cpp,it detect nothing

i have fix the mean value, what is the problom.

How

how to use the merged prototxt? Do we need to modified the convolution layer code?

mvNCCompile failed

When I trained the model and converted it to graph, I encountered the following error. Excuse me, have anyone met, how to solve it, thank you

Traceback (most recent call last):
File "/usr/local/bin/mvNCCompile", line 118, in
create_graph(args.network, args.inputnode, args.outputnode, args.outfile, args.nshaves, args.inputsize, args.weights)
File "/usr/local/bin/mvNCCompile", line 101, in create_graph
net = parse_caffe(args, myriad_config)
File "/usr/local/bin/ncsdk/Controllers/CaffeParser.py", line 503, in parse_caffe
node.concat_axis = layer.concat_param.axis
AttributeError: 'list' object has no attribute 'concat_axis'

About model size

hi all, in the paper, the trained model is 5.4M on PASCAL VOC 2007 test. However, the link to the trained model from the author is 20M~. Does the network has some differenence? Anyone can explain that for me? THX!

robert-junwang / pelee Goto Github PK

pelee's Issues

Show the CORE dump output:

Recommend Projects

Recommend Topics

Recommend Org

Jobs