
Comments (13)

Robert-JunWang avatar Robert-JunWang commented on July 24, 2024

No. PeleeNet is built with conventional convolution. Depthwise convolution can reduce the number of multiply-adds, but the real speed depends on the device and framework you use. For example, both MobileNet+SSD and Pelee are much faster than TinyYOLO on a CPU and on an iPhone 6s, but TinyYOLO is slightly faster than Pelee on a GTX 1080 Ti GPU and on an iPhone 8, even though TinyYOLO's multiply-add count is about 3 times larger than Pelee's.
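The multiply-add gap the comment describes can be sketched with a quick count. A minimal sketch, assuming a made-up layer shape (the numbers below are illustrative, not from the paper):

```python
# Rough multiply-add (MAC) comparison for one layer, illustrating why
# depthwise separable convolution cuts the op count but may not cut
# wall-clock time (real speed depends on hardware and framework).

def conv_macs(h, w, c_in, c_out, k):
    """Standard convolution: every output channel sees every input channel."""
    return h * w * c_in * c_out * k * k

def dw_separable_macs(h, w, c_in, c_out, k):
    """Depthwise (k x k per channel) followed by pointwise (1 x 1)."""
    depthwise = h * w * c_in * k * k
    pointwise = h * w * c_in * c_out
    return depthwise + pointwise

# Hypothetical 3x3 layer on a 56x56 feature map with 128 channels in/out.
h, w, c_in, c_out, k = 56, 56, 128, 128, 3
standard = conv_macs(h, w, c_in, c_out, k)
separable = dw_separable_macs(h, w, c_in, c_out, k)
print(f"standard:  {standard:,} MACs")
print(f"separable: {separable:,} MACs")
print(f"reduction: {standard / separable:.1f}x")
```

The count drops by roughly an order of magnitude, which is exactly why a network with 3x the MACs can still win on some hardware: the depthwise kernels have low arithmetic intensity and were poorly optimized in many frameworks at the time.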

from pelee.

Robert-JunWang avatar Robert-JunWang commented on July 24, 2024

Most of the work in this paper was completed 8 months ago. At that time, frameworks other than TensorFlow had poor support for depthwise separable convolution. However, the situation has changed a lot since then: the performance of grouped convolution has improved greatly in cuDNN v7, and Apple's Core ML also supports grouped convolution very well.

It is a good time to try a depthwise separable convolution version now. I am more interested in improving accuracy by increasing the number of channels with depthwise separable convolution. Both MobileNet and PeleeNet can perform image classification on an iPhone 6s, a phone released three years ago, in less than 50 ms. That speed is good enough for many on-device applications.

kaishijeng avatar kaishijeng commented on July 24, 2024

Pelee detection on my RK3399 (2-core ARM A72 at 1.8 GHz) takes 400 ms per frame,
so the performance is not sufficient for my application (<100 ms).
Thanks,

Robert-JunWang avatar Robert-JunWang commented on July 24, 2024

kaishijeng avatar kaishijeng commented on July 24, 2024

I use the Caffe framework and your pretrained model.
How do I check whether the BN and Conv layers are merged?

Thanks,
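One simple way to check, assuming a Caffe prototxt, is to count the layer types in the file: a deploy model whose BN has been merged into the conv weights should report no BatchNorm or Scale layers. The snippet and sample fragment below are illustrative:

```python
import re

def count_layer_types(prototxt_text):
    """Count occurrences of each layer type in Caffe prototxt text."""
    counts = {}
    for layer_type in re.findall(r'type:\s*"(\w+)"', prototxt_text):
        counts[layer_type] = counts.get(layer_type, 0) + 1
    return counts

# Tiny hypothetical fragment; a merged model would show no BatchNorm/Scale.
sample = '''
layer { name: "stem1" type: "Convolution" }
layer { name: "stem1/bn" type: "BatchNorm" }
layer { name: "stem1/scale" type: "Scale" }
'''

counts = count_layer_types(sample)
merged = counts.get("BatchNorm", 0) == 0 and counts.get("Scale", 0) == 0
print("BN merged" if merged else f"BN still present: {counts}")
```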

foralliance avatar foralliance commented on July 24, 2024

@Robert-JunWang Hi,

According to the models you provided, the BN/Scale layers still exist as separate layers in train/test.prototxt, for example:

layer {
  name: "stem1/bn"
  type: "BatchNorm"
  bottom: "stem1"
  top: "stem1"
  param {
    lr_mult: 0
    decay_mult: 0
  }
  param {
    lr_mult: 0
    decay_mult: 0
  }
  param {
    lr_mult: 0
    decay_mult: 0
  }
  batch_norm_param {
    moving_average_fraction: 0.999
    eps: 0.001
  }
}
layer {
  name: "stem1/scale"
  type: "Scale"
  bottom: "stem1"
  top: "stem1"
  scale_param {
    filler {
      value: 1
    }
    bias_term: true
    bias_filler {
      value: 0
    }
  }
}

Does this mean that the Conv and BN layers are not merged? If so, why is the inference time still so fast?

Does the so-called "automatic merging of the BN layer with the Conv layer" mean that we need to modify the underlying C++ code?

Robert-JunWang avatar Robert-JunWang commented on July 24, 2024

The models I provided are not merged. You can merge them by hand or with other tools. I can add the merged model and the script I used next week.
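For reference, the merge is a closed-form rescaling of the conv weights, since BatchNorm (plus Caffe's Scale layer) applies a per-channel affine transform at inference time. A minimal NumPy sketch, with illustrative function and variable names (the author's script mentioned above is the authoritative version):

```python
import numpy as np

def fold_bn_into_conv(W, b, mean, var, gamma, beta, eps=1e-3):
    """Fold BatchNorm (+Scale) into the preceding convolution.

    W: (c_out, c_in, k, k) conv weights; b: (c_out,) conv bias.
    mean/var: BN running statistics; gamma/beta: Scale layer params.
    Returns (W_folded, b_folded) such that conv+BN == a single conv.
    """
    std = np.sqrt(var + eps)
    scale = gamma / std                         # per-output-channel factor
    W_folded = W * scale[:, None, None, None]   # rescale each output filter
    b_folded = (b - mean) * scale + beta        # shift the bias accordingly
    return W_folded, b_folded
```

This is why the merged model runs faster with identical outputs: the BN arithmetic disappears into the weights, so no C++ changes are needed, only an offline rewrite of the caffemodel.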

foralliance avatar foralliance commented on July 24, 2024

Looking forward to your update.

ujsyehao avatar ujsyehao commented on July 24, 2024

@Robert-JunWang Hi, I have two questions:

  • I see that you have uploaded a merged model. What is the principle behind the merge? Does merging BN with the convolution layer change the original convolution computation?

  • Your paper says:

we use a shallow and wide network structure to compensate for the negative impact on accuracy caused by this change

Is there any theoretical support for this? In general, a deeper network yields higher accuracy.

siyiding1216 avatar siyiding1216 commented on July 24, 2024

Are you comparing the latency of your model against MobileNet SSD without depthwise convolution?
MobileNet SSD can run 4 times faster on GPU and 10 times faster on CPU with a depthwise convolution implementation. That would make your model several times slower than MobileNet SSD with depthwise conv...

Robert-JunWang avatar Robert-JunWang commented on July 24, 2024

Do you mean MobileNetV1+SSDLite is 10 times faster than MobileNetV1+SSD on CPU? Would you mind providing more detail on how you evaluated the speed and arrived at that result? In my understanding, the computational cost of the SSD algorithm is mostly in the backbone network, so the real speed difference between SSDLite and the original SSD should not be that big.

xonobo avatar xonobo commented on July 24, 2024

As far as I understand, the merged models do not contain any batch normalization: just Convolution and ReLU layers, right?

lqs19881030 avatar lqs19881030 commented on July 24, 2024

@Robert-JunWang What is the meaning of using half the number of dense layers with a doubled growth rate? Can you illustrate it, and does the mAP drop much? Thank you.
