Comments (5)
There seem to be two ways of calculating FPS.
1.via the 'time' model of caffe with batch_size = 1:
build/tools/caffe time -model=models/VGGNet/VOC0712/refinedet_vgg16_320x320/deploy.prototxt -gpu=0
2.get the time interval and estimate the FPS.
for example, got the following lines after I run examples/ssd/score_ssd_pascal.py:
I0713 20:50:25.756376 117599 net.cpp:684] Ignoring source layer mbox_loss
I0713 20:51:32.472930 117599 solver.cpp:531] Test net output #0: detection_eval = 0.723217
I0713 20:51:32.473057 117599 solver.cpp:325] Optimization Done.
The total time spent on evaluating 4952 test images is about 67 seconds, and thus FPS = 4952 / 67 = 74.
Which method should be used to calculate FPS? Is there any other way?
from pelee.
The FPS of intel is got by Caffe time. The FPS of iPhone is calculated by the average time of 100 images processed by the CoreML model. This time includes the image pre-processing time, but it does not include the time of the post-processing part (decoding the bounding-boxes and performing non-maximum suppression). Usually, post-processing is done on the CPU, which can be executed asynchronously with the other parts that are executed on mobile GPU. Hence, the actual speed should be very close to our test result.
from pelee.
I think both methods are fine, as long as you evaluate all the models in the same way.
from pelee.
@Robert-JunWang
thanks your reply
The difference between the two approaches is a bit big.
from pelee.
@foralliance what is batch_size in your method2?
I think batch size affect speed much.
from pelee.
Related Issues (20)
- fine tuning with different number of classes HOT 1
- peele-SSD add_extra_layers_pelee
- About the stanford dog dataset. HOT 3
- 2-way dense layer in code and paper seems mismatch. HOT 2
- how can i get the fps=120 on nvidia tx2? please help me HOT 1
- Calculation of number of parameter, macc, and flops HOT 3
- pytorch pretained model
- max_iter
- Does it support 512 or bigger input size? HOT 1
- question for iteration HOT 1
- can not download the pretrained PeleeNet model
- one question
- Question for peleeNet structure
- How can I train my own model?
- train error
- peleeNet speed in GTX1080ti HOT 1
- Question about 1x1 convolutional kernels to reduce computational cost
- the paper was accepted two years ago???
- Pelee input resolution problem
- question about merge bn?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pelee.