GithubHelp home page GithubHelp logo

tamp's Issues

How to correctly evaluate the performance of this design?

Hi All,

Thanks for open source your design and providing interface to caffe implementation! I have a question about evaluating your performance in terms of training time of alexnet.

Here is the result of time consumption of traditional caffe over alexnet:
"""
I0208 16:09:37.450343 18112 caffe.cpp:401] Average time per layer:
I0208 16:09:37.450346 18112 caffe.cpp:404] data forward: 1.36319 ms.
I0208 16:09:37.450350 18112 caffe.cpp:407] data backward: 0.0024288 ms.
I0208 16:09:37.450354 18112 caffe.cpp:404] conv1 forward: 12.9353 ms.
I0208 16:09:37.450358 18112 caffe.cpp:407] conv1 backward: 17.3249 ms.
I0208 16:09:37.450361 18112 caffe.cpp:404] relu1 forward: 2.56323 ms.
I0208 16:09:37.450364 18112 caffe.cpp:407] relu1 backward: 3.82991 ms.
I0208 16:09:37.450368 18112 caffe.cpp:404] norm1 forward: 2.89879 ms.
I0208 16:09:37.450371 18112 caffe.cpp:407] norm1 backward: 4.46302 ms.
I0208 16:09:37.450381 18112 caffe.cpp:404] pool1 forward: 2.44754 ms.
I0208 16:09:37.450404 18112 caffe.cpp:407] pool1 backward: 8.88663 ms.
I0208 16:09:37.450408 18112 caffe.cpp:404] conv2 forward: 16.7941 ms.
I0208 16:09:37.450412 18112 caffe.cpp:407] conv2 backward: 39.8721 ms.
I0208 16:09:37.450414 18112 caffe.cpp:404] relu2 forward: 1.65409 ms.
I0208 16:09:37.450417 18112 caffe.cpp:407] relu2 backward: 2.42135 ms.
I0208 16:09:37.450438 18112 caffe.cpp:404] norm2 forward: 2.13315 ms.
I0208 16:09:37.450440 18112 caffe.cpp:407] norm2 backward: 3.20389 ms.
I0208 16:09:37.450443 18112 caffe.cpp:404] pool2 forward: 1.60964 ms.
I0208 16:09:37.450446 18112 caffe.cpp:407] pool2 backward: 5.85156 ms.
I0208 16:09:37.450449 18112 caffe.cpp:404] conv3 forward: 8.77778 ms.
I0208 16:09:37.450453 18112 caffe.cpp:407] conv3 backward: 18.8411 ms.
I0208 16:09:37.450455 18112 caffe.cpp:404] relu3 forward: 0.600653 ms.
I0208 16:09:37.450459 18112 caffe.cpp:407] relu3 backward: 0.890301 ms.
I0208 16:09:37.450462 18112 caffe.cpp:404] conv4 forward: 6.55406 ms.
I0208 16:09:37.450465 18112 caffe.cpp:407] conv4 backward: 15.3713 ms.
I0208 16:09:37.450469 18112 caffe.cpp:404] relu4 forward: 0.601238 ms.
I0208 16:09:37.450472 18112 caffe.cpp:407] relu4 backward: 0.891581 ms.
I0208 16:09:37.450475 18112 caffe.cpp:404] conv5 forward: 4.26468 ms.
I0208 16:09:37.450479 18112 caffe.cpp:407] conv5 backward: 9.63692 ms.
I0208 16:09:37.450481 18112 caffe.cpp:404] relu5 forward: 0.404426 ms.
I0208 16:09:37.450484 18112 caffe.cpp:407] relu5 backward: 0.618806 ms.
I0208 16:09:37.450489 18112 caffe.cpp:404] pool5 forward: 0.393098 ms.
I0208 16:09:37.450491 18112 caffe.cpp:407] pool5 backward: 1.35045 ms.
I0208 16:09:37.450495 18112 caffe.cpp:404] fc6 forward: 6.99653 ms.
I0208 16:09:37.450497 18112 caffe.cpp:407] fc6 backward: 5.61412 ms.
I0208 16:09:37.450500 18112 caffe.cpp:404] relu6 forward: 0.0519008 ms.
I0208 16:09:37.450503 18112 caffe.cpp:407] relu6 backward: 0.0635392 ms.
I0208 16:09:37.450507 18112 caffe.cpp:404] drop6 forward: 0.0957504 ms.
I0208 16:09:37.450510 18112 caffe.cpp:407] drop6 backward: 0.0612928 ms.
I0208 16:09:37.450513 18112 caffe.cpp:404] fc7 forward: 1.68963 ms.
I0208 16:09:37.450516 18112 caffe.cpp:407] fc7 backward: 2.68684 ms.
I0208 16:09:37.450520 18112 caffe.cpp:404] relu7 forward: 0.0450144 ms.
I0208 16:09:37.450522 18112 caffe.cpp:407] relu7 backward: 0.070224 ms.
I0208 16:09:37.450526 18112 caffe.cpp:404] drop7 forward: 0.0842816 ms.
I0208 16:09:37.450529 18112 caffe.cpp:407] drop7 backward: 0.0615776 ms.
I0208 16:09:37.450532 18112 caffe.cpp:404] fc8 forward: 0.73568 ms.
I0208 16:09:37.450536 18112 caffe.cpp:407] fc8 backward: 0.736986 ms.
I0208 16:09:37.450538 18112 caffe.cpp:404] loss forward: 0.145184 ms.
I0208 16:09:37.450541 18112 caffe.cpp:407] loss backward: 0.0344864 ms.
I0208 16:09:37.450556 18112 caffe.cpp:412] Average Forward pass: 76.0813 ms.
I0208 16:09:37.450561 18112 caffe.cpp:414] Average Backward pass: 142.982 ms.
I0208 16:09:37.450568 18112 caffe.cpp:416] Average Forward-Backward: 219.162 ms.
I0208 16:09:37.450574 18112 caffe.cpp:418] Total Time: 2191.62 ms.
"""

Here is the result of time consumption of caffe with your circulant design over alexnet:
"""
I0208 16:11:00.087167 19202 caffe.cpp:401] Average time per layer:
I0208 16:11:00.087170 19202 caffe.cpp:404] data forward: 1.37564 ms.
I0208 16:11:00.087175 19202 caffe.cpp:407] data backward: 0.0028576 ms.
I0208 16:11:00.087180 19202 caffe.cpp:404] conv1 forward: 12.8175 ms.
I0208 16:11:00.087184 19202 caffe.cpp:407] conv1 backward: 16.9658 ms.
I0208 16:11:00.087188 19202 caffe.cpp:404] relu1 forward: 2.56445 ms.
I0208 16:11:00.087190 19202 caffe.cpp:407] relu1 backward: 3.82018 ms.
I0208 16:11:00.087193 19202 caffe.cpp:404] norm1 forward: 2.90364 ms.
I0208 16:11:00.087198 19202 caffe.cpp:407] norm1 backward: 4.49031 ms.
I0208 16:11:00.087199 19202 caffe.cpp:404] pool1 forward: 2.52628 ms.
I0208 16:11:00.087203 19202 caffe.cpp:407] pool1 backward: 9.33765 ms.
I0208 16:11:00.087205 19202 caffe.cpp:404] conv2 forward: 16.5641 ms.
I0208 16:11:00.087209 19202 caffe.cpp:407] conv2 backward: 39.1221 ms.
I0208 16:11:00.087213 19202 caffe.cpp:404] relu2 forward: 1.6541 ms.
I0208 16:11:00.087215 19202 caffe.cpp:407] relu2 backward: 2.42211 ms.
I0208 16:11:00.087218 19202 caffe.cpp:404] norm2 forward: 2.12487 ms.
I0208 16:11:00.087221 19202 caffe.cpp:407] norm2 backward: 3.1529 ms.
I0208 16:11:00.087224 19202 caffe.cpp:404] pool2 forward: 1.72133 ms.
I0208 16:11:00.087236 19202 caffe.cpp:407] pool2 backward: 6.10486 ms.
I0208 16:11:00.087258 19202 caffe.cpp:404] conv3 forward: 8.78861 ms.
I0208 16:11:00.087263 19202 caffe.cpp:407] conv3 backward: 18.4702 ms.
I0208 16:11:00.087266 19202 caffe.cpp:404] relu3 forward: 0.603286 ms.
I0208 16:11:00.087285 19202 caffe.cpp:407] relu3 backward: 0.89009 ms.
I0208 16:11:00.087288 19202 caffe.cpp:404] conv4 forward: 6.57151 ms.
I0208 16:11:00.087292 19202 caffe.cpp:407] conv4 backward: 15.254 ms.
I0208 16:11:00.087296 19202 caffe.cpp:404] relu4 forward: 0.602928 ms.
I0208 16:11:00.087299 19202 caffe.cpp:407] relu4 backward: 0.888906 ms.
I0208 16:11:00.087302 19202 caffe.cpp:404] conv5 forward: 4.26599 ms.
I0208 16:11:00.087306 19202 caffe.cpp:407] conv5 backward: 9.4658 ms.
I0208 16:11:00.087309 19202 caffe.cpp:404] relu5 forward: 0.404566 ms.
I0208 16:11:00.087313 19202 caffe.cpp:407] relu5 backward: 0.602019 ms.
I0208 16:11:00.087316 19202 caffe.cpp:404] pool5 forward: 0.431626 ms.
I0208 16:11:00.087321 19202 caffe.cpp:407] pool5 backward: 1.40352 ms.
I0208 16:11:00.087323 19202 caffe.cpp:404] fc6 forward: 6.23818 ms.
I0208 16:11:00.087327 19202 caffe.cpp:407] fc6 backward: 9.62638 ms.
I0208 16:11:00.087329 19202 caffe.cpp:404] relu6 forward: 0.0550912 ms.
I0208 16:11:00.087333 19202 caffe.cpp:407] relu6 backward: 0.069808 ms.
I0208 16:11:00.087337 19202 caffe.cpp:404] fc7 forward: 10.9973 ms.
I0208 16:11:00.087340 19202 caffe.cpp:407] fc7 backward: 9.6887 ms.
I0208 16:11:00.087343 19202 caffe.cpp:404] relu7 forward: 0.0512128 ms.
I0208 16:11:00.087347 19202 caffe.cpp:407] relu7 backward: 0.0694688 ms.
I0208 16:11:00.087350 19202 caffe.cpp:404] fc8 forward: 0.450304 ms.
I0208 16:11:00.087354 19202 caffe.cpp:407] fc8 backward: 0.733661 ms.
I0208 16:11:00.087357 19202 caffe.cpp:404] loss forward: 0.136755 ms.
I0208 16:11:00.087362 19202 caffe.cpp:407] loss backward: 0.0364608 ms.
I0208 16:11:00.087378 19202 caffe.cpp:412] Average Forward pass: 84.08 ms.
I0208 16:11:00.087383 19202 caffe.cpp:414] Average Backward pass: 152.801 ms.
I0208 16:11:00.087388 19202 caffe.cpp:416] Average Forward-Backward: 236.982 ms.
I0208 16:11:00.087394 19202 caffe.cpp:418] Total Time: 2369.82 ms.
"""

I evaluate time performance with the built-in support of caffe over single GTX1080. But I believe you did speed up since I could tell from the time-stamp of saved models by caffe. May I ask how to correctly evaluate your time performance?

Thanks!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.