The python-performance-tips from timtan

#python-performance-tips

##Total Requirements

Repository

https://github.com/timtan/python-performance-tips.git

###Platform Dependency

mac
- XCode
- libevent

python dependency

pip install -r requirement.txt

##Profile

Analyze profile_sample1

python -m cProfile -s cumulative profile_sample1.py

-m means directly invoke the module
-s is the sort order.

you will see output like the following:

       5400283 function calls (5400273 primitive calls) in 5.527 seconds

 Ordered by: cumulative time

 ncalls  tottime  percall  cumtime  percall filename:lineno(function)
      1    1.062    1.062    5.527    5.527 profile_sample1.py:1(<module>)
 900000    1.352    0.000    4.441    0.000 profile_sample1.py:3(find_domain)
 900000    0.521    0.000    1.900    0.000 re.py:188(compile)
 900000    1.104    0.000    1.380    0.000 re.py:229(_compile)
 900000    0.958    0.000    0.958    0.000 {built-in method match}
 900000    0.262    0.000    0.262    0.000 {method 'get' of 'dict' objects}
 900000    0.231    0.000    0.231    0.000 {built-in method group}
      1    0.024    0.024    0.024    0.024 {range}
      1    0.000    0.000    0.013    0.013 sre_compile.py:495(compile)
      1    0.000    0.000    0.013    0.013 sre_compile.py:480(_code)

functions are ordered by cumulative execution time. and you have a rough idea what function cost much time. it this example. the candidates are find_domain, re.compile.

with the about clue, first we can try compile the pattern just once.

  2700286 function calls (2700276 primitive calls) in 2.780 seconds

 Ordered by: cumulative time

 ncalls  tottime  percall  cumtime  percall filename:lineno(function)
      1    0.950    0.950    2.780    2.780 profile_sample2.py:1(<module>)
 900000    0.810    0.000    1.806    0.000 profile_sample2.py:3(find_domain)
 900000    0.812    0.000    0.812    0.000 {built-in method match}
 900000    0.184    0.000    0.184    0.000 {built-in method group}
      1    0.024    0.024    0.024    0.024 {range}
      1    0.000    0.000    0.000    0.000 re.py:188(compile)
      1    0.000    0.000    0.000    0.000 re.py:229(_compile)

How Profile Works

to read the doc, what-is-deterministic-profiling, we can know cProfile monitor function call, function return and exception events. the profiling is fine gran. because python automatically provides hooks for each event.

what if I have no knowledge that compile can be reduced to once

there are some reason that you cannot adopt the above process

you don't have knowledge about how to accerlate.
you are lack of time

Some lazy approach

the original python

time python profile_sample1.py

real	0m7.234s
user	0m7.111s
sys	    0m0.086s

the amazing pypy

time pypy  profile_sample1.py

real	0m1.348s
user	0m0.980s
sys	    0m0.054s

drawback of pypy

will you open ppt from keynote?

the pypy cannot using c module. (don't be scare, standard library written in c is rewrite)

cython

before using cython, it is easier to have a setup.py first.

reference

please type the command first

make

before using cython

time profile_sample1

real	0m2.938s
user	0m2.912s
sys	0m0.024s

after using cython

time cython_sample 

real	0m2.800s
user	0m2.775s
sys	0m0.022s

Cython Reference

Multiprocessing and multithreading

the sample program taks 7 second.

time python  computation_parallel_example.py
	
real	0m7.084s
user	0m6.639s
sys	    0m0.403s

however, the multithreading version are also take 7 second :(

time python  computation_parallel_example_threading.py 


real	0m7.009s
user	0m6.560s
sys	0m0.429s

multiprocessing version with 2 process

time python computation_parallel_multiprocessing.py 

real	0m4.537s
user	0m7.682s
sys	    0m0.589s

a very good explanation of GIL

Gevent

victor's good introduction

timtan / python-performance-tips Goto Github PK

python-performance-tips's Introduction

Repository

python dependency

Analyze profile_sample1

How Profile Works

what if I have no knowledge that compile can be reduced to once

Some lazy approach

drawback of pypy

cython

Cython Reference

Multiprocessing and multithreading

Gevent

python-performance-tips's People

Contributors

Stargazers

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs