GithubHelp home page GithubHelp logo

Comments (5)

LaurentPlagne avatar LaurentPlagne commented on May 13, 2024

I agree.
In addition, I am not totally confident about the zero overhead of a single threaded (numThreads=1) OMP version compared to a non OMP one.
Several sequential transpositions could run in concurrent threads...

from hptt.

springer13 avatar springer13 commented on May 13, 2024

I have just pushed a version which doesn't rely on openmp anylonger. Indeed, HPTT is now building with clang. However, you have to make sure that you are on an AVX-enabled CPU since the explicit vectorization requies AVX (the makefile currently uses -march=native). I might change the build system to CMAKE in the near future to detect these settings automatically.

If you don't want to run on an AVX-enabled CPU you currently only have the option to build the scalar version ("make scalar").

If you want to compile HPTT on system that doesn't support AVX, you have to make sure that you enable avx (e.g., -mavx).

@LaurentPlagne regarding the overhead, please notice that HPTT also exposes an expert interface via execute_expert<...> which indeed entirely avoid the overhead of spawning threads. Moreover, this interface is especially useful if HPTT is called from within a parallel region where threads have already been spawned.

from hptt.

LaurentPlagne avatar LaurentPlagne commented on May 13, 2024

Oops, I am discovering HPTT and I missed the execute_expert<...> API. Thank you for mentioning it !

from hptt.

springer13 avatar springer13 commented on May 13, 2024

No worries, HPTT is not (well) documented. I will have to find the time to do this at some point. In the meantime, please feel free to open issues if you run into any problems.

from hptt.

solomonik avatar solomonik commented on May 13, 2024

Now builds with clang for me and runs with CTF without error when building with make scalar. My desktop evidently does not have AVX. Will test on some larger machines eventually that ought to have AVX.

Thanks for your help!

from hptt.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.