Comments (3)
I am sharing this plot, which shows some of the options (not a complete list of what is possible). The runs were performed on an Intel Xeon processor, a slightly older model (a few years old). The plot also shows the variation with the number of cores, to see how well the code scales for a cantilever beam simulation with around 20,000 shell elements (I would not expect it to scale very well beyond 4 cores). Here you can see that adding -Ofast, adding -xHost and removing the -ax... line reduces the CPU time. Adding -static gives no noticeable improvement over -Ofast on this plot. The black vertical line indicates a factor of two in the total CPU time.
Compiler_comparisons_CPU_time.pdf
Note that the speed-up might be even bigger on newer CPUs.
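For anyone who wants to put numbers on a comparison like this, here is a minimal Python sketch of the arithmetic. The timings in it are hypothetical placeholders, not values read from the plot, and the build labels and function names are made up for illustration. It also assumes the timings are elapsed times per run; if you record total CPU time summed over cores, the efficiency formula would need adjusting.

```python
def speedup(baseline_time, optimized_time):
    """Return how many times faster the optimized build is than the baseline."""
    return baseline_time / optimized_time


def parallel_efficiency(time_one_core, time_n_cores, n_cores):
    """Return parallel efficiency, where 1.0 would mean perfect scaling.

    Assumes both arguments are elapsed times for the same model.
    """
    return time_one_core / (n_cores * time_n_cores)


if __name__ == "__main__":
    # Hypothetical placeholder timings in seconds, keyed by number of cores.
    # Replace these with your own measurements.
    timings = {
        "generic flags": {1: 1000.0, 2: 520.0, 4: 290.0},
        "-Ofast -xHost": {1: 500.0, 2: 265.0, 4: 150.0},
    }
    baseline = timings["generic flags"]
    for label, runs in timings.items():
        for cores, seconds in sorted(runs.items()):
            gain = speedup(baseline[cores], seconds)
            eff = parallel_efficiency(runs[1], seconds, cores)
            print(f"{label:15s} cores={cores} speed-up={gain:.2f} efficiency={eff:.2f}")
```

A speed-up of 2.0 corresponds to the factor of two marked by the vertical line, and an efficiency dropping well below 1.0 at higher core counts is what poor scaling would look like.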
from openradioss.
As requested, the details of the CPU are: Intel(R) Xeon(R) Gold 6234 CPU @ 3.30GHz (x86_64), 3300 MHz, 515398 MB RAM
Thanks a lot for the finding! This is really interesting.
We are trying to reproduce the result and to understand better which option(s) the improvement comes from and whether it changes the numerical answer.
The generic options we provide allow the code to run on many platforms. Using -xHost when the compilation machine and the run machine are the same sounds like a good tip.
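Since -xHost targets the instruction set of the machine doing the compilation, one rough way to check whether a different run machine is likely to be compatible is to compare the CPU feature flags of the two hosts. The sketch below is only an illustration under some assumptions: both machines are Linux x86_64, their `/proc/cpuinfo` contents have been saved to text, and the function names are hypothetical. A non-empty result only hints at a possible problem; which instructions the binary actually uses depends on the compiler.

```python
def cpu_flags(cpuinfo_text):
    """Extract the set of feature flags from /proc/cpuinfo-style text."""
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() == "flags":
            return set(value.split())
    return set()


def missing_on_run_host(build_cpuinfo, run_cpuinfo):
    """Return features the build host reports that the run host does not."""
    return cpu_flags(build_cpuinfo) - cpu_flags(run_cpuinfo)


if __name__ == "__main__":
    # Hypothetical, heavily shortened cpuinfo excerpts for illustration only.
    build_host = "model name : hypothetical build CPU\nflags : fpu sse2 avx avx2 avx512f\n"
    run_host = "model name : hypothetical run CPU\nflags : fpu sse2 avx avx2\n"
    missing = missing_on_run_host(build_host, run_host)
    if missing:
        print("Run host lacks:", ", ".join(sorted(missing)))
    else:
        print("Run host reports every feature of the build host.")
```

In practice the two inputs would come from reading `/proc/cpuinfo` on each machine. When the compile and run machine are the same, as in the tip above, the check is trivially satisfied.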