Comments (2)
The NUMBER_OF_SIMULTANEOUS_RUNS parameter is available in all versions of SPECFEM and would be a useful target for this issue. It allows a user to submit one large job for N events, each event running on P processors. Rather than submitting N array jobs, each running on P cores, the user submits one job on NxP cores, and SPECFEM internally distributes the events across those cores.
I need to test this capability and see what the finer details are, but I think SeisFlows can take advantage of this capability to submit large, long queue time, high core-number jobs.
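The difference between the two submission styles is only arithmetic, but it helps to see it spelled out. The sketch below is illustrative and not part of SeisFlows; the function name `job_size` and the returned dictionary layout are hypothetical.

```python
def job_size(n_events: int, nproc_per_event: int) -> dict:
    """Compare the core request of N array jobs against one bundled job.

    n_events is N (number of events) and nproc_per_event is P (processors
    per event), following the notation in the comment above.
    """
    if n_events < 1 or nproc_per_event < 1:
        raise ValueError("n_events and nproc_per_event must be >= 1")
    return {
        # Array submission: N separate jobs, each on P cores
        "array": {"jobs": n_events, "cores_per_job": nproc_per_event},
        # Simultaneous runs: one job on N x P cores
        "simultaneous": {"jobs": 1, "cores_per_job": n_events * nproc_per_event},
    }


if __name__ == "__main__":
    sizes = job_size(n_events=10, nproc_per_event=4)
    print(sizes["array"])
    print(sizes["simultaneous"])
```

Both styles request the same total number of cores; what changes is whether the scheduler sees many small jobs or a single large one.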
Notes on the NUMBER_OF_SIMULTANEOUS_RUNS parameter (developing with Global code)
- Each run???? directory other than run0001 does not require a Par_file
- Failed runs create text files with names like 'run0001_failed' or 'run_with_local_rank_00000000and_global_rank_00000000_failed'
- The ROOTDIR LOCAL_PATH directories (OUTPUT_FILES and DATABASES_MPI) can be empty (when using the broadcast_mesh_and_model parameter). Only run0001 requires actual mesh and model files
- ROOTDIR/DATA only requires a Par_file but not a CMTSOLUTION or STATIONS file
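The notes above can be turned into a rough directory-setup sketch. This is not the SeisFlows implementation. The function names are hypothetical, the per-run subdirectories (DATA, OUTPUT_FILES, DATABASES_MPI) are an assumption about the layout SPECFEM expects, and the failed-run search simply looks for any file ending in `_failed` because the notes do not say where those files are written.

```python
from pathlib import Path

# Assumed per-run subdirectories; not stated in the notes above
RUN_SUBDIRS = ("DATA", "OUTPUT_FILES", "DATABASES_MPI")


def setup_run_dirs(rootdir, n_runs):
    """Create ROOTDIR plus run0001..runNNNN skeleton directories."""
    root = Path(rootdir)
    # ROOTDIR/DATA only needs a Par_file; the LOCAL_PATH directories
    # (OUTPUT_FILES, DATABASES_MPI) may stay empty per the notes above
    for name in RUN_SUBDIRS:
        (root / name).mkdir(parents=True, exist_ok=True)
    run_dirs = []
    for i in range(1, n_runs + 1):
        run_dir = root / f"run{i:04d}"  # matches the run???? pattern
        for name in RUN_SUBDIRS:
            (run_dir / name).mkdir(parents=True, exist_ok=True)
        run_dirs.append(run_dir)
    return run_dirs


def find_failed_markers(rootdir):
    """Return names of '*_failed' text files found anywhere under rootdir."""
    return sorted(p.name for p in Path(rootdir).rglob("*_failed") if p.is_file())
```

After a run, `find_failed_markers` gives a quick way to check whether any event in the bundle failed without parsing solver logs.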
Outline of what will need to be changed:
- Solver must initialize working directories to match required SPECFEM structure (cdf4841)
- System needs to submit one large job rather than array job
- Workflow needs to be adjusted to bundle jobs differently since things like preprocessing cannot be included in this large many-core job
- Preprocessing and bookkeeping need to be restructured, as they are currently handled by each solver array job. This will likely require some code restructuring.
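One possible way to bundle events for the single large job is sketched below. It is an assumption about how the workflow change could look, not a description of existing SeisFlows code; `bundle_events` and its arguments are hypothetical names. Each bundle maps run???? directory names to event identifiers, so preprocessing can still loop over events separately, outside the many-core solver job.

```python
def bundle_events(events, max_simultaneous):
    """Split events into bundles of at most `max_simultaneous` runs.

    Each bundle is a dict mapping a run directory name (run0001, run0002,
    ...) to the event assigned to it. Numbering restarts in every bundle
    because each bundle would be submitted as its own large job.
    """
    if max_simultaneous < 1:
        raise ValueError("max_simultaneous must be >= 1")
    bundles = []
    for start in range(0, len(events), max_simultaneous):
        chunk = events[start:start + max_simultaneous]
        bundles.append(
            {f"run{i:04d}": event for i, event in enumerate(chunk, start=1)}
        )
    return bundles


if __name__ == "__main__":
    for bundle in bundle_events(["ev_a", "ev_b", "ev_c"], max_simultaneous=2):
        print(bundle)
```

Keeping the event-to-run mapping explicit means bookkeeping that is currently done inside each array job could instead be driven from this mapping after the solver job finishes.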