Comments (3)
What are the details of the job being run? Is it asking for multiple nodes/cores or GPUs?
from psij-python.
test_run.py is really just running /bin/date
:
#!/usr/bin/env python3
# This is meant as a simple test file to check if psij was installed successfully
import sys
from psij import Job, JobExecutor, JobSpec
if __name__ == '__main__':
name = 'local'
if len(sys.argv) > 1:
name = sys.argv[1]
jobs = list()
job = Job(JobSpec(executable='/bin/date'))
exec = JobExecutor.get_instance(name)
exec.submit(job)
jobs.append(job)
print('Job submitted')
status = job.wait()
print('Job done: {}'.format(status))
The generated flux spec is:
{'attributes': {'system': {'duration': 600.0}},
'resources': [{'count': 1,
'label': 'task',
'type': 'slot',
'with': [{'count': 1, 'type': 'core'},
{'count': 1, 'type': 'gpu'}]}],
'tasks': [{'command': ['/bin/date'],
'count': {'per_slot': 1},
'slot': 'task'}],
'version': 1}
I actually don't understand why a GPU is requested, but either way, a GPU is available also.
Edit: Duh, the gpu is requested because of this:
jspec.resources = ResourceSpecV1(process_count=1,
processes_per_node=1,
cpu_cores_per_process=1,
gpu_cores_per_process=1,
exclusive_node_use=True)
That looks at least incomplete / wrong. Will fix. The question remains why does the spec fail to run?
from psij-python.
The question remains why does the spec fail to run?
That's a good quesiton. It may be because the Flux instance you are submitting to does not include the flux-sched Fluxion scheduler. If you run flux module list
is sched-simple
loaded? If so, I believe that is the issue since that scheduler does not support GPUs. The other issue could be Flux not detecting the GPUs due to an hwloc that isn't configured against CUDA. Does flux resource list
report any GPUs?
from psij-python.
Related Issues (20)
- polling for jobs removed from the queue does not properly mark them as completed HOT 6
- Documentation: Example for how to load modules HOT 2
- Implement https://github.com/ExaWorks/job-api-spec/pull/168 HOT 1
- Implement https://github.com/ExaWorks/job-api-spec/pull/172
- Incorrect job duration formatting for SLURM HOT 2
- Proposal: Allow JSON-serializable format for `duration` in `JobAttributes` HOT 7
- `main` branch breaks `custom_attributes` (at least for Slurm) HOT 3
- Some words are cut off on PSI/J Python Webpage HOT 1
- Next steps for a new release? HOT 4
- The `environment` kwarg in `JobSpec` gets the wrong number of quotation marks if there are spaces HOT 4
- Not able to get ResourceSpecV1 working with PBS HOT 4
- PBS walltime hour is being formatted as a float causing job submission failure
- Account vs. project HOT 2
- Dependency on typeguard 2.12 conflicts with other packages. HOT 3
- Leftover of `distutils.` in 0.9.5 HOT 1
- Dependency `filelock` move to requirements-tests HOT 1
- `requirements.txt` missing from PyPI source code HOT 4
- Deprecation warning for "project_name" is always triggered HOT 2
- CANCELED status after cancel() HOT 1
- Feature request: support for psutil 6.0 HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from psij-python.