Comments (4)

pb-cdunn commented on May 27, 2024

Can you be more clear? What software are you running? What is your environment? Your configuration? What is your precise use-case? In what way is it failing?

For Falcon, pypeflow already does what you seem to want. If any task fails, it will be re-run when pypeflow is re-invoked (fc_run/fc_unzip/etc). We could retry failed tasks without quitting, but that has never been an important use-case, since it's so easy to restart.

As for keeping partial results, you need to be very specific. We already partition the workflow. I see no reason why a given task cannot be restarted from scratch if it failed before. If a task is too large, you can control that yourself by altering pa_daligner_option or pa_DBsplit_option.

To put a runtime limit on a task, use pwatcher_type=blocking and specify the limits in the submit string. If you need different limits for different sections, you can specify your own variables (using ALL_CAPS) to be substituted into your own submit string. It's very flexible.
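
For example, here is a rough sketch of a config fragment (the [job.defaults] section name, the SGE -l h_rt resource, and the MY_RUNTIME variable are assumptions for illustration; adapt them to your site):

[job.defaults]
pwatcher_type = blocking
JOB_QUEUE = default
# MY_RUNTIME is a user-defined ALL_CAPS variable, substituted into "submit" below.
MY_RUNTIME = 24:00:00
submit = qsub -S /bin/bash -sync y -V -q ${JOB_QUEUE} \
  -N ${JOB_ID}        \
  -o "${STDOUT_FILE}" \
  -e "${STDERR_FILE}" \
  -pe smp ${NPROC}    \
  -l h_rt=${MY_RUNTIME} \
  "${CMD}"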

bruc commented on May 27, 2024

We're running a variant of SGE, but that's not the point. In our environment, a 'qsub' command can fail for reasons that we do not control and that have nothing to do with PacBio's code. For a large assembly, it is impractical to restart pypeflow every time a qsub command fails. We have solved this problem for our use case by implementing a script aptly named 'qsub_with_retry'. It detects whether a qsub command completed normally, and if not, it tries again, subject to limits that we can control.
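
In outline it is just a wrapper around qsub (a sketch only, not our exact script; the retry count and delay are placeholders):

#!/bin/bash
# qsub_with_retry (sketch): re-submit until qsub exits cleanly, up to a limit.
max_tries=5
for attempt in $(seq 1 "$max_tries"); do
    if qsub "$@"; then
        exit 0   # the scheduler accepted the job
    fi
    echo "qsub failed (attempt $attempt of $max_tries); retrying in 60s" >&2
    sleep 60
done
echo "qsub still failing after $max_tries attempts; giving up" >&2
exit 1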

As a hypothetical example, the task that creates the Dazzler database for the raw reads or corrected reads should first check to see if the database already exists in the directory where the database will be created. If it does, it should be deleted first before the FASTA files are loaded into it. If that task were restarted after the database was partially written, we would have two copies of the sequences loaded during the first attempt.

In the case of SMRT Link, restartability is easy to accomplish. Each task writes a small set of files in the task directory, and those files have the same name regardless of the function of the task. Thus, a task can be restarted by deleting all other files in the task directory before rerunning the 'qsub' command that accomplishes the task.

If the files written into a task directory at the beginning of a task for pb-assembly and pypeflow are the same for all tasks, then I can solve this problem for myself. Is that the case?

pb-cdunn commented on May 27, 2024

As a hypothetical example, the task that creates the Dazzler database for the raw reads or corrected reads should first check to see if the database already exists in the directory where the database will be created. If it does, it should be deleted first before the FASTA files are loaded into it. If that task were restarted after the database was partially written, we would have two copies of the sequences loaded during the first attempt.

We already do that:

$ cat 0-rawreads/build/build_db.sh
#!/bin/bash
set -vex
echo "PBFALCON_ERRFILE=$PBFALCON_ERRFILE"
set -o pipefail
rm -f raw_reads.db .raw_reads.* # in case of re-run
...

All steps should be restartable on error. If you find one that is not, please let us know.

If the files written into a task directory at the beginning of a task for pb-assembly and pypeflow are the same for all tasks, then I can solve this problem for myself. Is that the case?

Ok. I see what you're looking for. You want to delete everything yourself (since you don't trust us), aside from the files which you need to keep. So you want to know which files to keep.

You have not mentioned which pwatcher_type you use. (You have not even supplied your .cfg, which would be helpful.) I can explain how it works for pwatcher_type=blocking:

  • pypeflow dumps some files into a run-directory:
    • task.json
    • task.sh
    • run.sh
    • run-XXX.bash (maybe, depending on pwatcher_type)
  • pypeflow calls your "submit" command (e.g. qsub) on pypeFLOW/pwatcher/mains/job_start.sh.
  • job_start.sh will be passed two environment variables by pypeflow:
    • PYPEFLOW_JOB_START_SCRIPT -- the generated run-XXX.bash script
    • PYPEFLOW_JOB_START_TIMEOUT -- a number
  • job_start.sh will wait TIMEOUT seconds for the SCRIPT to exist. Then it will run that script (sketched below).
    • Its purpose is to give qsub a script that definitely exists, job_start.sh. (Generated files might be subject to filesystem latency. Many users have had latency problems with generated scripts.)
  • The run-XXX.bash script will change to the correct run-directory and run run.sh.
    • Its purpose is to change to the run-directory, in case qsub/etc. did not. (Some users need this.)
  • run.sh will run task.sh and touch run.sh.done when finished.
    • Its purpose is the creation of that sentinel file. (This indicates "success", not the finishing of qsub.)
  • task.sh will run python do_task task.json
    • Its purpose is to tell us something about the current machine on error, in case resources are being over-used.
  • do_task.py will wait on input files in task.json, and on output files at the end.
    • This could be written in a different language someday.

(For pwatcher_type=fs_based, things are slightly different. But you can still rely on run.sh, notwithstanding filesystem latency.)

(One reason why pbsmrtpipe is simpler is that it's slow, so it tends to have fewer filesystem latency problems. Another is that it's used by a smaller set of users, so it hasn't encountered as many user problems as we have via GitHub interactions.)
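
Condensed, the start-up logic of job_start.sh is roughly this (a simplified sketch, not the literal file contents):

# Wait out filesystem latency, then hand off to the generated per-task script.
script=${PYPEFLOW_JOB_START_SCRIPT}
timeout=${PYPEFLOW_JOB_START_TIMEOUT}
while [ ! -e "$script" ] && [ "$timeout" -gt 0 ]; do
    sleep 1
    timeout=$((timeout - 1))
done
exec /bin/bash "$script"   # run-XXX.bash: cd to the run-directory and run run.sh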

Here is what we actually pass to submit:

About to submit: Node(0-rawreads/report)
Popen: '/bin/bash -C /localdisk/scratch/cdunn/repo/pypeFLOW/pwatcher/mains/job_start.sh >| /localdisk/scratch/cdunn/repo/FALCON-examples/run/synth0/0-rawreads/report/run-P0_report_19f0d0cd122fac952635bbb0e199e785.bash.stdout 2>| /localdisk/scratch/cdunn/repo/FALCON-examples/run/synth0/0-rawreads/report/run-P0_report_19f0d0cd122fac952635bbb0e199e785.bash.stderr'

With

#submit = bash -c ${JOB_SCRIPT} >| ${JOB_STDOUT} 2>| ${JOB_STDERR}
#submit = bash -c ${JOB_SCRIPT}
submit =  qsub -S /bin/bash -sync y -V -q ${JOB_QUEUE} \
  -N ${JOB_ID}        \
  -o "${STDOUT_FILE}" \
  -e "${STDERR_FILE}" \
  -pe smp ${NPROC}    \
  "${CMD}"

we would pass something like:

About to submit: Node(0-rawreads/report)
Popen: 'qsub -S /bin/bash -sync y -V -q default7 \
-N P0_report_ffbf6f8eb30c62d8c2e227e1625f5a1a        \
-o "/lustre/hpcprod/cdunn/repo/FALCON-examples/run/synth0/0-rawreads/report/run-P0_report_ffbf6f8eb30c62d8c2e227e1625f5a1a.bash.stdout" \
-e "/lustre/hpcprod/cdunn/repo/FALCON-examples/run/synth0/0-rawreads/report/run-P0_report_ffbf6f8eb30c62d8c2e227e1625f5a1a.bash.stderr" \
-pe smp 1    \
"/localdisk/scratch/cdunn/repo/pypeFLOW/pwatcher/mains/job_start.sh"'

Yes, you need only those 4 files. If it's a problem that run-XXX.bash has an unpredictable name, we can change that. Or, you can actually skip that file, instead switching to the run-directory yourself and calling run.sh. So technically, you need only 3 files.

But you should avoid deleting run-XXX.bash.stderr/out. Those are the stderr/stdout of the qsub call itself, as shown above. (pbsmrtpipe also writes both stderr/out and cluster.stderr/out into the run-directory, but there the filenames are known, which helps in your case.)
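
So, if you want to scrub a run-directory yourself before re-submitting, something like the following would do it (a sketch; the directory and the find-based cleanup are only illustrative, not part of pypeflow):

# Keep only the generated task files and the qsub stdout/stderr; delete
# everything else at the top level (including run.sh.done and any partial
# outputs), then re-run the task in place.
cd 0-rawreads/report   # example run-directory
find . -maxdepth 1 -type f \
  ! -name 'task.json' ! -name 'task.sh' ! -name 'run.sh' \
  ! -name 'run-*.bash' ! -name 'run-*.bash.stdout' ! -name 'run-*.bash.stderr' \
  -delete
bash run.sh            # re-runs task.sh and touches run.sh.done on success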

bruc commented on May 27, 2024

Just to avoid creating a negative impression -- I trust all the PacBio developers -- you all do a great job solving the assembly problem with this software. What I don't trust is our Linux cluster. It is highly reliable, but not enough for jobs that require 100,000 job submissions (or more).

I missed the code for creating a Dazzler database, and what you wrote above addresses my issue. I used a bad example -- my apologies.

I'm currently running a job that has about 700 Gbp of Sequel reads, and I'm occasionally restarting jobs -- so far, no problems. Our cluster is having some NFS issues, so I have been losing nodes with my tasks running on them, but every restart has worked (as far as not causing an error exit).
