Hi, would it be possible that the final consensus sequences would be

collect consensus sequences in one folder (porecov issue, closed)

replikation commented on June 25, 2024

collect consensus sequences in one folder

from porecov.

Comments (6)

hoelzer commented on June 25, 2024

I think having a multi-FASTA at 2.Genomes/all-consensus-sequences.fasta that combines all the single FASTAs in that folder into one file should do the job. Of course, this might still include reconstructed consensus sequences that fail a later QC, but this can be checked in the report.
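
As a rough illustration of that idea, here is a minimal Python sketch that fuses the per-sample files into one multi-FASTA. It is not part of porecov: the function name `fuse_consensus` is hypothetical, and it assumes the per-sample files sit one level below 2.Genomes and end in `_consensus.fasta`. It does no QC filtering.

```python
from pathlib import Path


def fuse_consensus(genomes_dir, out_name="all-consensus-sequences.fasta"):
    """Concatenate every per-sample *_consensus.fasta into one multi-FASTA.

    Hypothetical helper: assumes the layout
    <genomes_dir>/<sample_name>/<samplename>_consensus.fasta.
    """
    genomes_dir = Path(genomes_dir)
    out_path = genomes_dir / out_name
    # Sort so the record order in the fused file is reproducible.
    singles = sorted(genomes_dir.glob("*/*_consensus.fasta"))
    with out_path.open("w") as out:
        for fasta in singles:
            text = fasta.read_text()
            if not text:
                continue  # skip empty files instead of writing blank lines
            # Guarantee a trailing newline so the next header starts a new line.
            out.write(text if text.endswith("\n") else text + "\n")
    return out_path
```

Calling `fuse_consensus("results/2.Genomes")` would write the fused file next to the sample folders; the `results` directory name is only an example.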

oliverdrechsel commented on June 25, 2024

I personally would object to multi-FASTA files, as they expect that all sequencing data are delivered to the same target folder.
They are much harder to split (recreating meaningful names) than single files are to fuse.
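
To make the naming problem concrete, here is a minimal Python sketch of the splitting direction. It is an illustration only, not porecov code: `split_multifasta` is a hypothetical helper, and it assumes the first whitespace-delimited token of each header is a usable sample name, which is exactly the information that has to be recreated.

```python
import re
from pathlib import Path


def split_multifasta(multi_fasta, out_dir):
    """Write each record of a multi-FASTA to its own file.

    Assumption: the first token of the header line is a meaningful name.
    Records that share a name would overwrite each other.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    handle = None
    try:
        with Path(multi_fasta).open() as src:
            for line in src:
                if line.startswith(">"):
                    if handle is not None:
                        handle.close()
                    tokens = line[1:].split()
                    token = tokens[0] if tokens else "unnamed"
                    # Replace characters that are awkward in file names.
                    safe = re.sub(r"[^A-Za-z0-9._-]", "_", token)
                    path = out_dir / f"{safe}.fasta"
                    handle = path.open("w")
                    written.append(path)
                if handle is not None:
                    handle.write(line)
    finally:
        if handle is not None:
            handle.close()
    return written
```

Fusing, by contrast, needs no knowledge of the headers at all, which supports the point that single files are the easier starting format.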

replikation commented on June 25, 2024

Hi @oliverdrechsel

Each sample's consensus FASTA file is located in this folder:

./<outputdirectory>/2.Genomes/<sample_name>/<samplename>_consensus.fasta

<outputdirectory> can be changed via the --output flag. The sample name is usually "barcode01" etc. if you start from basecalling.

Multi-FASTA files (with QC passing) are only collected via the optional --rki flag. Maybe I misunderstood the question?

hoelzer commented on June 25, 2024

> i personally would object multi fasta files as they expect that all sequencing data are delivered to the same target folder.
> They are much harder to split (recreating meaningful names) than single files are to fuse.

Ah, okay, now I get what you mean, @oliverdrechsel. You just want to have all the single FASTAs (one per sample) in one single output folder, right? Instead of sub-folders as described by @replikation:

./<outputdirectory>/2.Genomes/<sample_name>/<samplename>_consensus.fasta

so something like:

./<outputdirectory>/2.Genomes/all/<samplename>_consensus.fasta

It's a minor thing, but maybe we can simply also publish the FASTAs to

./<outputdirectory>/2.Genomes/all_consensus/<samplename>_consensus.fasta

Thus, we would have the per-sample folder structure to check for details (VCF, BAM, FASTA, PDF, ...) and another folder that just has all the FASTAs.
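
Until something like this exists in the pipeline, the same layout can be produced after a run. The following is a minimal Python sketch under stated assumptions: `collect_consensus` is a hypothetical helper, not a porecov feature, and it assumes the per-sample files match 2.Genomes/<sample_name>/<samplename>_consensus.fasta as described above.

```python
import shutil
from pathlib import Path


def collect_consensus(output_dir, folder_name="all_consensus"):
    """Copy every per-sample consensus FASTA into one flat folder.

    The per-sample folders are left untouched; files are copied, not moved.
    """
    genomes = Path(output_dir) / "2.Genomes"
    target = genomes / folder_name
    target.mkdir(parents=True, exist_ok=True)
    copied = []
    for fasta in sorted(genomes.glob("*/*_consensus.fasta")):
        if fasta.parent == target:
            continue  # do not re-collect files from an earlier run
        dest = target / fasta.name
        shutil.copy2(fasta, dest)
        copied.append(dest)
    return copied
```

Copying rather than moving keeps the per-sample folders complete for detailed checks, at the cost of storing each FASTA twice.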

Or do you want an additional output folder for all the consensus sequences that is outside of the

./<outputdirectory>/

structure? This would need an additional parameter, e.g.

--output_consensus /some/other/path/to/write/all/consensus/fasta

oliverdrechsel commented on June 25, 2024

Hi @hoelzer
Something like ./<outputdirectory>/2.Genomes/all/<samplename>_consensus.fasta would be fine, I think.
It would be easier to distribute the data somewhere else if one just has to visit one folder and not iterate through various folders to get all output data.

hoelzer commented on June 25, 2024

> Hi @hoelzer
> something like ./<outputdirectory>/2.Genomes/all/<samplename>_consensus.fasta would be fine, i think.
> It would be easier to distribute the data to somewhere else, if one just has to visit one folder and not iterate through various folders to get all output data.

Okay, I think this should be doable with, e.g., an optional --collect <outputdirectory> flag.
