GithubHelp home page GithubHelp logo

Comments (6)

hoelzer avatar hoelzer commented on June 25, 2024

I think having a multi FASTA at 2.Genomes/all-consensus-sequences.fasta that combines all the single FASTAs in that folder into one file should do the job? Of course, this might then still include reconstructed consensus that fail a later QC: but this can be checked in the report

from porecov.

oliverdrechsel avatar oliverdrechsel commented on June 25, 2024

i personally would object multi fasta files as they expect that all sequencing data are delivered to the same target folder.
They are much harder to split (recreating meaningful names) than single files are to fuse.

from porecov.

replikation avatar replikation commented on June 25, 2024

hi @oliverdrechsel

each samples consensus fasta file is located in this folder:

./<outputdirectory>/2.Genomes/<sample_name>/<samplename>_consensus.fasta

<outputdirectory> can be changed via --output flag. samplename is usually "barcode01" etc. if you start from basecalling

Multifastafiles (with QC passing) are only collected via the optional --rki flag. maybe I misunderstood the question?

from porecov.

hoelzer avatar hoelzer commented on June 25, 2024

i personally would object multi fasta files as they expect that all sequencing data are delivered to the same target folder.
They are much harder to split (recreating meaningful names) than single files are to fuse.

Ah, okay now I get what you mean @oliverdrechsel . You just want to have all the single FASTAs (one per sample) in one single output folder, right? Instead of sub-folders like described by @replikation:

./<outputdirectory>/2.Genomes/<sample_name>/<samplename>_consensus.fasta

so something like:

./<outputdirectory>/2.Genomes/all/<samplename>_consensus.fasta

?

It's a minor thing but maybe we can simply publish the FASTAs also to

./<outputdirectory>/2.Genomes/all_consensus/<samplename>_consensus.fasta

Thus, we would have the per-sample folder structure to check for details (VCF, BAM, FASTA, PDF, ...) and another folder that just has all the FASTAs.

or do you want an additional output folder for all the consensuses that is outside of the

./<outputdirectory>/

structure? This would need an additional parameter e.g.

--output_consensus /some/other/path/tp/write/all/consensus/fasta

from porecov.

oliverdrechsel avatar oliverdrechsel commented on June 25, 2024

Hi @hoelzer
something like ./<outputdirectory>/2.Genomes/all/<samplename>_consensus.fasta would be fine, i think.
It would be easier to distribute the data to somewhere else, if one just has to visit one folder and not iterate through various folders to get all output data.

from porecov.

hoelzer avatar hoelzer commented on June 25, 2024

Hi @hoelzer
something like ./<outputdirectory>/2.Genomes/all/<samplename>_consensus.fasta would be fine, i think.
It would be easier to distribute the data to somewhere else, if one just has to visit one folder and not iterate through various folders to get all output data.

Okay, I think this should be doable with e.g. an optional --collect <outputdirectory> flag

from porecov.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.