GithubHelp home page GithubHelp logo

diegobrambilla / 16s_normalization Goto Github PK

View Code? Open in Web Editor NEW

This project forked from cnr-irsa-meg/16s_normalization

0.0 0.0 3.0 452 KB

Script for the normalization of relative abundance out of DIAMOND default output (BLOSUM62 matrix) through a formula that relies on the 16S marker gene.

R 100.00%

16s_normalization's People

Watchers

 avatar

16s_normalization's Issues

sum normalized counts per gene name

Start from the test case.
The code calculates the normalized count for each gene accession number.
Note that one gene name is typically associated to more than one gene accession number.

The aim is to sum the normalized count of rows that have the same gene name.

example:
from

db_ID gene_name nomalized_count
1 gene 1
2 gene 0.5

to

gene_name nomalized_count
gene 1.5

introduce patterns to generalize input files and sample names

The test script refers to specific file names and specific sample names.
I would like to create a main script for a more general purpose, so that it can process different input files of the same type (from several samples for instance) at once.

Maybe by generalizing input file names through patterns?
Also, introducing for loops to process several tables from different samples into the same , single pivot table?
How about automatization: include flags to allow users to provide file names and sample names?

creating pivot table of normalized count

Once #4 will be closed, the main version of the 16S normalization script can be created.

The new script (provisional name: 16S_normalization.Rmd) is expected to calculate the normalization for all samples and report results in a pivot table.

The layout of the pivot table should be something like this:

sample1 sample2 sample3
gene1
gene2
gene3

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.