GithubHelp home page GithubHelp logo

PCoA plot of all samples about emp HOT 3 CLOSED

biocore avatar biocore commented on August 25, 2024
PCoA plot of all samples

from emp.

Comments (3)

wasade avatar wasade commented on August 25, 2024

AFAIK we should be okay. The full QIIMEdb merges with new OTUs took ~25gb when I was monitoring, and should have been submitted to a 64GB queue regardless. Greg, do you have approx sample and obs counts there?

from emp.

gregcaporaso avatar gregcaporaso commented on August 25, 2024

@wasade: approximate number of samples is 5-10k (still working out exactly what will be included). approximate number of otus is a little harder: we're currently about 2/3 through OTU picking on the first iteration of data (~4.5k samples) and have around 2 million OTUs, with singletons excluded. So, maybe 3m OTUs overall?

from emp.

cuttlefishh avatar cuttlefishh commented on August 25, 2024

My how time as passed. As of Dec 2015, @wasade has figured out that whole biom-format thing and is working on faster UniFrac. @ElDeveloper and @mortonjt are working on faster Emperor.

Recent update from @wasade on faster UniFrac with smaller biom tables (optimization with large biom tables continues):

We can compute unweighted unifrac on 1000 samples 
(AG, random sample of 1k rarefied set, against gg 97%) 
in about 30s on a laptop. 50s if you account for parse of 
the input tree and table prior to bdiv. Peak total memory, 
including threads is <1GB.

from emp.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.