Comments (3)
Hey @lucaz88,
The clustering of genomes in the pangenomic workflow takes place here in panops.py
:
(...)
for clustering_tuple in [('gene_cluster presence absence', self.view_data),
('gene_cluster frequencies', self.view_data_presence_absence)]:
v, d = clustering_tuple
newick = clustering.get_newick_tree_data_for_dict(d,
transpose=True,
distance = self.distance,
linkage=self.linkage)
(...)
If you follow the function get_newick_tree_data_for_dict
in panops.py
you will find that the default clustering uses 'euclidean' distance with 'ward' linkage on l1-normalized vectors by default, where linkage is calculated like this:
linkage = hierarchy.linkage(vectors, metric=distance, method=linkage)
Your first dendrogram is almost identical to the one in anvi'o (if you rotate things around a bit), and differences are likely due to the normalization step. I hope this helps.
Best wishes,
Meren.
from anvio.
Thanks @meren !! I missed the normalization step! Now it looks identical without the need to do rotate anything 😉
from anvio.
Perfect :) Thank you for reporting back!
from anvio.
Related Issues (20)
- Error running `anvi-reaction-network` HOT 22
- anvio 7.1: anvi-setup-ncbi-cogs --reset HOT 1
- [BUG] anvi-help not working anymore HOT 1
- [BUG] Most annotations from anvi-run-cazymes have undefined ('-') accession numbers HOT 5
- [BUG] anvi-script-augustus-output-to-external-gene-calls 0 gens parsed
- [DISCUSSION] Methylation processing in anvio HOT 2
- [BUG] Search with operators in the interactive interface HOT 1
- [BUG] anvio-cluster-contigs fails with CONCOCT HOT 13
- [BUG] anvi-get metabolic-model-file ImportError: Numba needs NumPy 1.25 or less HOT 2
- [FEATURE REQUEST] Outputs for function- and pathway-level variability analysis HOT 2
- [BUG] Insert a short but descriptive title (leave the '[BUG]' part) HOT 1
- DAStool finishes without errors but output not recgnized by Anvi'o HOT 3
- [BUG] Issue with --pre-computed-inversions HOT 2
- Performing `anvi-self-test` but the interactive operator did not load in Chrome HOT 1
- [FEATURE REQUEST] A command to make new HMM sources from a list of COG IDs. HOT 8
- Problems about Structure Display with anvi-display-structure HOT 7
- [BUG] anvi-interactive crashes when using a collection and a external tree HOT 8
- [BUG] anvi-get-sequences-for-hmm-hits in combination with --gene-names silently removes genomes HOT 7
- anvi-setup-scg-taxonomy and anvi-setup-ncbi-cogs error HOT 2
- [BUG] Genes with multiple annotations overinflate stepwise pathway copy number values HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from anvio.