This page is a companion to our poster at ASHG2020: Density-based clustering from large multi-ethnic biobanks improves risk prediction in large, diverse cohorts. https://www.abstractsonline.com/pp8/#!/9070/presentation/2384.
For an interactive demonstration using 1000 Genomes Project data, see: https://diazale.github.io/popgenclust/1000gp.html
We are currently working on our manuscript for this project -- come back soon for a preprint! For our previous work on dimension reduction of genotype data, see:
- UMAP reveals cryptic population structure and phenotype heterogeneity in large genomic cohorts (2019). Alex Diaz-Papkovich, Luke Anderson-Trocmé, Chief Ben-Eghan, Simon Gravel. PLoS Genetics. https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1008432.
- A review of UMAP in population genetics (2020). Alex Diaz-Papkovich, Luke Anderson-Trocmé, Simon Gravel. Journal of Human Genetics. https://www.nature.com/articles/s10038-020-00851-4.