Comments (3)
However, your abstract "Heterogeneous network link prediction prioritizes disease-associated genes” shows 698 associations extracted from GWAS catalog.
For clarification, there are two main studies we've conducted regarding edge prediction on hetnets. From https://het.io/about/#cite:
Hetionet v1.0 was created as part of Project Rephetio, i.e. Systematic integration of biomedical knowledge prioritizes drugs for repurposing. So you can mostly ignore "Heterogeneous network link prediction prioritizes disease-associated genes" and just focus on Hetionet v1.0.
I saw that Hetionet combines data from 29 databases, which does not include GWAS database.
GWAS data does make it into Hetionet. Copying from the methods:
Disease--associates--Gene edges were extracted from the GWAS Catalog [130], DISEASES [131,132], DisGeNET [133,134], and DOAF [135,136]. The GWAS Catalog compiles disease--SNP associations from published GWAS [137]. We aggregated overlapping loci associated with each disease and identified the mode reported gene for each high confidence locus [138,139]. DISEASES integrates evidence of association from text mining, curated catalogs, and experimental data [140]. Associations from DISEASES with integrated scores ≥ 2 were included after removing the contribution of DistiLD. DisGeNET integrates evidence from over 10 sources and reports a single score for each association [141,142]. Associations with scores ≥ 0.06 were included. DOAF mines Entrez Gene GeneRIFs (textual annotations of gene function) for disease mentions [143]. Associations with 3 or more supporting GeneRIFs were included.
The most important supplemental discussion on how we processed the GWAS catalog is at https://doi.org/10.15363/thinklab.d80.
I am specially looking for detecting relation ships between clinical symptoms to disease to genes
Cool. One place to start would be putting a symptom into https://het.io/search/ and then subsetting to genes for the target node:
This is a more manual exploratory approach. There are also more automated high-throughput approaches you could do with some scripting / programming.
from hetionet.
Thank you very much Daniel. I would like to learn more about automated high-throughput approaches for querying the database. Can you guide me to it?
Also, in the search, I added breast cancer in source node, but the target nodes do not show up common predisposition genes like brca1 and brca2 etc. Why is that?
Best,
Nilesh
from hetionet.
I added breast cancer in source node, but the target nodes do not show up common predisposition genes like brca1 and brca2 etc. Why is that?
The approach does find many types of paths that occur more than expected by chance between breast cancer and BRCA1 and BRCA2. So I think the question is more why don't BRCA1 and BRCA2 show up as the top result for breast cancer:
One reason is that the metric we're ranking by is simplistic. The search result ranking is by number of significant types of paths. It does not take into account how significant those types of paths are. That being said, the top result of MYC has 19 significant types of paths (metapaths), while BRCA1 has 12... so it's not that far from the top.
Will make another comment to address the rest of your questions.
from hetionet.
Related Issues (20)
- Local files HOT 2
- Multiple Match Queries Not Working HOT 2
- Question About Hetionet's Dictionary HOT 3
- How to add new disease and anatomy nodes HOT 2
- Providing a dump version of Hetionet HOT 11
- http://neo4j.het.io/browser/ time out HOT 4
- Neo4J instance down (?) HOT 7
- Updated TSV version HOT 6
- graph.db database offline in neo4j HOT 3
- neo4j website down HOT 6
- Hetionet Browser is down HOT 4
- Mapping to original databases HOT 2
- Cannot map non-existing file HOT 5
- Do any relations imply another relation? HOT 1
- Connectivity Search Automated Query Question HOT 8
- Docker compatibility question HOT 4
- Question on metrics HOT 1
- What does it mean if something up regulates a disease in this context? HOT 3
- Speeding up data import to Neo4j v5 and CSV format data HOT 2
- Inquiry about metapaths from 2017 Paper "Systematic Integration of Biomedical Knowledge Prioritizes Drugs for Repurposing" HOT 7
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from hetionet.