GithubHelp home page GithubHelp logo

xjtushilei / pdd_data_set Goto Github PK

View Code? Open in Web Editor NEW
69.0 6.0 22.0 522 KB

A Patient Disease Drug Graph. 一个权威的医疗 RDF 数据集, 关于医疗知识图谱。

Home Page: http://pdd.wangmengsd.com/

Java 28.56% HTML 71.44%
linked-data graph-database sparql-query sparql-endpoints icu patients drugs emr graph

pdd_data_set's Introduction

HomePage

http://pdd.wangmengsd.com/

Patient Disease Drug Graph

This is the display project of the drug database. The use of spring-boot made url redirect, the front end of the use of thymeleaf as MVC model to pass the data receiver.

Introduction to data set

The data set is rdf data set, on the physical therapy, which has a diagnosis, medication, etc., web online display only shows a part of the data, download the nt format data, you can get a complete data set.

Using these data sets, you can perform sparql queries, perform entity relationship identification, perform medical data mining, and so on.

So what is important is this data set ,not this project!

What is PDD Graph

What is PDD Graph (Patient-Disease-Drug Graph):

Electronic medical records contain multi-format electronic medical data that consist of an abundance of medical knowledge. Facing with patients symptoms, experienced caregivers make right medical decisions based on their professional knowledge that accurately grasps relationships between symptoms, diagnosis, and treatments. We aim to capture these relationships by constructing a large and high-quality heterogeneous graph linking patients, diseases, and drugs (PDD) in EMRs.

Specifically, we extract important medical entities from MIMIC-III (Medical Information Mart for Intensive Care III) and automatically link them with the existing biomedical knowledge graphs, including ICD-9 ontology and DrugBank. The PDD graph presented is accessible on the Web via the SPARQL endpoint, and provides a pathway for medical discovery and applications, such as effective treatment recommendations.

A subgraph of PDD is illustrated in the followng figure to betterunderstand the PDD graph.

Download

Home page to konw how to download

Server bandwidth is limited, it is recommended to Datahub download.

Update

  • V1.3

    We have attached DDI triples in the latest version 1.3. These DDI triples are extracted from DrugBank and will be applied to conveniently retrieve the possible adverse drug combinations taken by corresponding patients.

    A specific example please refer to the Tutorial SPARQL Query Example5

  • V1.2

    Fix the bugs in "diagnose_icd_information.nt".

    In the new version, we have eliminated an engineering bug that was made when label matching of ICD-9 codes. This bug results in the linking failures of 380 diseases in MIMIC-III.

    For diseases in the latest PDD version, the overall number of diseases is 6985, and 6,983 diseases are connected to ICD-9 ontology. The only two failed matching codes are '71970' and 'NULL', which are not included in ICD-9 ontology.

  • V1.1

    Add Patient BMI data.

This Project Technology

not data set

  • spring-boot
  • thymeleaf
  • jquery

Example

When you want to query out an entity, click directly to see what the entity is. E.g:

In Patient-Disease-Drug Data Set,Can be online inquiries, you can see the data as follows:

数据集在线查询展示

When we click on one of the data,E.g:http://kmap.xjtudlc.com/pdd_data/resource/145834 , you can see what the current system shows:

该系统展示

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

pdd_data_set's People

Contributors

xjtushilei avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

pdd_data_set's Issues

求数据集

那个HomePage打不开,可以再提供一下数据集么?非常感谢!

你好,是不是少了.ttl本体文件

你好,初学用 apache-jena-fuseki-3.13.1 配置 , 是不是少了本体文件 .ttl, 查询都是空的, 还是不需要ttl就可以 ???

网络上也找不到有人使用pdd做数据集分析的例子,只好这里问。

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.