GithubHelp home page GithubHelp logo

clinical-decision-support-system's Introduction

Clinical-decision-support-system

Clinical-decision-support-system


Author Chris Zhang
E-mail [email protected]

content

dataanalysis

introducation:

  1. The goal of the demo is to build a clinical diagnosis prediction model which can be used by Hospital to identify patients with diseases during early diagnosis stage.The clinical diagnosis prediction model will be built on historical patients’ records, and later can be used to predict illness base on new patients’ conditions.
  2. the patients records contains Chief Complaint, History of Current Illness and History of Past illness, so the first step is to transfor the words to vector and input with model
  3. the goal of building model is to create 14 binary classifiers, 8 for body part and 6 for symptom, and the whole 14 classifiers results can be referenced by doctor

distribution:

  1. Attribute: 1
  2. age and gender distribution: 2
  3. label : 3 4
  4. body part: 5 6 7 5.symptom: 8 9 10
  5. the sentence length distribution: 11

dataprocessing

Obviously, the distribution of positive and negative samples is extremely unbalance,so,the first step we should solve the unbalanced data, there are two options:undersample and oversample, oversample based on the final inputs, but our sample will under NLP(make sample into vectors) before input in the neural networks, so , it is difficult for us to do oversample, and I choose undersample, but if we do undersample based on 922 samples, the negative samples only 70 in total, how about we do data enhancement before undersample. Let’s look the sample in detail:

data enhancement:

1.Chief complaint: X;Current medical history:Y;Past history:Z
2.Whether using Word embedding or character embedding,Before we do vectorization, we must convert each word into its own index number, so changing the order of the three parts of the statement will make the result of the vectorization completely different. I call these step as data enhancement
3.+1,Chief complaint: X;Past history:Z;Current medical history:Y
4.+1,Current medical history:Y;Chief complaint: X;Past history:Z
5.+1,Current medical history:Y;Past history:Z;Chief complaint: X
6.+1,Past history:Z;Chief complaint: X;Current medical history:Y
7.+1,Past history:Z;Current medical history:Y;Chief complaint: X

and now ,after data enhancement, we have 922*6 samples in total, then, we can do undersample.

tensorflow_and_keras

Embedding methods:

1.Rand: Randomly initialize the word vector of each word  and adjust through the later  training.For example: character embedding (TensorFlow),embedding layer (Keras)                        
3.Static:Use pre-trained word vectors, such as word vectors trained by word2vec, will not adjust the word vector during training. For example: Word embedding(gensim)
5.Genism:Word---index---vector(trained)
6.Sklearn:Word---index---tf-idf(fixed)
7.Keras :  Word ---- index----vector(random)

design CNN + LSTM:

12 results: 13 14

design textCNN:

The CNN filter’s width should be the same as vector’s dimension,however the height can be as 3,4,5 , I think the most important point is because each row of vector we input represents a word, and in the process of nature language process, the word is the minimum granularity of the text. Compared computer version, pixel is is the minimum granularity of the picture results: 15 16 17

BERT

used in IBM model 1

web_demo

input the basic information(Chief Complaint, History of Current Illness and History of Past illness)

18

click the prediction button

19

get the prediction results for doctor

20

clinical-decision-support-system's People

Contributors

chriszhangmw avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

clinical-decision-support-system's Issues

FileNotFoundError: [Errno 2] No such file or directory: 'D:/Eclipse_workplace/web_tensorflow/cut_sentence_corpus.txt'

I have not this files.How to get this files?

(Clinical) D:\lab\LABprogram\Clinical-decision-support-system-master\web_demo\working>python test.py
2021-01-26 15:25:47.402749: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library cudart64_100.dll
Traceback (most recent call last):
File "test.py", line 7, in
from model_run import first_action, record_sample
File "D:\lab\LABprogram\Clinical-decision-support-system-master\web_demo\working\model_run.py", line 7, in
from sentence_cut import cut_non
File "D:\lab\LABprogram\Clinical-decision-support-system-master\web_demo\working\sentence_cut.py", line 11, in
with open(path1,'r',encoding='utf-8') as v:
FileNotFoundError: [Errno 2] No such file or directory: 'D:/Eclipse_workplace/web_tensorflow/cut_sentence_corpus.txt'

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.