GithubHelp home page GithubHelp logo

mithilesh1609 / unwrapping-the-covid-19-genome Goto Github PK

View Code? Open in Web Editor NEW
1.0 1.0 0.0 626 KB

This is a python implementation of understanding the genome of covid-19 and compare it with previously well known disease's genome

License: MIT License

Jupyter Notebook 100.00%
covid-19 genome-analysis python3 numpy pandas regex genomics-visualization machine-learning deep-learning clustal

unwrapping-the-covid-19-genome's Introduction

Unwrapping-the-COVID-19-genome

  • Here we will try to answering some of the questions of bio-informatics in regards to the recent panademic outbreak of Corona virus(COVID-19), from a non-medical background person's perspective. Every analysis presented here is based on the stuff I learnt and understood (or felt that I actually understood) about the genomes from the internet.

How to run?

  • I have used kaggle kernal for running the code, you can use that or jupyter notebook or any editor like pycharm or atom or visual studio code.
  • For kaggle just create new notbook and copy the given Unwrapping the COVID-19 genome.ipynb file and Add data there from links given in dataset.md file, then select GPU instaed of null in preference section for fast execuastion.

Acknowledgement

I am very thankful to Allen Institute For AI for providing wonderful dataset.

Conclusions

  • Based on the oligonucleotide compositions, all the genomes from coronaviridae family show similar composition (trinucleotide and tetranucleotide composition)
  • Based on GC content, the composition of Hedgehog infected Corona virus is closely similar to COVID-19 genome's GC content.
  • Judging by the number of protein strands, the count in COVID19's genome is very close to those found in Bat infected with SARS like corona virus.
  • If you filter down some of the categories in the plotly-plot of distribution of amino acids, you'll see the percentage of amino acids in COVID-19's genome closely match to those of MERS, Civet infected with SARS like CoV, Camel with alpha CoV and Malaria except for some and low high contents among L (Leucine), P (Proline) and T (Threonine) amino acids.
  • Checking by the generated Phylogenetic tree, the genomes of COVID-19 and malaria virus show the maximum similarity. Perhaps this also explains why there's a lot of buzz on the news and internet about trying Hydroxycholorquine and anti-retroviral drugs which happen to be the medicines for malaria and HIV viruses, respectively.
  • Of all the genomes of COVID-19 infected patients avaialable on Kaggle, some patients show similar traits when clustered based on their genome sequence similarity.I really don't know how helpful this would be but if something works or had worked for a patient, then same should also work for patients in the same cluster.

Image of Hylogenetic Tree

unwrapping-the-covid-19-genome's People

Contributors

imgbotapp avatar mithilesh1609 avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.