GithubHelp home page GithubHelp logo

cgga's Introduction

Cancer Subtype Identification by Consensus Guided Graph AutoEncoders(CGGA)

😃Update

We have uploaded the processed cancer datasets under the cancer_datasets/ directory. Each cancer dataset consists of three omics and could be loaded into the Matlab directly.

Method Description

CGGA is a computational framework that can effectively and reliably uncover cancer subtypes. It mainly consists of two steps. First, for each omic, a new feature matrix is learned by using graph autoencoders, which can incorporate both structure information and node features during the learning process. Second, a set of omic-specific similarity matrices as well as a consensus matrix is learned based on the features obtained in the first step. The learned omic-specific similarity matrices are then fed back to the graph autoencoders to guide the feature learning. By iterating the two steps above, our method obtains a final consensus similarity matrix for cancer subtyping.

Requirements

>= MATLAB 2014b.

Usage

To run our algorithm, please load the script 'CGGA.m' into your MATLAB programming environment and click 'run'. Users can also run the script in standard command-line mode, where you should input the following commands for each function, respectively:

matlab -nodisplay -nodesktop -nosplash -r "CGGA;exit;"

All the cancer datasets used in the code can be directly downloaded at http://acgt.cs.tau.ac.il/multi_omic_benchmark/download.html.

Parameters

There are three parameters in our algorithm that users can tune according to their own needs, i.e. lambda, the number of neighbors k and the number of layers in CGGA. The default values for lambda and k are fixed to 1e-5 and 15, respectively. The number of layers in CGGA is set to 2 and user can specify a larger value to construct a deeper graph autoencoder.

Input and Output Directories

To change the input file directory, please refer to the 'dataDir' variable in the processTCGAdata.m. For output file directory, please refer to the 'outDir' variable in the same script.

Contact

For any questions regarding our work, please feel free to contact us: [email protected].

Cite Our Paper

Cheng Liang, Mingchao Shang, Jiawei Luo. Cancer subtype identification by consensus guided graph autoencoders. Bioinformatics, 2021, doi: 10.1093/bioinformatics/btab535.

cgga's People

Contributors

alcs417 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.