GithubHelp home page GithubHelp logo

piano's Introduction

PIANO

Graph masked self-distillation learning for prediction of mutation impact on protein-protein interactions.

This source code is tested with Python3.8 on Ubuntu20.04.

Step 1: Clone the GitHub repository

git clone https://github.com/MLMIP/PIANO.git
cd PIANO

Step 2: Build required dependencies

Before installing the relevant dependency packages, please make sure Anaconda3 has been installed. If not, please click here to install it.

source install.sh

Executing the above command will automatically install the Anaconda virtual environment. Upon completion, a virtual environment named "piano" will be created. In addition, the model weights can be obtained through Zenodo-PIANO. After downloading, place it in piano/Data/model_params.

Step 3: Download required software

The download links for various software packages are provided below. After downloading, you can install it directly according to the official tutorial.

PSI-BLAST

PSI-BLAST Database:Uniref90

HHblits

HHblits Database:Uniref30

NACCESS

MSMS

Please install these software in the piano/Software. In addition, mkdssp needs to be granted certain permissions, which can be operated by executing the following command:

chmod a+x mkdssp

Step 4: Running PIANO

Activate the installed piano virtual environment and ensure that the current working directory is PIANO.

conda activate piano

If you want to obtain the prediction results of a single sample, directly execute the following command on the command line to obtain the prediction results, and the results are saved in the PredictedResults.txt:

python run.py 0 [pdb name] [mut_chain] [wildtype] [mutant] [resid] [partnerA_partnerB]

where the digit 0 signifies that the program performs predictions for individual samples only. [pdb name] is the name of the complex to be predicted, such as 1a4y. [mut_chain] is the name of the mutated chain. [wildtype], [mutant], and [resid] are wild-type amino acid, mutant amino acid, and the mutation position, respectively. [partnerA_partnerB] describes the two interaction partners in the protein complex, such as A_B.

A specific example is:

python run.py 0 1a4y A E A 401 A_B

If you want to perform batch prediction of multiple samples, please first organize the relevant mutation sample data in pred_data.csv. The input format is consistent with the data format within an individual sample. After the completion of filling in pred_data.csv, execute the following command in the command line to obtain the prediction results. The results are saved in the PredictedResults.txt:

python run.py 1

where the digit 1 indicates that the program conducts predictions for multiple samples.

piano's People

Contributors

mlmip avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.