Light

salesforce / genhance Goto Github PK

View Code? Open in Web Editor NEW

31.0 6.0 11.0 4.71 MB

License: BSD 3-Clause "New" or "Revised" License

Jupyter Notebook 73.91% Python 26.05% Shell 0.04%

genhance's Introduction

GENhance

Deep Extrapolation for Attribute-Enhanced Generation

Objective: Generate sequences, in natural language and proteins, that go beyond the label training distribution

Check out our paper here!

Data

ACE2 250K sequences with FoldX ddG values: gs://sfr-amadani-conference-data/genhance/ACE2_subdomain_ddG_data.tar.gz

SST-5 data splits: gs://sfr-amadani-conference-data/genhance/SST5_data.tar.gz

Models

GENhance SST-5 (leave all positives out) - gs://sfr-amadani-conference-data/genhance/GENhance_model_SST5_34.tar.gz

GENhance SST-5 (keep 200 positives) - gs://sfr-amadani-conference-data/genhance/GENhance_model_SST5_4.tar.gz

GENhance ACE2 subdomain - gs://sfr-amadani-conference-data/genhance/GENhance_model_ACE2.tar.gz

Code overview

ACE/: code and data for ACE2 experiments
SST5/: code and data for SST5 experiments

Requirements

If running on A100s, needs PyTorch 1.6 or 1.7 (with CUDA 11), requires >= 2x A100 GPUs to run
tape proteins (tries to downgrade to PyTorch 1.4, so with pip install --no-dependencies tape_proteins)
huggingface transformers

genhance's People

Contributors

Stargazers

Watchers

Forkers

liaopeiyuan animesh stjordanis ipark2021 codeaudit mawright tpan1039-ui isabella232 ggchen1997 dongcf

genhance's Issues

Code to reproduce CbAS results on ACE2

Do you have code to reproduce the CbAS results in your paper? Thanks!

not found transformers_custom?

Hi，when I run this repo and transformers_custom not found? thanks

Dependency Problem

It raise an import error in file /genhance/ACE2/train_baseline_generator.py line 9
from progeny_tokenizer import TAPETokenizer

There is no project progeny_tokenizer package in the project, how can we fix it?

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.

Jobs