GithubHelp home page GithubHelp logo

gkild / scrna.seq.datasets Goto Github PK

View Code? Open in Web Editor NEW

This project forked from hemberg-lab/scrna.seq.datasets

0.0 0.0 0.0 772 KB

Collection of public scRNA-Seq datasets used by our group

Home Page: https://hemberg-lab.github.io/scRNA.seq.datasets/

License: GNU General Public License v3.0

R 67.41% Shell 28.30% Perl 4.29%

scrna.seq.datasets's Introduction

Public scRNA-Seq Datasets

This repository is used to build scater objects and reports (in continuous integration manner) for various publicly available scRNA-Seq datasets used by our group. This pipeline is implemented using docker containers and cloud computing. The resulting website is available here. Below are some notes on the pipeline setup.

Website

The website is generated using MkDocs generator. Links to S3 storage and data annotations are added manually. If you are creating a pull request and adding new data please add its annotations to one of the files in the website folder.

S3 storage

To list files on the S3 storage please use this link.

Instance setup

AWS

  1. Launch Amazon Linux EC2 instance.
  2. Using security groups add access to the instance on port 8080.
  3. Connect to instance and install Jenkins.
  4. Add permission for Jenkins to run Docker:
sudo usermod -aG docker jenkins

Hard reboot your instance after that. Now Jenkins can run docker images.

  1. Install s3cmd utility to be able to upload data to the S3 storage:
sudo apt-get install s3cmd
  1. In Jenkins Export S3 key ID, secret key and region as environmental variables. Use secret text option provided by Jenkins. Some details available here.

  2. File listing can be setup on AWS S3 bucket using this plugin.

OpenStack Cloud (Sanger)

  1. Launch Ubuntu Trusty instance (m1.medium flavour)
  2. Add the instance to the default, cloudforms_icmp_in, cloudforms_ssh_in cloudforms_web_in security groups.
  3. Create additional security group: TCP with port 8080 (this is needed for Jenkins) and add your instance to this group.
  4. Associate a floating IP (FLOATING_IP) number with your instance.
  5. Login to instance:
ssh -i ~/.ssh/your_key.pem ubuntu@FLOATING_IP

Then become root using this command:

sudo su -
  1. In the instance install openjdk-7-jdk:
sudo apt-get update
sudo apt-get install openjdk-7-jdk
  1. In the instance install Jenkins.

To setup Jenkins after installation go to http://FLOATING_IP:8080 (this is only accessible via Sanger wired network).

  1. In the instance install docker.

  2. Add permission for Jenkins to run Docker:

sudo usermod -aG docker jenkins

Hard reboot your instance after that. Now Jenkins can run docker images.

  1. Install s3cmd utility to be able to upload data to the S3 storage:
sudo apt-get install s3cmd
  1. In Jenkins Export S3 key ID, secret key and region as environmental variables. Use secret text option provided by Jenkins. Some details available here.

Jenkins build

# build and deploy
sh deploy.sh $WORKSPACE

AWS Calculator

To calculate how much you can spend on AWS one can us the AWS Calculator.

scrna.seq.datasets's People

Contributors

wikiselev avatar tallulandrews avatar gkild avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.