GithubHelp home page GithubHelp logo

ml-tutorial-codemash2019's Introduction

ML-Tutorial-Codemash2019

Pre-requisites

To maximize your tutorial time, please use the following steps to setup your environment on the cloud (preferrably) or on your local machine before arriving on the day.

Setting up your working environment

For the Lab 1-3 we would require access to Anaconda environment with Python 3.7. As well as access to the editor, such as PyCharm (Download Community Edition from JetBrains website)

NOTE: That environment comes by default via an Azure Data Science Virtual Machine (a cloud virtual machine with a prepared data science environment) as described below in the Option 1, or you can install it on your local laptop as described in the Option 2. Please prepare your own environment if you are using any other cloud provider.

NOTE: For Lab 4 (Deep Learning) it's advised to setup a cloud environemnt (Azure, AWS, GCP) with Keras framework installed. For the purposes of the tutorial, we do not require access to GPU.

Option 1. Microsoft Azure subscription

  1. Sign-in or create a new Microsoft Account if you don't have one by using Microsoft Azure sign-up page.

NOTE: Don't use your corporate email address if you cannot use your corporate Azure subscription for the tutorial. Also, please create a different account if you have used an Azure Trial offer against an existing Microsoft account previously.

Task 1. Create an Azure Data Science Virtual Machine
  1. Go to the Azure Portal at https://portal.azure.com

  2. Click + (or + Create a resource ) and search for "Data Science virtual machine"

    Screenshot of the Create a new virtual machine.

  3. Select one of the options and click "Create" at the bottom of the page. [Note] The steps below are for "Data Science Virtual Machine for Linux (CentOS)".

  4. [Optional] If you don't have existing Azure account, create one by clicking "Start free". That will open a separate page to create a trial azure account. Please follow the steps required and then return and refresh the Azure portal (Step 1 above)

    Azure Trial.

  5. On the Basics blade provide the following inputs:

    a. Resource group: click "Create new" and type "rgCodemash"

    b. Name: vmCodemash

    c. Region: select either "South Central US" or "East US".

    d. Size: click "change size" and on "Select a VM size" blade click "Clear all filters" and then search for "D12_v2".

    e. Authentication type: choose "Password"

    f. User name: demouser

    g. Password / Confirm password: provide yours (but make sure you remember it, as you'll need it)

NOTE: Depending on your type of Azure subscription, you may want to choose a large size VM, or a GPU-enabled machine, though this is not strictly required for the workshop. There are limitations about the number of cores that could be requested by default on a Trial subscription.

  1. Click Review + create button for your VM to be provisioned.

  2. Once validation scripts passed successfully, click Create button

    Screenshot of the Create blade.

  3. The VM should take a few minutes to provision

Task 2. Verify access to Jupyter Hub on the Data Science VM
  1. When the VM is ready, you should see a notification. Select Go to resource to view the deployed VM in the Portal.

    The notification alert will show the message about a successful deployment.

  2. On the blade for the VM copy **Public IP address" for your machine.

    Screenshot of the Public IP address.

  3. Go to the browser and type https://(your VM's ip address):8000 to sign-in to Jupyter Hub

  4. You might see a pop-up with a warning for an invalid certificate Click Continue anyway in that case: Screenshot showing certificate warning.

  5. In the sign in window provide your username and password as provided during the VM Provisioning steps

    Screenshot of Juputer Hub - Sign In.

  6. Confirm that you can see the root directory in the Jupyter hub

Screenshot of Juputer Hub - Success.

Task 3: Stop the VM

If you are performing this setup a day before the tutorial, you can optionally Stop the VM to save on costs overnight, and resume it when you are ready to start on the day. Follow these steps to Stop the VM:

  1. Return to the Azure Portal

  2. Navigate to the blade of your vmCodemash

  3. Select the Stop button

    Screenshot of the Stop button.

NOTE: When you are ready to re-start the VM, simply follow the previous steps and instead of selecting Stop, select Start. Your VM will take a couple of minutes to start, after which you can connect to Jupyter Hub as above.

Option 2. Use your laptop

NOTE: Depending on your laptop specs, you may have some issues following all steps in Part 4 Deep Learning module. However, for the purposes of the tutorial, we do not require access to GPU.

  1. Install Anaconda environment using the official Downloads page.

  2. The detailed guides on how to install it based on your OS are available here.

  3. Once installation is successful, try running your first Python as described here

  4. For tutorials we expect you to use either an editor, such as PyCharm or Jupyter Notebook environment (as described in step 3)

  5. Create a new virtual environment "codemash":

  • Open the anaconda prompt
  • conda create --name codemash python=3.7
  1. Activate your conda virtual environment:
  • (WINDOWS): activate codemash
  • (LINUX, macOS): source activate codemash
  1. Install scikit-learn / pandas / matplotlib packages in the codemash conda environment:
  • conda install scikit-learn
  • conda install pandas
  • conda install matplotlib
  1. Run PyCharm editor from conda environmnt:
  • cd to PyCharm directory
  • run pycharm64.exe file to start PyCharm editor with all the correct environment variables
  1. [For Lab 4, Deep learning] If you prefer to run Deep Learning lab on your local machine, please find and follow a tutorial that describes setting up Keras on a local environment (one such example is here)

ml-tutorial-codemash2019's People

Contributors

galiya avatar garyshort avatar

Watchers

Jean Lange avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.