To maximize your tutorial time, please use the following steps to setup your environment on the cloud (preferrably) or on your local machine before arriving on the day.
For the Lab 1-3 we would require access to Anaconda environment with Python 3.7. As well as access to the editor, such as PyCharm (Download Community Edition from JetBrains website)
NOTE: That environment comes by default via an Azure Data Science Virtual Machine (a cloud virtual machine with a prepared data science environment) as described below in the Option 1, or you can install it on your local laptop as described in the Option 2. Please prepare your own environment if you are using any other cloud provider.
NOTE: For Lab 4 (Deep Learning) it's advised to setup a cloud environemnt (Azure, AWS, GCP) with Keras framework installed. For the purposes of the tutorial, we do not require access to GPU.
- Sign-in or create a new Microsoft Account if you don't have one by using Microsoft Azure sign-up page.
NOTE: Don't use your corporate email address if you cannot use your corporate Azure subscription for the tutorial. Also, please create a different account if you have used an Azure Trial offer against an existing Microsoft account previously.
-
Go to the Azure Portal at https://portal.azure.com
-
Click + (or + Create a resource ) and search for "Data Science virtual machine"
-
Select one of the options and click "Create" at the bottom of the page. [Note] The steps below are for "Data Science Virtual Machine for Linux (CentOS)".
-
[Optional] If you don't have existing Azure account, create one by clicking "Start free". That will open a separate page to create a trial azure account. Please follow the steps required and then return and refresh the Azure portal (Step 1 above)
-
On the Basics blade provide the following inputs:
a. Resource group: click "Create new" and type "rgCodemash"
b. Name: vmCodemash
c. Region: select either "South Central US" or "East US".
d. Size: click "change size" and on "Select a VM size" blade click "Clear all filters" and then search for "D12_v2".
e. Authentication type: choose "Password"
f. User name: demouser
g. Password / Confirm password: provide yours (but make sure you remember it, as you'll need it)
NOTE: Depending on your type of Azure subscription, you may want to choose a large size VM, or a GPU-enabled machine, though this is not strictly required for the workshop. There are limitations about the number of cores that could be requested by default on a Trial subscription.
-
Click Review + create button for your VM to be provisioned.
-
Once validation scripts passed successfully, click Create button
-
The VM should take a few minutes to provision
-
When the VM is ready, you should see a notification. Select Go to resource to view the deployed VM in the Portal.
-
On the blade for the VM copy **Public IP address" for your machine.
-
Go to the browser and type https://(your VM's ip address):8000 to sign-in to Jupyter Hub
-
You might see a pop-up with a warning for an invalid certificate Click Continue anyway in that case:
-
In the sign in window provide your username and password as provided during the VM Provisioning steps
-
Confirm that you can see the root directory in the Jupyter hub
If you are performing this setup a day before the tutorial, you can optionally Stop the VM to save on costs overnight, and resume it when you are ready to start on the day. Follow these steps to Stop the VM:
NOTE: When you are ready to re-start the VM, simply follow the previous steps and instead of selecting Stop, select Start. Your VM will take a couple of minutes to start, after which you can connect to Jupyter Hub as above.
NOTE: Depending on your laptop specs, you may have some issues following all steps in Part 4 Deep Learning module. However, for the purposes of the tutorial, we do not require access to GPU.
-
Install Anaconda environment using the official Downloads page.
-
The detailed guides on how to install it based on your OS are available here.
-
Once installation is successful, try running your first Python as described here
-
For tutorials we expect you to use either an editor, such as PyCharm or Jupyter Notebook environment (as described in step 3)
-
Create a new virtual environment "codemash":
- Open the anaconda prompt
- conda create --name codemash python=3.7
- Activate your conda virtual environment:
- (WINDOWS): activate codemash
- (LINUX, macOS): source activate codemash
- Install scikit-learn / pandas / matplotlib packages in the codemash conda environment:
- conda install scikit-learn
- conda install pandas
- conda install matplotlib
- Run PyCharm editor from conda environmnt:
- cd to PyCharm directory
- run pycharm64.exe file to start PyCharm editor with all the correct environment variables
- [For Lab 4, Deep learning] If you prefer to run Deep Learning lab on your local machine, please find and follow a tutorial that describes setting up Keras on a local environment (one such example is here)