This project forked from pjsio/me114

ME114 Introduction to Data Science and Big Data Analytics 2015

LSE Methods Summer Programme 2015

Kenneth Benoit, Department of Methodology, LSE
Slava Mikhaylov, University College London
Paul Nulty (Labs), Department of Methodology, LSE

Version: August 25, 2015

This repository contains the class materials for the LSE Methods Summer Programme course ME114 Introduction to Data Science and Big Data Analytics taught in August 2015 by Kenneth Benoit and Slava Mikhaylov.

Instructions for use

You have three options for downloading the course material found on this page:

  1. You can download the materials by clicking on each link.

  2. You can "clone" the repository using the button labelled "Clone in Desktop", found on the right side of your browser window as you view this repository. If you do not have a git client installed on your system, you will need to install one and make sure that git itself is installed. This option is preferred, since you can refresh your clone as new content gets pushed to the course repository. (New material will be pushed to the course repository at least once per day while the course takes place.)

  3. You can download a static copy by choosing the button on the right marked "Download zip", which downloads the entire repository as a zip file.

You can also subscribe to the repository if you have a GitHub account, which will send you updates each time new changes are pushed to the repository.

Full course details

Office Hours

  • Ken: Tues 26 Aug, Wed 27 Aug, and Thurs 28 Aug, 13:00 - 14:00, Columbia House B8.11
  • Slava: Mon 25 Aug, Wed 25 Aug, 13:00 - 14:00, Columbia House B8.05

Instructions for Submitting Homeworks

Each homework will be a single file in the RMarkdown format. The files linked below are named very carefully, to make it easy for us to identify your completed lab assignments.

  1. Obtaining the starter files.

    Each day below links to a starter file for you to download and work with. These files are in the RMarkdown format. You should embed your answers, with code, into your copy of the starter file.

  2. Renaming the starter files.

    For example, the first assignment file is named ME114_assignment1_LASTNAME_FIRSTNAME.Rmd, which you will need to rename by replacing the uppercase terms with your own last and first names, e.g. ME114_assignment1_Nulty_Paul.Rmd.

  3. From RStudio, you can create an HTML output file with your evaluated code, figures, and text answers by clicking the "Knit HTML" button at the top of the editor pane. The resulting HTML file will be saved in your working directory.

  4. You will need to upload the resulting HTML file -- renamed! -- to the course Moodle page, to the appropriate assignment folder.

We will walk you through this process in the Day 1 lab, so don't worry if it seems complicated the first time. This sort of careful workflow process and file management is part of learning practical data science too!
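
The renaming step above can be sketched in a few lines. This is only an illustration of the naming convention, written in Python rather than the R used in the course; the helper name `submission_name` and its parameters are hypothetical, and the assumption that the uploaded HTML file follows the same pattern with an `.html` extension should be checked against the Moodle instructions.

```python
from pathlib import Path
from typing import Optional

# Starter file name as given in the instructions above.
TEMPLATE = "ME114_assignment1_LASTNAME_FIRSTNAME.Rmd"


def submission_name(template: str, last: str, first: str,
                    suffix: Optional[str] = None) -> str:
    """Replace the uppercase placeholders with a student's names.

    `suffix` optionally swaps the file extension, e.g. ".html" for the
    knitted output (an assumption about the expected upload name).
    """
    name = template.replace("LASTNAME", last).replace("FIRSTNAME", first)
    if suffix is not None:
        name = str(Path(name).with_suffix(suffix))
    return name


if __name__ == "__main__":
    print(submission_name(TEMPLATE, "Nulty", "Paul"))
    print(submission_name(TEMPLATE, "Nulty", "Paul", suffix=".html"))
```

Run with the example names from step 2, the first call reproduces the file name shown there, ME114_assignment1_Nulty_Paul.Rmd.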

Resources for each day

Monday, August 17: Overview and introduction to data science [KB, SM]

Tuesday, August 18: Research design issues in data science [SM, KB]

Wednesday, August 19: Linear Regression [SM]

Thursday, August 20: Generalized Linear Regression [SM]

Friday, August 21: Resampling Methods [SM]

Monday, August 24: Association rules and clustering [KB]

Tuesday, August 25: Machine Learning [KB]

Wednesday, August 26: Text analysis [KB]

Thursday, August 27: Unsupervised learning and dimensional reduction [KB]

Friday, August 28: Mining the Social Web [PN]

Exam: Friday, August 28, 14:00 - 16:00, Room: TBA

  • Instructions: Complete and submit the exam just as you would any lab assignment: by renaming the file, editing the R Markdown, and submitting through Moodle.
  • Formatting: Put your own textual answers in boldface (using **boldface type** in RMarkdown), so that we can easily identify them.
  • Deadline: Monday 30 September 17:00 London time (GMT+1)
  • The exam in R Markdown and html
  • The exam solution in R Markdown and html
