GithubHelp home page GithubHelp logo

feigeliudan01 / classroom Goto Github PK

View Code? Open in Web Editor NEW

This project forked from ubc-stat/stat-545-guidebook

0.0 0.0 0.0 22.64 MB

Content for running instances of STAT 545/547M at UBC

Home Page: http://stat545.com/Classroom

HTML 99.99% R 0.01%

classroom's Introduction

STAT 545A/547M: Exploratory Data Analysis

This is the course syllabus for the Fall 2018 edition of STAT 545 and STAT 547. You should use this syllabus to:

  1. find information about the course, and
  2. navigate the course.

Quick Navigation

Both STAT 545 and STAT 547 make use of the following tools:

Tool + Link Description
http://stat545.com/Classroom (here) Think of this as the course "home" -- and this syllabus as your launch pad to other destinations. Contains lecture notes, assignments, and course information.
assignments, peer review, and participation information Info for Assignments, Peer Review, and Participation assessments can be found here.
Discussion-Internal GitHub repository For internal discussion. The world cannot see this.
Discussion GitHub repository. For public discussion. The world can see this.
STAT545-UBC-students GitHub Organization This will contain one GitHub repository per student, for you to submit homework to and give peer reviews.
UBC canvas This is for grade management. You'll be interacting with it by submitting a link to your homework.
http://stat545.com This holds course content, such as tutorials. Think of this as a textbook. We'll point you there when needed. This previously contained the information contained in Classroom, but that eventually became confusing. Some headers there are becoming deprecated.
My participation repository You can find completed exercises here

This framework of tools is under construction, as we move to a solution that's more scalable in terms of future iterations and multiple collaborators. We appreciate your patience and welcome your feedback!

About STAT 545/547M

STAT 545 is "Part I" of learning how to

  • explore, groom, visualize, and analyze data
  • make all of that reproducible, reusable, and shareable
  • using R

STAT 547M is "Part II".

Each course is worth 1.5 credits.

Selected Topics

STAT 545:

  • Introduction to R and the RStudio IDE: scripts, the workspace, RStudio Projects, daily workflow
  • Generate reports from R scripts and R Markdown
  • Coding style, file and project organization
  • Data frames or "tibbles" are the core data structure for data analysis: care for them with the tidyverse
  • Data visualization with ggplot2
  • How to write functions and work with R in a functional style
  • Version control with Git; collaboration via GitHub

STAT 547:

  • Be the boss of non-numeric data, esp. character and factor
  • Interactive pages, apps, and graphics with Shiny
  • Get data off the web and expose data, code, results on the web
  • Distribute data and code via an R package
  • Automate an analytical pipeline, e.g. via Make

Timing and Location

STAT 545 runs from September 4 until October 18, 2018.

STAT 547 runs from October 23 until November 29, 2018.

We'll meet as a class every Tuesday and Thursday, 09:30-11:00, in ESB 2012.

I'll aim to end class at 10:45.

Prerequisites

There are no official pre-requisites for STAT 545, but most students will have had at least one prior statistics course or comparable experience.

STAT 547 requires STAT 545.

Roots

Jenny Bryan deserves a huge amount of credit for founding and developing both STAT 545 and 547M over many years, along with her TA's. Some of their content is even being used in this very syllabus. Thank you!

Class Meetings

Here are the topics and links to the notes for each class meeting.

STAT 547:

| Meeting No. | Date | TA's | Topic | |---------------------|--------------|------|-------|-----------| | 01 | oct-23 tues | Chad, Sherrie | Writing your own R functions | | 02 | oct-25 thurs | Chad, Hossam | Regular expressions and character data | | 03 | oct-30 tues | Hossam, Sherrie | purrr, list-columns, nested data frames | | 04 | nov-01 thurs | Hossam, Rashedul | Part II | | 05 | nov-06 tues | Chad, Sherrie | Build your first R package | | 06 | nov-08 thurs | Chad, Rashedul | Part II | | 07 | nov-13 tues | Rashedul, Sherrie | shiny apps, and related tools | | 08 | nov-15 thurs | Rashedul, Chad | Part II | | 09 | nov-20 tues | Hossam, Sherrie | Automate tasks and pipelines | | 10 | nov-22 thurs | Hossam, Chad | Part II | | 11 | nov-27 tues | Chad, Sherrie | Get data from the web | | 12 | nov-29 thurs | Hossam, Rashedul | Part II |

STAT 545:

Meeting No. Date TA's Topic
01 sep-04 tues Chad, Sherrie Intro to course and software
02 sep-06 thurs Chad, Hossam Markdown and GitHub
03 sep-11 tues Rashedul, Sherrie Getting familiar with R & RMarkdown
04 sep-13 thurs Rashedul, Hossam The git workflow; collab with GitHub
05 sep-18 tues Chad, Sherrie working with data in R; dplyr and the tidyverse
06 sep-20 thurs Chad, Rashedul ggplot2, Part I
07 sep-25 tues Hossam, Sherrie ggplot2, Part II
08 sep-27 thurs Hossam, Rashedul Grouping of tibbles
09 oct-02 tues Rashedul, Sherrie Tidy data, reshaping
10 oct-04 thurs Rashedul, Chad, Hossam Guest Lecture: Rashedul, on table joins
11 oct-09 tues Hossam, Sherrie Advanced R programming; file I/O
12 oct-11 thurs Hossam, Rashedul The joy of Factors
13 oct-16 tues Rashedul, Sherrie Revisit ggplot; plotly; practicalities of daily figure-making
14 oct-18 thurs Rashedul, Chad The model-fitting paradigm in R; broom; deep thoughts about data analytic work

Expectations:

  • Show up to every class, and
  • Bring a laptop to every class

Evaluation

To gain marks in either STAT 545 or 547, you'll be completing five assignments, and submitting two peer reviews for each assignment. Participation counts too!

NOTE: You must have a GitHub account (free) to earn marks in this course, and then complete the course survey. Between STAT 545 and 547, you only need to complete the survey once.

Here's the breakdown of your course grade for either course:

Assessment Weight
5 Assignments 75% (15% per assignment)
10 Peer Reviews 15% (3% per assignment)
Participation 10%

There is no final exam for either course.

Auditing students must still complete and submit all assessments, to be graded on a pass/fail basis.

For more information on each assessment, navigate to the corresponding page in the navigation table above.

Teaching Team

Here is your dedicated teaching team!

Teaching Member Position Contact
Vincenzo Coia Instructor Email: [email protected] GitHub: @vincenzocoia Twitter: @VincenzoCoia LinkedIn: vincenzocoia
Chad Fibke Teaching Assistant GitHub: @ChadFibke
Hossameldin Mohammed Teaching Assistant GitHub: @hsmohammed; LinkedIn
Rashedul Islam Teaching Assistant GitHub: @rashedul, LinkedIn
Sherrie Lau Teaching Assistant GitHub: @sherrie9

Please see the "Conversation" section below to determine who to get in touch with for what, and how.

Office hours: Want to talk about the course outside of lecture? Let's talk during these dedicated times (generally, 11:00-12:00 every Monday, Tuesday, Wednesday). You're always welcome to schedule alternative times, too.

STAT 547:

Teaching Member Date Time Place
Rashedul Tue, Oct 23 11:00 - 12:00 ESB 3174
Vincenzo Wed, Oct 24 11:00 - 12:00 ESB 1043
Chad Mon, Oct 29 11:00 - 12:00 ESB 3174
Hossam Tue, Oct 30 11:00 - 12:00 ESB 3174
Vincenzo Wed, Oct 31 11:00 - 12:00 ESB 1043
Chad Mon, Nov 05 11:00 - 12:00 ESB 3174
Rashedul Tue, Nov 06 11:00 - 12:00 ESB 3174
Vincenzo Wed, Nov 07 11:00 - 12:00 ESB 1043
Sherrie Mon, Nov 12 11:00 - 12:00 ESB 3174
Rashedul Tue, Nov 13 11:00 - 12:00 ESB 3174
Vincenzo Wed, Nov 14 11:00 - 12:00 ESB 1043
Hossam Mon, Nov 19 11:00 - 12:00 ESB 3174
Chad Tue, Nov 20 11:00 - 12:00 ESB 3174
Vincenzo Wed, Nov 21 11:00 - 12:00 ESB 1043
Sherrie Mon, Nov 26 11:00 - 12:00 ESB 3174
Hossam Tue, Nov 27 11:00 - 12:00 ESB 3174
Vincenzo Wed, Nov 28 Thurs., Nov. 29 10:50 - 11:20; 12:30-13:00 ESB 1043; ESB 3174
Sherrie Mon, Dec 03 11:00 - 12:00 ESB 3174
Rashedul Tue, Dec 04 11:00 - 12:00 ESB 3174
Vincenzo Wed, Dec 05 11:00 - 12:00 ESB 1043

STAT 545:

Teaching Member Date Time Place
Rashedul Tue, Sept 04 11:00 - 12:00 ESB 3174
Vincenzo Wed, Sept 05 11:00 - 12:00 ESB 1043
Sherrie Mon, Sept 10 11:00 - 12:00 ESB 3174
Chad Tue, Sept 11 11:00 - 12:00 ESB 3174
Vincenzo Wed, Sept 12 11:00 - 12:00 ESB 1043
Hossam Mon, Sept 17 11:00 - 12:00 ESB 3174
Rashedul Tue, Sept 18 11:00 - 12:00 ESB 3174
Vincenzo Wed, Sept 19 11:00 - 12:00 ESB 1043
Sherrie Mon, Sept 24 11:00 - 12:00 ESB 3174
Rashedul Tue, Sept 25 11:00 - 12:00 ESB 3174
Vincenzo Wed, Sept 26 11:00 - 12:00 ESB 1043
Chad Mon, Oct 01 11:00 - 12:00 ESB 3174
Hossam Tue, Oct 02 11:00 - 12:00 ESB 3174
Vincenzo Wed, Oct 03 11:00 - 12:00 ESB 1043
Sherrie Mon, Oct 08 11:00 - 12:00 ESB 3174
Chad Tue, Oct 09 11:00 - 12:00 ESB 3174
Vincenzo Wed, Oct 10 11:00 - 12:00 ESB 1043
Chad Mon, Oct 15 11:00 - 12:00 ESB 3174
Hossam Tue, Oct 16 11:00 - 12:00 ESB 3174
Vincenzo Wed, Oct 17 11:00 - 12:00 ESB 1043
Sherrie Mon, Oct 22 11:00 - 12:00 ESB 3174
Rashedul Tue, Oct 23 11:00 - 12:00 ESB 3174
Vincenzo Wed, Oct 24 11:00 - 12:00 ESB 1043

Do you need my permission to enroll or audit in this course? I'm happy to sign the Change of Registration form if there's still room -- come see me.

Conversation, and Getting Help

Are you stuck? First, try to get unstuck by yourself by following the advice of stat545.com: help-general.

While you are getting started, we recommend you seek help within the STAT545 community first, before, e.g. posting to external forums. We are more cuddly.

The instructor and TAs stand ready to assist you, but your peers will also be a great source of good questions and answers. For that reason, we encourage you to seek help in ways that are visible to others.

The options are sort of listed in order of preference. But we realize every situation is different and your comfort level with these approaches will change as you learn more.

  1. Want to talk about content/coding issues? Post an Issue in the Discussion (public) repository.
  2. Want to talk about the course? Post an Issue in the Discussion-Internal (private) repository.
  3. Want to talk in person? Come visit us during office hours!
  4. Want to privately contact Vincenzo? Feel free to send me an email.
    • I look forward to receiving your email, though I do encourage you to post in one of the Discussion repositories unless it's really not appropriate for either platform.

Basic Issue workflow

Some advice on opening an Issue on GitHub:

  • Give it a specific title.
    • BAD: "aaaaaarrrrrrgh!", "things not working", "i need help"
    • GOOD: 'error when indexing a matrix: "incorrect number of dimensions"'
  • Stay specific and be complete-but-concise in the body of the description. Don't expect your helper to play 20 questions with you.
  • (Optional) Tag someone:
    • To get the attention of the teaching team, add the @2018_teaching_team tag to notify all five of us.
  • Don't just create Issues -- also respond to them! Think about this in terms of adding to the conversation, not in terms of "correctness".
  • Don't forget to click "Submit new issue"!

Typically, this will trigger an email to the person/team you tagged. The title of your issue will be in the subject line, so I repeat, make it specific. Your description will become the body of the email. At the bottom will be a link to the issue on GitHub.

If all goes well, your helper will respond. I almost always do this directly via GitHub, though simply replying to the email basically works. In any case, this back-and-forth will show up as a series of comments on your original issue. It's like an email dialogue but better:

  • It's embedded in a relevant Organization/project/repo, so it will be easier to find later vs. digging out of your giant vat of unfiled email.
  • It's potentially visible to others (depending on the repo), which could save us from asking/answering the same questions repeatedly.
  • The whole discussion will be mirrored via email, so that still serves as a great way to prompt participants to tune in.
  • Later you can get fancy and refer to commits and other issues within the repo in slick ways.

Once the problem is resolved, the issue can be closed. Note that closed issues remain accessible, in case anyone needs to consult them in the future.

Annotated Resources

Here are the resources we will be referring to throughout the course, along with a brief description of the resource.

Overarching resources:

  1. http://stat545.com
    • As mentioned earlier, this website can be thought of as a textbook for STAT 545/547.
  2. R for Data Science (aka "r4ds"), by Garrett Grolemund and Hadley Wickham.
    • STAT 545/547 closely mirrors the topics of this book, making this book more of a true "textbook" for the course.

Resources for more specific topics:

  1. Happy git with R, by Jenny Bryan and the STAT 545 TA's
    • Great for marrying git, GitHub, R, and RStudio in your workflow.
  2. Advanced R, by Hadley Wickham
    • If you want to learn more about R as a programming language, this is a very readable and concise way of doing so.
  3. ggplot2 book, by Hadley Wickham
    • Useful for digging deeper into ggplot2.
  4. RMarkdown book, by Yihui Xie et al.
    • Brand new! Looks like a comprehensive resource for everything R Markdown related.
  5. R packages by Hadley Wickham
    • Readable, approachable help for making R packages.
  6. Advanced R, by Hadley Wickham
    • If you want to learn more about R as a programming language, this is a very readable and concise way of doing so.

classroom's People

Contributors

vincenzocoia avatar hsmohammed avatar rashedul avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.