Project of Data Analysis and Statistical Inference course on Coursera, February 2014 edition.
The project studies the relationship between the highest degree earned by United States residents and their family income in constant dollars.
Access to education and its funding is the subject of many discussions on social mobility and redistribution of income. The study explores data from a long running social survey to verify one of the main topic of these discussions: if family income is related to education level.
The study uses General Social Survey (GSS) data for the year 2012.
A RStudio project with code tree organized with ProjectTemplate.
The main files are:
- README.md: this file
- data/statistic_project_gss.Rdata: raw data in R format
- LICENSE: copyright and license information
- build-data-analysis.R: R script to create the analysis document in md, html and pdf format
- data-analysis.Rmd: analysis document in R markdown format
You have to install R, RStudio, ProjectTemplate, knitr, and Pandoc with a LaTeX distribution that supports XeLaTeX engine.
To create the document:
- clone this repository
- open RStudio or a R console and set the working directory to
src
directory (use setwd()) - source
build-data-analysis.R
script
In the src
directory you find the files data-analysis.md
, data-analysis.html
, data-analysis.pdf
.