- Web scraping with R
- Pre requirements:
- Topics:
- Assignments
- Schedule
- Week 1
- Week 2
- Week 3
- Week 4
- Contact
In this repository you will find the code of the "Coding 2: Web Scraping with R" course in the 2020/2021 Fall term, part of the MSc in Business Analytics at CEU.
- Installed R and R studio
- "rvest", "data.table", "jsonlite" package installed
install.packages(c("rvest", "data.table", "jsonlite"))
- SelectorGadget extension for browser. (Chrome is the preferred browser)
- rvest, html_nodes, html_attr, html_table, lapply, sapply, for loop, if statment
- GET, POST, ad header and data to requests
- Selenium
- Multi threading with R and python
- Scrape news site(25%)
- Work with api(25%)
- Final assignments(50%)
6 X 100 mins
Schedule
- Loops, sapply, lapply, for while
- Functions
- Rvest
- HTML structure
- Object selector with css SelectorGadget
- Processing html documents with
rvest
- Economist scraper
- Yachtworld scraper
- Tasks : ultimatespecs scraper
- Processing Json objects
- Jsonlite
- Json in html document imdb
- Json in html document payscale
- GET
- Task exchange rate
- Nasdaq data
- Working with api
- Coingecko api
- Tradingview
- Eu fundings
- Task forbes