Data Mining using R
Welcome to Clemson University's Cyberinfrastructure and Technology Integration Training Workshops!
This workshop focuses on data mining techniques in R, with the following learning objectives:
- Understand data acquisition: downloading from static links, crawling through entire websites, and using headless browser application to mine websites with delayed-response data
- Understand data management: organizing data directories, working with databases
- Understand HPC concepts: automating data-mining process through the Palmetto Supercomputer
This is an intermediate workshop, and it is recommended that you either have attended or are comfortable with materials presented in the following workshops:
Important!
Prior to coming to the workshop, please make sure that you have completed the preparation steps to set up the required libraries.