Big Data - MapReduce program and Visualisation for Word count and Word co-occurence for data on NBA teams, using Hadoop.
- Part 1 - Collecting data on NBA teams from 3 different sources: NY Times, Twitter and Common Crawler using APIs
- Part 2 - MapReduce program for calculating Word Count on sample data, using Apache Hadoop.
- Part 3 - MapReduce programs for calculating Word Count and Word Co-occurence for the NBA data collected from all 3 sources, using Apache Hadoop. Plus Visualizations of these results using d3.js. And comparision of the visualizations from all 3 sources for similarity.
- Website - A website to view all the visualizations in a easier way.