
philipehuan / stockmarketanalysis


This project forked from aknakshay/stockmarketanalysis



Python 38.62% Java 61.38%


Stock Market Analysis

Using the power of Big Data tools to analyze the stock market.

Stocks selected:

NASDAQ: GOOGL, MSFT, ORCL, FB, AAPL, TSLA

NSE: TCS, INFY

Data Retrieval

Stock Prices

Per-minute stock prices for the companies listed on NASDAQ are retrieved with the Alpha Vantage API. Read in detail here.
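As a rough illustration of per-minute retrieval, the sketch below builds a request URL for Alpha Vantage's `TIME_SERIES_INTRADAY` function and extracts closing prices from a response payload. It is not the project's actual retrieval code: the helper names `intraday_url` and `parse_closes` are hypothetical, the API key is a placeholder, and the response layout (a `Time Series (1min)` object keyed by timestamp, with a `4. close` field) is assumed from Alpha Vantage's public JSON format.

```python
from urllib.parse import urlencode

BASE_URL = "https://www.alphavantage.co/query"


def intraday_url(symbol, api_key, interval="1min"):
    """Build the request URL for intraday prices of one symbol (hypothetical helper)."""
    params = {
        "function": "TIME_SERIES_INTRADAY",
        "symbol": symbol,
        "interval": interval,
        "apikey": api_key,
    }
    return BASE_URL + "?" + urlencode(params)


def parse_closes(payload, interval="1min"):
    """Return (timestamp, close) pairs sorted by timestamp.

    Assumes the payload is the decoded JSON response, with bars stored
    under the key "Time Series (<interval>)".
    """
    series = payload.get(f"Time Series ({interval})", {})
    return sorted((ts, float(bar["4. close"])) for ts, bar in series.items())


# A hand-written payload in the assumed shape, not real market data.
sample = {
    "Time Series (1min)": {
        "2019-01-02 09:31:00": {"4. close": "101.50"},
        "2019-01-02 09:30:00": {"4. close": "101.25"},
    }
}
closes = parse_closes(sample)
```

The payload would normally come from fetching the URL (for example with `urllib.request.urlopen`) and decoding the body with `json.loads`; that step is left out so the sketch runs without network access.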

For NSE, a scraper written in Python collects the latest price every minute. Read in detail here.

However, the price on the NSE website is sometimes not updated for an interval of 1 or 2 minutes. These gaps are filled by data interpolation.
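The README does not say which interpolation method is used. The sketch below assumes simple linear interpolation between the nearest known prices on either side of a gap. The function name `fill_missing_minutes` and the sample prices are hypothetical.

```python
from datetime import datetime, timedelta


def fill_missing_minutes(prices):
    """Fill per-minute gaps by linear interpolation (an assumed method).

    prices maps minute-resolution datetimes to prices. Returns a new dict
    that also contains every missing minute between the first and last key.
    """
    times = sorted(prices)
    filled = {}
    for start, end in zip(times, times[1:]):
        filled[start] = prices[start]
        gap = int((end - start).total_seconds() // 60)
        delta = prices[end] - prices[start]
        # Minutes strictly between two known observations.
        for step in range(1, gap):
            filled[start + timedelta(minutes=step)] = prices[start] + delta * step / gap
    if times:
        filled[times[-1]] = prices[times[-1]]
    return filled


# Illustrative values only: 09:16 and 09:17 are missing.
observed = {
    datetime(2019, 1, 2, 9, 15): 100.0,
    datetime(2019, 1, 2, 9, 18): 103.0,
}
filled = fill_missing_minutes(observed)
```

Minutes before the first or after the last observation are left untouched, since there is no second point to interpolate toward.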

Twitter Data

Twitter data is collected using Python with the Twitter API. Read in detail here.

Twitter data is also collected using Flume, which required modifying the code of Flume's Twitter package. Read in detail here.

Data Collection

Data from Twitter is stored in a data lake. This project uses Cloudera Datalake.

Data Preparation

The Twitter data is processed to correct spelling in the tweet text, using a Java library called LanguageTool.

Model - Sentiment Analysis

The Stanford CoreNLP library is applied to the Twitter data for tokenization, sentence annotation, part-of-speech tagging, syntactic analysis, and sentiment analysis with Stanford's pre-trained model. This yields a sentiment value for each tweet. The product of the number of followers and the sentiment value of each tweet is then aggregated per minute. Read in detail here.
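The per-minute aggregation can be sketched as follows. This assumes that "aggregated" means summed and that each tweet already carries a numeric sentiment value. The function name `aggregate_weighted_sentiment`, the tuple layout, and the sample values are hypothetical.

```python
from collections import defaultdict
from datetime import datetime


def aggregate_weighted_sentiment(tweets):
    """Sum followers * sentiment per minute (summation is an assumption).

    tweets is an iterable of (timestamp, followers, sentiment) tuples.
    Returns a dict mapping each minute to its total weighted sentiment.
    """
    totals = defaultdict(float)
    for timestamp, followers, sentiment in tweets:
        # Truncate the timestamp to the start of its minute.
        minute = timestamp.replace(second=0, microsecond=0)
        totals[minute] += followers * sentiment
    return dict(totals)


# Illustrative tweets only, not collected data.
tweets = [
    (datetime(2019, 1, 2, 10, 0, 5), 100, 3),
    (datetime(2019, 1, 2, 10, 0, 40), 50, 1),
    (datetime(2019, 1, 2, 10, 1, 10), 10, 4),
]
per_minute = aggregate_weighted_sentiment(tweets)
```

Weighting by follower count lets tweets from widely followed accounts contribute more to a minute's score than tweets from accounts with few followers.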

Read about our progress on our blog.

Contributors

aknakshay, nikpbn
