GithubHelp home page GithubHelp logo

Shikhar Tiwari's Projects

credit-card-fraud-detection icon credit-card-fraud-detection

Applied SVC classifier and Logistic Regression classifier algorithm onto credit card transaction dataset to detect any fraud.

custom-data-visualization-using-matplotlib icon custom-data-visualization-using-matplotlib

Created a dynamic graph using matplotlib to better judge probabilistic data generated through the election dataset. A generated graph changes its colour w.r.t change in y-axis values.

hypothesis-testing-using-t-test icon hypothesis-testing-using-t-test

Hypothesis: University towns have their mean housing prices less effected by recessions. Performed a T-test to compare the ratio of the mean price of houses in university towns the quarter before the recession starts compared to the recession bottom.

long-term-stock-price-growth-prediction-using-nlp-on-10-k-financial-reports icon long-term-stock-price-growth-prediction-using-nlp-on-10-k-financial-reports

A 10-K FInancial Report is a comprehensive report which must be filed annually by all publicly traded companies about its financial performance. These reports are filed to the US Securities Exchange Commission (SEC). This is even more detailed than the annual report of a company. The 10K documents contain information about the Business' operations, risk factors, selected financial data, the Management's discussion and analysis (MD&A) and also Financial Statements and supplementary data. I have been expected to build an NLP pipeline that ingests 10-K reports of various publicly traded companies and build a machine learning model which can uncover the hidden signals to predict the long term stock performance of a company from the 10-K docs using the ‘Loughran McDonald Master Dictionary’. The Dictionary contain words that are specifically curated in the context of financial reports

network-connectivity icon network-connectivity

Importing and analyzing an internal email communication network between employees of a mid-sized manufacturing company. Each node represents an employee and each directed edge between two nodes represents an individual email. The left node represents the sender and the right node represents the recipient.

nlp-on-10k-documents icon nlp-on-10k-documents

This projects helps scraping and analysing the 10K and 10Q documents filed by publicly traded companies to the SEC.

project-report icon project-report

This project's sole aim is to find out whether there exists any relationship between the World's University Ranking and the expenditure made by each country for their respective education system.

regex-assignment icon regex-assignment

The goal of this assignment is to correctly identify all of the different date variants encoded in this dataset and to properly normalize and sort the dates.

salary-and-new-connections-predictions-using-networkx icon salary-and-new-connections-predictions-using-networkx

By using Networkx and ML algorithms created a model to predict whether or not employees in a given company are receiving a management position salary. Also predicted future connections between the employees of the network.

sec-10k-item-1a-ml-kmean-clustering icon sec-10k-item-1a-ml-kmean-clustering

This project was my final project for the UOFM data analytics certificate program used ML to cluster text files and validated those clusters using stock market data

sentiment-analysis icon sentiment-analysis

Exploratory sentiment analysis of a firm's management discussion from 10K annual SEC filing

social-media-sentiment-analysis icon social-media-sentiment-analysis

Using Text Mining and Natural Language Processing Techniques pre- processed 50k tweets. Visualized the impact of hashtags on tweets sentiment using Seaborn. Applied machine learning models, calculated f1_scores, accordingly used the best model for sentiment prediction.

spelling-recommender icon spelling-recommender

Created three different spelling recommenders, that each take a list of misspelled words and recommends a correctly spelled word for every word in the list. Each spelling recommender uses different Jaccard distance metrics. For every misspelled word, the recommender find the word in correct spellings that has the shortest distance, and starts with the same letter as the misspelled word, and return that word as a recommendation.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.