Challenge Question: AI-based Algorithmic Earnings Forecasting
Challenge: Develop an AI application that leverages the power of open-source generative AI models, natural language processing, sentiment analysis using NLP, and machine learning for financial forecasting. Utilize alternative data sources such as news sentiment and social media trends to predict company earnings and financial performance.
FinPredict: Leveraging sentiment analysis, real-time data, and the Google Trends API for actionable insights. Analyzing news sentiment, social media trends, and search peaks on Google Trends to predict company earnings and financial performance. The value of a business can be noticeably impacted whenever a document, a dramatic event, or even a press note is released. Regardless of the connotation, whether positive or negative, it is important to notice how it affects the popularity and overall external reviews on a company.
- Utilize VaderSentiment for sentiment analysis, with a focus on news sentiment and social media trends, and employ NLP and correlation analysis to gauge market sentiment effectively.
- VanderSentiment will process a user-defined data set and be trained in order to assign a numerical weight for the purpose of binary encoding words into a 'score' system to base off of the popularity for the company mentioned.
- Utilize VaderSentiment to process NLP and correlation analysis effectively, and analyze textual data to provide the foundation for predictive insights.
- Every comment acquired by accessing the name of a company in a post made on a finance forum. Using categorical encoding, positive, negative, and neutral keywords will map values of 1, -1, and 0 reespectively in order to have weigths to plot.
- Access real-time financial and economic data, keeping a watchful eye on critical financial events that influence market dynamics.
- Create visually appealing plots and graphs to gain deeper insights into sentiment trends and market behaviors.
- After cleansing the data, we had to come up with a simple yet professional visualization system.
- Utilize BeautifulSoup to scrape data from various sources, enhancing our dataset for analysis.
- Using the Reddit API, and the most up-to-date web traffic resources from Google Trends, we were able to map, categorize, and display forum sentiments on the chart as a numerical representation.
- Efficiently process and manipulate financial data, ensuring its readiness for analysis.
- All graphs and csv files were created from scratch using data cleansing and data analytics.
- Leverage the GNews API to aggregate and analyze news data, offering real-world context for our predictions.
- Utilize the Google Trends API to identify key times with search peaks.
- Enhance analysis during these critical periods by incorporating data from news sources.
- We incorporated data collection on a 7-day timeframe in order to get the most recent web traffic data to also limit the number of events that prompted the web traffic changes.
- Display visually appealing graphs that are easy for the user to understand.
This project is licensed under the MIT License.