GithubHelp home page GithubHelp logo

yagniksojitra / growthgenie-hackathon Goto Github PK

View Code? Open in Web Editor NEW

This project forked from vishal105/growthgenie-hackathon

0.0 0.0 0.0 1.73 MB

Shell 0.48% Python 81.49% Jupyter Notebook 18.03%

growthgenie-hackathon's Introduction

Personal Finance :

๐ŸŒŸ Featured on Streamlit community forum ๐ŸŒŸ

coverage version

Clairvoyant, an EXL company, is a global technology consulting and services leader that helps organizations in their business transformation by maximizing the value of data through actionable insights. Founded in 2012, Clairvoyant is headquartered in Phoenix, AZ, with offices in the US, Canada, and India and serves marquee clients in the financial services, retail, healthcare, and tech industries.

home_image

High Level Design :

  • Data Source: The data source is shown as a computer. This can be any device or system that generates data, such as a web server, a database, or an IoT device.
  • Data Ingestion: Data is ingested into Azure Data Lake Storage Gen2 using the HTTP Connector. This connector can be used to ingest data from a variety of sources, including web APIs, databases, and files.
  • Data Transformation: Once data is in Azure Data Lake Storage Gen2, it can be transformed using various tools and services. The diagram shows Azure Data Factory being used for transformation. Azure Data Factory is a serverless data integration and ETL/ELT service that can be used to orchestrate and automate data movement and transformation.
  • Data Storage: After transformation, the data is stored in Azure Data Lake Storage Gen2. This is a scalable and secure storage system that can store petabytes of data.
  • Data Analysis: Data can be analyzed using a variety of tools and services. The diagram shows Azure SQL Database and Streamlit being used for analysis. Azure * * SQL Database is a relational database service that can be used to run SQL queries on the data. Streamlit is a Python library that can be used to create interactive data dashboards and applications.
  • Data Publication: Data can be published to other systems and services. The diagram shows Azure Data Lake Storage Gen2 being used to publish data to Azure Blob * Storage and ML Models. Azure Blob Storage is another object storage service that can be used to store data. ML Models can be used to train and deploy machine learning models on the data.
  • Azure Data Lake can be used to store, transform, and analyze data from a variety of sources. This can be used to power a variety of applications, such as data analytics, machine learning, and artificial intelligence.

Here are some additional details that are not shown in the diagram:

  • The data can be structured, semi-structured, or unstructured.
  • The data can be in real-time or batch-oriented.
  • The data can be accessed from anywhere in the world.
  • The data is secure and can be protected with access control and encryption.

Approach :

  • So Initially I have used an .ipynb file to do the preprocessing and do some visualization

  • Then I have made another Utilities Folder which contain BusinessAnalysis.py CustomerAnalysis.py to implement all the functions related to preprocessing and plotting

  • I have imported the same file in app.py and used it along with streamlit to build the app.


Features :

  • Shows multiple analytical charts to help me better understand the details.
  • Connected to the database and automated.
  • Answers few predefined quick QNA type questions.
  • Responsive layout, can be opened in any device.

How to run?

To run the app you need to download this repository along with the required libraries and it the command line you have to write streamlit run app.py to run.


Document Structure

Personal Finance 
โ”‚
|---- __pycache__
|
|---- .streamlit
|   |---- config.toml
|
|---- dataset 
|   xlsx files
|
|---- utilities
|   |---- __pycache__
|   |---- BusinessAnalysis.py
|   |---- CustomerAnalysis.py
|   |---- AzureSqlLoader.py
|   |---- testclass.ipynb
|   
|
|
|---- app.py
|---- auth.py
|---- user_dashboard.py
|---- dataframe_visualisation.py
|---- markdown.py
|---- Procfile 
|---- README.md
|---- requirements.txt
|---- setup.sh


Technologies used :

  • python library - numpy, pandas, seaborn, matplotlib, streamlit
  • version control - git
  • backend - streamlit
  • concept - OOP
  • Cloud Technologies used- Azure Storage, Azure Data Factory, Azure SQL, Azure DataLake, Azure WebApp.

Tools and Services :

  • IDE - Vs code
  • Application Deployment - Azure WebApp
  • Code Repository - GitHub
  • Dataset -Azure SQL DB, Storage Account and Data Lake.


How to Setup:

Welcome to my GrowthGenie app! To get started, follow the steps below to set up your environment variables:

Setup Environment Variables

  1. Create a Virtual Environment:

    python3 -m venv venv
    source venv/bin/activate
    
  2. Install Dependencies:: pip install -r requirements.txt

  3. Set Environment Variables: echo "export SERVER=my_server_address" >> venv/bin/activate echo "export DATABASE=my_database_name" >> venv/bin/activate echo "export USER=my_database_user" >> venv/bin/activate echo "export PASSWORD=my_database_password" >> venv/bin/activate echo "export AZURE_KEY=my_azure_key" >> venv/bin/activate

  4. Activate the Virtual Environment and run the app: source venv/bin/activate streamlit run app.py

If you Liked this project the you can consider connecting with me:

growthgenie-hackathon's People

Contributors

shreyansbardia557 avatar akriti982 avatar vsojitra-genysoft avatar vishal105 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.