codershiyar / data_analyse

This project is a data analysis (data science) tool with scripts that analyze image and text data and store the results in a Microsoft Access database. It also includes a set of Power BI dashboards that connect to the database and visualize the analyzed data.

Jupyter Notebook 96.81% Python 3.19%
data-analysis data-science data-storage power-bi sentiment-analysis data-analytics data-visualization image-analysis sentiment-classification sentiment-classifier sentimental-analysis text-analysis data-analysis-python microsoft-access word-counting data-visualization-project data-visualizations

data_analyse's Introduction

Data Science tool for data analysis

  • This project is a data analysis tool that includes several scripts to analyze images and text data and store results in a Microsoft Access database.
  • The project was completed in under 10 hours because of time constraints, so there are likely opportunities to optimize the code further for better performance and functionality.
  • I am a student at The Hague University of Applied Sciences and a software engineer with experience overseeing large-scale programming projects across diverse industries. Alongside my studies, I research and explore innovative ways to teach programming and design to students of all levels. You can find me online under the name "Coder Shiyar".

Dashboard of Image Analysis Results

Dashboard of Word Analysis Results

Dashboard of Sentiment Analysis Results

Installation

Clone this repository to your local machine using https://github.com/codershiyar/data_analyse.

The following libraries are required to run this project. Only extcolors, pyodbc, and matplotlib are third-party packages that need to be installed (for example with pip); the others are part of the Python standard library.

  • extcolors: A library used to extract the most common colors in an image.
  • os: A library used for interacting with the operating system, such as accessing files and directories.
  • collections: A library used for handling collections of data, such as counting occurrences of items in a list or dictionary.
  • pyodbc: A library used for connecting to and interacting with databases using SQL commands.
  • concurrent.futures: A library used for running tasks asynchronously and in parallel.
  • math: A library used for mathematical operations and functions.
  • csv: A library used for reading and writing CSV files.
  • re: A library used for working with regular expressions to search and manipulate text.
  • json: A library used for reading and writing JSON files.
  • matplotlib.pyplot (imported as plt): A library used for creating visualizations and plots in Python.
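
Before running the scripts, it can help to confirm that the packages which do not ship with Python are installed. The following is a minimal sketch, not part of the repository; the helper name missing_packages is hypothetical, and it only checks that each package can be found, not which version is installed.

```python
import importlib.util

# Third-party packages from the list above; everything else ships with Python.
THIRD_PARTY = ["extcolors", "pyodbc", "matplotlib"]


def missing_packages(names):
    """Return the names that cannot be found in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(THIRD_PARTY)
    if missing:
        print("Missing packages:", ", ".join(missing))
        print("Install them with: pip install " + " ".join(missing))
    else:
        print("All third-party packages are available.")
```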

Usage

This project includes several scripts to analyze data and store results in a Microsoft Access database. Here is a brief description of each script:

  • database.py allows you to interact with a Microsoft Access database using SQL commands.
  • images_analyse.py analyzes images and returns the most frequently used colors in each picture.
  • popular_words.py analyzes texts to get popular words and saves the results to the database.
  • sentiment_analyse.py analyzes customer sentiment by their reviews and saves the results to the database.
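
To illustrate the kind of counting that popular_words.py performs, here is a minimal sketch built on re and collections.Counter, both of which appear in the library list. It is not the repository's code: the function name, the stop-word list, and the sample reviews are assumptions made for illustration, and the real script may tokenize or filter differently.

```python
import re
from collections import Counter

# Hypothetical stop-word list; the project's actual filtering is not documented.
STOP_WORDS = {"the", "a", "an", "and", "is", "it", "to", "of", "was"}


def popular_words(texts, top_n=3):
    """Count lowercase word occurrences across texts and return the top_n."""
    counts = Counter()
    for text in texts:
        # Lowercase first, then keep runs of letters and apostrophes as words.
        words = re.findall(r"[a-z']+", text.lower())
        counts.update(w for w in words if w not in STOP_WORDS)
    return counts.most_common(top_n)


# Made-up sample data, not taken from the project's CSV file.
reviews = [
    "The delivery was fast and the packaging was great.",
    "Great price, fast delivery.",
]
print(popular_words(reviews, top_n=3))
```

Each entry in the result is a (word, count) pair, which is a convenient shape for inserting rows into a database table.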

The data folder contains a CSV file with image filenames and text data for analysis.
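
A CSV file of this kind can be read with the csv module from the library list. The sketch below is an assumption-laden illustration: the column names image and review are hypothetical (the real header names are not documented here), and an in-memory string stands in for the actual file.

```python
import csv
import io


def load_rows(csv_file):
    """Read a CSV file object into a list of dicts keyed by header name."""
    return list(csv.DictReader(csv_file))


# Stand-in for the real file; the column names and rows are assumptions.
sample = io.StringIO(
    "image,review\n"
    "shoe_01.jpg,Great fit and fast delivery\n"
    "bag_07.jpg,The strap broke after a week\n"
)
rows = load_rows(sample)
image_names = [row["image"] for row in rows]
texts = [row["review"] for row in rows]
print(image_names)
print(texts)
```

For a file on disk, the same function can be called with open(path, newline="", encoding="utf-8"), where path points to the CSV file in the data folder.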

Additionally, the PowerBI folder contains a Power BI project that is connected to the database and includes several dashboards with analysis results.
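
The same Access database that Power BI reads from can be reached from Python through pyodbc. The sketch below shows one possible way to do this and is not the content of database.py: the function names, the table word_counts, its columns, and the example path are all hypothetical. It also assumes a Windows machine with the Microsoft Access ODBC driver installed.

```python
def access_connection_string(db_path):
    """Build an ODBC connection string for an Access .accdb/.mdb file."""
    return (
        "DRIVER={Microsoft Access Driver (*.mdb, *.accdb)};"
        f"DBQ={db_path};"
    )


def save_word_counts(db_path, word_counts):
    """Insert (word, total) pairs into a hypothetical word_counts table."""
    import pyodbc  # imported here so the helper above works without the driver

    conn = pyodbc.connect(access_connection_string(db_path))
    try:
        cursor = conn.cursor()
        # Parameter markers (?) keep the values out of the SQL text.
        cursor.executemany(
            "INSERT INTO word_counts (word, total) VALUES (?, ?)",
            list(word_counts),
        )
        conn.commit()
    finally:
        conn.close()


# Hypothetical path, shown only to illustrate the resulting string.
print(access_connection_string(r"C:\data\results.accdb"))
```

Calling save_word_counts with a list such as [("delivery", 2), ("fast", 2)] would write one row per pair, provided the table already exists in the database.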

To run any of the scripts, run python script_name.py in your terminal, replacing script_name.py with the name of the script.

Web content scraper

  • If you need to gather and analyze data such as photos or videos, this project provides a helpful solution: https://github.com/codershiyar/web-content-scraper
