GithubHelp home page GithubHelp logo

amazon_expense_tracker's Introduction

Amazon Order History Scraper

This project allows you to extract and analyze your own Amazon order history. It uses Python, Selenium and BeautifulSoup for web scraping and DuckDB for data loading and analysis. The scraper is designed to navigate through your Amazon orders, save the details of each item and compile it into a CSV file.

Getting Started

Start by installing the required Python packages like Selenium, BeautifulSoup, and DuckDB. Set up your Amazon username and password in an .env file for secure access.

How it Works

The AmazonScraper class automates the login process and navigates through your order history pages to save the details of each order. The scraped order data is then parsed using BeautifulSoup and organized into a pandas DataFrame. This DataFrame is saved as a CSV file, allowing you to perform any desired data analysis.

You can then load this CSV into a DuckDB database, which is designed for fast analytic queries. This project provides several example SQL queries to analyze your spending by categories, specific categories, and more.

The goal of this project is to empower you with an understanding of your personal spending habits on Amazon. Be aware that the actual navigation might vary based on the actual layout of Amazon's site and your account settings, which might require adjustments in the code.

Note

This project is intended for personal use and not for misuse or violation of Amazon's Terms of Service. Please ensure you comply with these terms when using this tool.

amazon_expense_tracker's People

Contributors

liorkaufman avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.