dnnbuddy / pc-part-dataset

This project forked from docyx/pc-part-dataset

A dataset of PC parts scraped from PCPartPicker

License: MIT License

TypeScript 100.00%

PC Part Dataset

A dataset of PC parts scraped from PCPartPicker.

Part count: 60,506

Last updated: March 27, 2024

Download

The parts are available in JSON, JSON Lines, and CSV formats. You can find them in the ./data directory.
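As a rough illustration of consuming the JSON Lines format, the sketch below parses text that holds one JSON object per line. The `Part` type, the `parseJsonLines` helper, and the sample values are assumptions made for this example; the real property names for each category are documented in API.md.

```typescript
// Hypothetical record shape: the real per-category properties are listed in API.md.
type Part = Record<string, unknown>;

// Parse JSON Lines text: one JSON object per non-empty line.
function parseJsonLines(text: string): Part[] {
  return text
    .split(/\r?\n/)
    .map((line) => line.trim())
    .filter((line) => line.length > 0)
    .map((line) => JSON.parse(line) as Part);
}

// Inline sample standing in for the contents of a .jsonl file (made-up values).
const sample =
  '{"name":"Example CPU","price":199.99}\n' +
  '{"name":"Another CPU","price":null}\n';

const parts = parseJsonLines(sample);
console.log(`Parsed ${parts.length} parts`);
```

In practice the text would come from one of the files under ./data rather than an inline string.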

JSON Property Descriptions

See API.md for the JSON property descriptions of each product category.

Contents

  • General

    • CPUs
    • CPU Coolers
    • Motherboards
    • Memory
    • Storage
    • Video Cards
    • Cases
    • Power Supplies
    • Optical Drives
    • Operating Systems
    • Monitors
    • External Storage
  • Accessories / Other

    • Case Accessories
    • Case Fans
    • Fan Controllers
    • Thermal Compound
    • UPS Systems
  • Expansion Cards / Networking

    • Sound Cards
    • Wired Network Adapters
    • Wireless Network Adapters
  • Peripherals

    • Headphones
    • Keyboards
    • Mice
    • Speakers
    • Webcams

Running the Scraper

Warning: Make sure to turn on your VPN before running this scraper!

  1. Run npm install
  2. Run npm run start
  3. Wait about 5-10 minutes for the scraper to finish.
  4. All output is written to a directory named data-staging.

If you don't want to scrape every part type, you can specify individual part types by passing their IDs as arguments. You can find the list of part type IDs in API.md.

Example usage:

# Only get CPUs and video cards
npm run start -- cpu video-card
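To illustrate how such arguments could be interpreted, here is a minimal sketch that checks the requested IDs against a known list and falls back to every type when none are given. This is not the scraper's actual code: `selectPartTypes` and `KNOWN_PART_TYPES` are hypothetical names, and the list contains only the two IDs that appear in the example above.

```typescript
// Hypothetical subset of part type IDs; the full list is in API.md.
const KNOWN_PART_TYPES: readonly string[] = ["cpu", "video-card"];

// Return the part types to scrape: all known types when no arguments
// are given, otherwise the requested ones after validating each ID.
function selectPartTypes(args: string[], known: readonly string[]): string[] {
  if (args.length === 0) {
    return [...known];
  }
  const unknown = args.filter((id) => !known.includes(id));
  if (unknown.length > 0) {
    throw new Error(`Unknown part type ID(s): ${unknown.join(", ")}`);
  }
  return args;
}

// In a real CLI the arguments would come from process.argv.slice(2).
const selected = selectPartTypes(["cpu", "video-card"], KNOWN_PART_TYPES);
console.log(selected.join(", "));
```

Rejecting unknown IDs up front is a design choice of this sketch; it surfaces typos before any scraping starts.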

package.json Scripts

  • start => Starts the scraper.
  • dev => Starts the scraper without typechecking.
  • count => Runs counter.ts; gets the number of parts from ./data-staging/json or argv[1].
  • output => Runs output.ts; outputs JSONL and CSV formats to ./data-staging or argv[1].
  • zip => Zips the JSON, JSONL, and CSV folders in ./data.
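For readers curious about what a JSON-to-CSV conversion like the one the output script performs involves, the following is a minimal sketch. It is an assumption-laden illustration, not the contents of output.ts: `Row`, `escapeCsvField`, and `toCsv` are hypothetical names, the column order is taken from the first record, and null values are written as empty fields.

```typescript
// Hypothetical flat record; real part objects may contain other value types.
type Row = Record<string, string | number | boolean | null>;

// Quote a field when it contains a comma, quote, or line break,
// doubling any embedded quotes. Null becomes an empty field.
function escapeCsvField(value: string | number | boolean | null): string {
  if (value === null) return "";
  const text = String(value);
  return /[",\r\n]/.test(text) ? `"${text.replace(/"/g, '""')}"` : text;
}

// Convert records to CSV text, using the first record's keys as the header.
function toCsv(rows: Row[]): string {
  const [first] = rows;
  if (first === undefined) return "";
  const headers = Object.keys(first);
  const lines = [headers.map(escapeCsvField).join(",")];
  for (const row of rows) {
    lines.push(headers.map((h) => escapeCsvField(row[h] ?? null)).join(","));
  }
  return lines.join("\n");
}

// Made-up sample values for demonstration only.
console.log(toCsv([
  { name: "Example CPU", price: 199.99 },
  { name: "Another CPU", price: null },
]));
```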

License

MIT

Contributors

docyx
