
content-scraper's Introduction

Content Webscraper

This is a content webscraper used to get information from http://shirts4mike.com/shirts.php. It retrieves information about shirt items on the site.

This is a Node.js/npm project that uses Ramda. It uses JSDOM to scrape the HTML DOM and node-csv-stringify to convert JSON objects to CSV.

Running The Application

Run the application from the command line with npm start. Starting it this way ensures that the application runs with all of its features.

How To Use

When the application runs, it scrapes the information it needs and then creates a CSV file containing the results. The CSV file is named after the current date and is saved in the data directory.
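As a rough illustration of the date-based naming described above, here is a minimal sketch. The helper name csvPathForDate and the YYYY-MM-DD file name format are assumptions made for this example; the project's actual naming code may differ.

```javascript
const path = require('path');

// Hypothetical helper: builds a path such as data/2020-01-05.csv.
// The YYYY-MM-DD format is an assumption, not confirmed by the project.
function csvPathForDate(date, dataDir = 'data') {
  const yyyy = date.getFullYear();
  const mm = String(date.getMonth() + 1).padStart(2, '0'); // months are 0-based
  const dd = String(date.getDate()).padStart(2, '0');
  return path.join(dataDir, `${yyyy}-${mm}-${dd}.csv`);
}
```

Calling csvPathForDate(new Date()) would give the output path for a run started today.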

For each shirt item, it records the title, the price, the image URL, the page URL, and the time the data was obtained. These properties are the columns of the CSV, and each row represents one shirt.
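The row layout described above could look roughly like the sketch below. It uses a small hand-written CSV serializer as a stand-in so that it runs without dependencies; the project itself uses node-csv-stringify for this step. The property names in COLUMNS and the function names are assumptions made for this example.

```javascript
// Assumed property names, in the order the README lists them.
const COLUMNS = ['title', 'price', 'imageUrl', 'url', 'time'];

// Quote a field only when it contains a comma, quote, or line break,
// doubling any embedded quotes.
function escapeCsvField(value) {
  const text = String(value);
  return /[",\r\n]/.test(text) ? `"${text.replace(/"/g, '""')}"` : text;
}

// Hypothetical stand-in for the CSV step: one header row, then one row per shirt.
function shirtsToCsv(shirts) {
  const header = COLUMNS.join(',');
  const rows = shirts.map(shirt =>
    COLUMNS.map(column => escapeCsvField(shirt[column])).join(',')
  );
  return [header, ...rows].join('\n') + '\n';
}
```

Each object passed to shirtsToCsv becomes one row, so the number of data rows matches the number of shirts scraped.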

If the contents are scraped successfully, a success message is displayed and the application exits. If the application runs into an error, the error is logged to the console and also appended to the file logs/scraper-error.log.
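The error handling described above might be sketched as follows, using only Node's built-in fs and path modules. The function names and the timestamped line format are assumptions made for this example; only the logs/scraper-error.log path comes from the README.

```javascript
const fs = require('fs');
const path = require('path');

// Hypothetical log line format: ISO timestamp followed by the error message.
function formatLogLine(error, now = new Date()) {
  return `[${now.toISOString()}] ${error.message}\n`;
}

// Log to the console and append the same error to the log file,
// creating the logs directory first if it does not exist yet.
function logError(error, logFile = path.join('logs', 'scraper-error.log')) {
  console.error(error.message);
  fs.mkdirSync(path.dirname(logFile), { recursive: true });
  fs.appendFileSync(logFile, formatLogLine(error));
}
```

Appending rather than overwriting keeps errors from earlier runs in the same file.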

NOTE: The application may take some time to finish, depending on the internet connection. If nothing appears to be happening, just wait. A success or error message will eventually appear.

content-scraper's People

Contributors

braagaa
