GithubHelp home page GithubHelp logo

fborowiec / scrapers Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 194 KB

Home Page: https://github.com/FBorowiec/scrappers

License: Other

Starlark 32.02% Python 66.63% Shell 1.35%
scraper scraping-websites scraping-python python rest-api rest restful-api

scrapers's Introduction

Script for scraping websites data

Repository where I store my web scraper.

Ex.: The Mr.Lodge web scraper acquires the latest information about apartments on the mrlodge.com website.

The result is a .csv file containing data of all listed apartments (example for the 11.12.2020):

Index Rent Location Rooms Area
0 € 5700 Gräfelfing 5.5 ca. 205 m²
1 € 5500 Gräfelfing 5.0 ca. 305 m²
2 € 5500 Munich-Maxvorstadt 6.0 ca. 180 m²
3 € 5000 Munich-Maxvorstadt 5.0 ca. 158 m²
4 € 4980 Munich-Isarvorstadt 5.0 ca. 185 m²
5 € 4500 Grünwald 4.0 ca. 172 m²
6 € 4300 Munich-Bogenhausen 5.0 ca. 147 m²
7 € 4250 Munich-Gärtnerplatzviertel 3.0 ca. 140 m²
8 € 3450 Munich-Schwabing 4.0 ca. 110 m²
9 € 1490 Munich-Nymphenburg 1.0 ca. 30 m²
10 € 3150 Munich-Altstadt 3.0 ca. 90 m²
11 € 1990 Munich-Bogenhausen 2.0 ca. 85 m²
12 € 1850 Munich-Berg am Laim 2.0 ca. 65 m²
13 € 1290 Munich-Bogenhausen 1.0 ca. 31 m²
14 € 8200 Grünwald 7.0 ca. 280 m²
15 € 3250 Munich-Au-Haidhausen 3.5 ca. 106 m²
16 € 2950 Munich-Glockenbachviertel 3.5 ca. 91 m²
17 € 2940 Gräfelfing 3.5 ca. 119 m²
18 € 2800 Munich-Maxvorstadt 3.0 ca. 80 m²
19 € 2650 Munich-Maxvorstadt 3.0 ca. 77 m²

How to run the code locally with Bazel already installed on host

Bazel installation

Install Bazel

Once you have successfully installed Bazel you can run the code using:

bazel run //:mr_lodge -- --output_folder=/path/to/output_folder/

Run the code inside a container

You can use my following Docker image to instantiate a container locally with Ubuntu and Bazel already installed:

docker run -it --rm framaxwlad/ubuntu_dev:latest

There you can simply clone the repository:

git clone https://github.com/FBorowiec/scrapers.git
cd scrapers/

And use the aforementioned commands to run the program:

bazel run //:mr_lodge -- --output_folder=/path/to/output_folder/

scrapers's People

Contributors

fborowiec avatar janboro avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.