
tsvetang2 / scrappingtasks


This repository will contain the interview tasks I was assigned while interviewing with companies.

Python 100.00%
automation interview interview-practice interview-test python python3 tasks test

scrappingtasks's Introduction

Please choose 2 out of the 3 following tasks and complete them:


Task 1) Please list at least 5 public tenders awarded in the following regions: Europe (EU-level, or any European country) and the United States. At least 2 of the 5 awarded companies should be publicly listed.

  • Please send an Excel or Google Sheets file with the following information: country, tender title or short description, awarded company name, link to the tender award notice and, if the awarded company is publicly listed, its ticker, main stock exchange and ISIN. For more information about awarded public tenders, you can read the Alerts section on the TenderAlpha website.

Task 2) Please extract the awarded contract information from the official UK public procurement website by following these steps:

Visit the website: https://www.contractsfinder.service.gov.uk/Search/Results

Create a search with the following parameters:

  • Keywords: financial or database or data
  • Procurement Stage: Awarded contract
  • Date range: 31/10/2022 – 04/11/2022

Download the data as an .xml file.

Read the .xml file and extract the following data from the notices:

  • Publish date
  • Date of award
  • Title
  • Short description
  • Awarded company
  • Awarded company address
  • Awarded value
  • url

Save the output in an Excel file.
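
The read-and-extract step above could be sketched as follows. This is a minimal sketch, not a reference solution: the element names in FIELDS and the "FullNotice" wrapper are assumptions about the downloaded export and should be checked against the actual file. The helper names (local_name, first_text, extract_notices, save_excel) are hypothetical. Writing the Excel file is assumed to go through pandas, which in turn needs openpyxl installed.

```python
import xml.etree.ElementTree as ET

# Output column -> XML element name.
# ASSUMPTION: these element names are guesses; adjust them to the real export.
FIELDS = {
    "publish_date": "PublishedDate",
    "award_date": "AwardedDate",
    "title": "Title",
    "short_description": "Description",
    "awarded_company": "SupplierName",
    "awarded_company_address": "SupplierAddress",
    "awarded_value": "Value",
    "url": "Url",
}


def local_name(element):
    """Return the tag name without its '{namespace}' prefix."""
    return element.tag.rsplit("}", 1)[-1]


def first_text(notice, name):
    """Text of the first non-empty descendant called `name`, or '' if absent."""
    for element in notice.iter():
        if local_name(element) == name and element.text:
            return element.text.strip()
    return ""


def extract_notices(xml_text, notice_tag="FullNotice", fields=FIELDS):
    """Return one dict per notice element found anywhere in the document."""
    root = ET.fromstring(xml_text)
    notices = [e for e in root.iter() if local_name(e) == notice_tag]
    return [
        {column: first_text(notice, tag) for column, tag in fields.items()}
        for notice in notices
    ]


def save_excel(rows, path):
    """Write the rows to an .xlsx file (assumes pandas and openpyxl are installed)."""
    import pandas as pd  # imported lazily so extraction works without pandas

    pd.DataFrame(rows).to_excel(path, index=False)
```

With the export saved locally under a name of your choosing, the rows would come from extract_notices(open(path, encoding="utf-8").read()) and could then be passed to save_excel. The sketch keeps only the first match per field, so a notice with several awarded suppliers would need an extra loop over its award elements.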


Task 3) Scrape the quotes from http://quotes.toscrape.com/ (all pages) and save the output in CSV, JSON, or any other format you prefer. Each entity should have:

  • text - quote text (required)
  • author - the author of the quote (required)
  • tags - string with the tags of the quote separated by ';' (if any exist)
  • url - the url where you can find the quote

Example of quote entity:

  {
    'text'  : 'The world as we have created it is a process of our thinking. It cannot be changed without changing our thinking.',
    'author': 'Albert Einstein',
    'tags'  : 'change;deep-thoughts;thinking;world',
    'url'   : 'http://quotes.toscrape.com/'
  }
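
One possible shape for Task 3, using only the standard library, is sketched below. It assumes the page markup looks like this: each quote in a div with class "quote", the text in a span with class "text", the author in a small with class "author", tags as links with class "tag", and the next-page link inside an li with class "next". Those selectors should be verified against the live site. The names QuoteParser, parse_page, scrape_all and save_csv are hypothetical.

```python
import csv
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import urlopen

BASE_URL = "http://quotes.toscrape.com/"


class QuoteParser(HTMLParser):
    """Collects the quotes and the next-page link from one page of HTML."""

    def __init__(self, page_url):
        super().__init__()
        self.page_url = page_url
        self.quotes = []
        self.next_url = None
        self._depth = 0        # div nesting depth inside the current quote
        self._field = None     # field being read: "text", "author" or "tag"
        self._in_next = False  # True while inside <li class="next">

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        if tag == "div":
            if "quote" in classes and self._depth == 0:
                self.quotes.append({"text": "", "author": "", "tags": []})
                self._depth = 1
            elif self._depth:
                self._depth += 1
        elif self._depth:  # only read fields that sit inside a quote block
            if tag == "span" and "text" in classes:
                self._field = "text"
            elif tag == "small" and "author" in classes:
                self._field = "author"
            elif tag == "a" and "tag" in classes:
                self._field = "tag"
        elif tag == "li" and "next" in classes:
            self._in_next = True
        elif tag == "a" and self._in_next:
            self.next_url = urljoin(self.page_url, attrs.get("href") or "")

    def handle_data(self, data):
        if self._field is None or not self._depth:
            return
        quote = self.quotes[-1]
        if self._field == "tag":
            quote["tags"].append(data.strip())
        else:
            quote[self._field] += data

    def handle_endtag(self, tag):
        if tag == "div" and self._depth:
            self._depth -= 1
        elif tag in ("span", "small", "a"):
            self._field = None
        elif tag == "li":
            self._in_next = False


def parse_page(html, page_url):
    """Return (rows, next_url) for one page; next_url is None on the last page."""
    parser = QuoteParser(page_url)
    parser.feed(html)
    parser.close()
    rows = [
        {
            # Drop the curly quotation marks that wrap the text on the page.
            "text": q["text"].strip().strip("\u201c\u201d"),
            "author": q["author"].strip(),
            "tags": ";".join(q["tags"]),
            "url": page_url,
        }
        for q in parser.quotes
    ]
    return rows, parser.next_url


def scrape_all(start_url=BASE_URL):
    """Follow the next-page links from start_url and collect every quote."""
    url, rows = start_url, []
    while url:
        with urlopen(url, timeout=30) as response:
            html = response.read().decode("utf-8")
        page_rows, url = parse_page(html, url)
        rows.extend(page_rows)
    return rows


def save_csv(rows, path):
    """Write the rows to a CSV file with the four required columns."""
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=["text", "author", "tags", "url"])
        writer.writeheader()
        writer.writerows(rows)
```

Calling save_csv(scrape_all(), "quotes.csv") would run the full crawl. Tracking the div depth is a deliberate choice: it stops tag links that appear outside a quote block (for example in a sidebar) from being attached to the last quote on the page.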
