
A Python project that scrapes technology news and articles from Trybe's blog


Tech News Scraper 💻

Summary
  1. About the Project
  2. Technologies
  3. Features
  4. How to Execute the Project
  5. Contact

About the Project

Tech News is a Python-based data scraping project that uses Beautiful Soup to scrape Trybe's blog for technology news and articles, and stores them in MongoDB, a non-relational database.
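
As a rough illustration of the scraping step, the sketch below parses a small inline HTML sample with Beautiful Soup and turns each post into a dictionary. The markup, the CSS selectors, the field names and the `parse_news` helper are all assumptions made for this example; they are not taken from Trybe's blog or from the project's code.

```python
from bs4 import BeautifulSoup

# Hypothetical markup: the real structure of Trybe's blog is not shown here.
SAMPLE_HTML = """
<article class="post">
  <h2 class="post-title"><a href="https://example.com/news/1">Python 3 tips</a></h2>
  <time class="post-date">01/02/2023</time>
  <span class="post-category">Tecnologia</span>
</article>
<article class="post">
  <h2 class="post-title"><a href="https://example.com/news/2">Intro to MongoDB</a></h2>
  <time class="post-date">03/02/2023</time>
  <span class="post-category">Carreira</span>
</article>
"""


def parse_news(html):
    """Return one dict per <article class="post"> found in the HTML."""
    soup = BeautifulSoup(html, "html.parser")
    news = []
    for post in soup.select("article.post"):
        link = post.select_one("h2.post-title a")
        news.append({
            "url": link["href"],
            "title": link.get_text(strip=True),
            "timestamp": post.select_one("time.post-date").get_text(strip=True),
            "category": post.select_one("span.post-category").get_text(strip=True),
        })
    return news


news = parse_news(SAMPLE_HTML)
```

A list of dictionaries like this could then be written to MongoDB, for example with PyMongo's `insert_many` on a collection. The database and collection names the project actually uses are not stated here.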


Technologies

  • PYTHON 3
  • MONGODB
  • PYMONGO
  • PYTEST
  • BEAUTIFUL SOUP
  • FLAKE8

Features

  • Collect news and articles from Trybe's blog using web scraping and store them in a MongoDB database
  • Search news by title
  • Search news by date
  • Browse news by category
  • List the top 5 categories with the most news
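
The search and ranking features listed above can be sketched in plain Python. This is only an illustration: the records live in an in-memory list standing in for the MongoDB collection, and the field names, the `dd/mm/YYYY` stored date format and every function name are assumptions, not the project's actual code.

```python
from collections import Counter
from datetime import datetime

# Hypothetical records standing in for documents stored in MongoDB.
NEWS = [
    {"title": "Python 3 tips", "timestamp": "01/02/2023", "category": "Tecnologia"},
    {"title": "Intro to MongoDB", "timestamp": "03/02/2023", "category": "Tecnologia"},
    {"title": "Career paths in tech", "timestamp": "03/02/2023", "category": "Carreira"},
    {"title": "Writing tests with pytest", "timestamp": "10/02/2023", "category": "Ferramentas"},
]


def search_by_title(news, term):
    """Case-insensitive substring match on the title."""
    term = term.lower()
    return [item for item in news if term in item["title"].lower()]


def search_by_date(news, iso_date):
    """Match news published on an ISO date (YYYY-MM-DD).

    strptime raises ValueError on malformed input; the date is then
    converted to the assumed stored format dd/mm/YYYY.
    """
    wanted = datetime.strptime(iso_date, "%Y-%m-%d").strftime("%d/%m/%Y")
    return [item for item in news if item["timestamp"] == wanted]


def search_by_category(news, category):
    """Case-insensitive exact match on the category."""
    category = category.lower()
    return [item for item in news if item["category"].lower() == category]


def top_5_categories(news):
    """Rank categories by number of news items, ties broken alphabetically."""
    counts = Counter(item["category"] for item in news)
    ranked = sorted(counts.items(), key=lambda pair: (-pair[1], pair[0]))
    return [name for name, _ in ranked[:5]]
```

Against a real MongoDB collection, the same lookups would more likely be expressed as queries (for instance a case-insensitive `$regex` filter for the title search) rather than as list comprehensions.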

How to Execute the Project

To run the project locally, follow the steps below.

  1. Check that your machine meets the minimum requirements for the project:
     • Python 3;
     • Docker;
     • Docker Compose version 1.29.2 or later.
  2. Clone the repository.
     git clone https://github.com/RuhamLeal/tech_news_scraper.git
  3. Navigate to the root of the project.
     cd ./tech_news_scraper
  4. Create and activate the virtual environment.
     python3 -m venv .venv
     source .venv/bin/activate
  5. Install dependencies in the virtual environment.
     python3 -m pip install -r dev-requirements.txt
  6. If you don't have MongoDB installed locally, launch it via Docker.
     docker-compose up -d mongodb
  7. Run the following command to access the menu.
     tech-news-analyzer
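
Before following the steps above, the prerequisites can be checked with a small script. The sketch below is not part of the project: the helper names are made up for this example, and the version comparison only handles plain dotted numbers such as `1.29.2`.

```python
import shutil
import sys

# Minimum Docker Compose version stated in the requirements above.
MIN_COMPOSE_VERSION = "1.29.2"


def version_tuple(version):
    """Turn a dotted version string such as '1.29.2' into (1, 29, 2)."""
    return tuple(int(part) for part in version.strip().lstrip("v").split("."))


def compose_is_recent_enough(version, minimum=MIN_COMPOSE_VERSION):
    """Compare versions numerically, so that 1.100.0 counts as newer than 1.29.2."""
    return version_tuple(version) >= version_tuple(minimum)


def check_prerequisites():
    """Report whether Python 3 is running and the Docker tools are on PATH."""
    return {
        "python3": sys.version_info.major == 3,
        "docker": shutil.which("docker") is not None,
        "docker-compose": shutil.which("docker-compose") is not None,
    }


if __name__ == "__main__":
    for tool, ok in check_prerequisites().items():
        print(f"{tool}: {'found' if ok else 'missing'}")
```

The installed Compose version itself can be read with `docker-compose --version` and passed to `compose_is_recent_enough` once the numeric part has been extracted.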

Contact

Ruham Leal
Email: [email protected]
LinkedIn
