GithubHelp home page GithubHelp logo

mleopol1 / web-scraping-mars Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 137 KB

This repository involves web scraping and analyzing various sites about Mars, with deliverables that store and visualize the scraped data.

Jupyter Notebook 100.00%
beautifulsoup matplotlib pandas python web-scraping

web-scraping-mars's Introduction

Web Scraping Mars

This project involves web scraping of Mars news articles and Mars weather data, and performing an analysis on the latter half.

Deliverable 1: Scrape Titles and Preview Text from Mars News

In the part_1_mars_news.ipynb Jupyter notebook, automated browsing is used to visit the Mars NASA news site and extract text elements using a Beautiful Soup object. Titles and preview text of the news articles are extracted, each title-and-preview pair are stored in a Python dictionary, and each dictionary is given two keys: title and preview. Finally, all the dictionaries are stored in a Python list and the list is printed in the notebook.

Deliverable 2: Scrape and Analyze Mars Weather Data

In the part_2_mars_weather.ipynb Jupyter notebook, automated browsing is used to visit the Mars Temperature Data Site and extract data in the HTML table using a Beautiful Soup object. The scraped data is assembled into a Pandas DataFrame with columns having the same headings as the table on the website. The data types are examined and cast to the appropriate types as necessary. Finally, the dataset is analyzed using pandas functions and matplotlib in order to answer and visualize various questions about the data. The dataset is then exported as a csv file into the Resources folder.

web-scraping-mars's People

Contributors

mleopol1 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.