GithubHelp home page GithubHelp logo

fealt / shark-attacks-australia Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 8.03 MB

🦈 'Where in Australia to build a "shark-free" family resort?' – Project 1 @ironhack Data Analytics Bootcamp

Home Page: https://fealt.github.io/shark-attacks-australia/

License: MIT License

Jupyter Notebook 94.82% JavaScript 0.04% CSS 1.01% HTML 4.13%
data-analysis python html-css-javascript ironhack

shark-attacks-australia's Introduction

Data Analytics Bootcamp – Project 1

Where in Australia to build a 'shark-free' family resort?

Lake Tyers Beach

image source


Time's short?

👉 Click here to check my presentation, hosted by GitHub Pages ♡


Shark Attacks – data cleaning and manipulation with Pandas – is my first project at Ironhack's Data Analytics Bootcamp (2021). The given dataset was extremly messy and dirty, so the main pythonic challange here was to have it clean and usable. But before starting to transform beast into beauty I was also challanged to develop a story based on a business question to answer. So – Where in Australia to build a 'shark-free' family resort? – sounds familiar to my background in the construction industry and is also a huge coding job: lots to clean, search for supporting datasets and, of course, fun! 👨🏻‍💻


Project main objectives


▫️ Use storytelling with data to answer a 'business' question.

▫️ Apply different cleaning and manipulation techniques to make a messy dataset usable.

Client


▫️ Shark-free Hotels & Resorts is a 'worldwide to be' hotel chain, since it's missing a branch
  in Australia.

▫️ To date, all of its other houses are built in "safe" beaches – with no sight of sharks.

▫️ Main clientèle – all kinds of families, with/without kids.

Cleaning & Co.


▫️ Cleaned columns include:
  `year`
  `type (provoked/unprovoked)`
  `fatal (y/n)`
  `area`
  `location`
  `sex`
  `age`

▫️ Developed a cleaning strategy for column `location` to get coordinates, applying `GeoPy`.

▫️ To support the analysis, 5 extra datasets were used:
  1. Hotels in Australia, key findings:
    - Top 3 States by number of hotels are New South Wales, Queensland and Victoria.
    - Accomodation rate mean for Australia around 65% and almost all states follow the mean.
  2. Short-term visitors in Australia, key findings:
    - Australia celebrates an incredible growth in short-term visitors over the last 40 years.
    - Over 200% rise from 1990 to 1997 and almost 170% rise from 2010 to 2018.
    - Turist growth in relation to total shark attacks, refer to the Jupyter notebook file for
      more details.
  3. Australia cities database.
  4. Top 20 beaches in Australia (self-made dataset).  
  5. List of beaches in Australia (self-made dataset).


Analysis, Worldwide

▫️ Based on 230 years of available data, Australia is the second country in the world with most shark attacks (1338); behind USA (2229) and before Mexico (579).

▫️ Top 3 countries come up with 65% of all incidents.

▫️ 22% end up deadly.

▫️ Almost 90% of attacked individuals are male.


image source

Analysis, Australia

▫️ In 85% of all fatalities, it was possible to locate the coordinates.

▫️ Number of tourists exploded in the last 40 years: over 200% rise from 1990 to 1997 and almost 170% from 2010 to 2018.

▫️ So did shark attacks, top 10 years included: 2016, 2015, 2014, 2009, 2012, 2017 and 2018.

▫️ For a "small" State, Victoria has almost 20% of total Hotels in Australia. And the best room occupancy rate among them all, over 70%.

▫️ Among the top 3 States, only in Victoria there haven't been any deadly shark attacks in the past 40 years.

▫️ Lake Tyers Beach is a top 20 beach in Australia!


image source

Conclusion


▫️ Despite a full coast with shark attacks, there is a `shark free` area in Victoria State.

▫️ It's named `Lake Tyers Beach` and is also a top 20 beach in Australia! Ranked #16.

▫️ Therefore a `safe` place for `Shark-free Hotels & Resorts` to start hosting in Australia.



Deliverable files in this repository

  • Cleaned final dataset (./assets): shark_au_df.csv
  • Data analysis in a Jupyter notebook: project_01_shark_attack.ipynb
  • Formal presentation – Storytelling with data –, done in HTML5 CSS3 JavaScript : check it here

Data

  1. Given dataset

  2. Extra datasets used

  3. Created datasets based on

    • List of beaches in Australia: Wikipedia.
    • Brad Farmer's 101 Best Australian Beaches: Traveller.

Tech

  • Python @ Jupyter Notebook
  • Pandas / Numpy
  • Geopy / Nominatium (Python client for geocoding)
  • Viz: seabron / plotly


shark-attacks-australia's People

Contributors

fealt avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.