GithubHelp home page GithubHelp logo

cw_city_growth's Introduction

City Growth

Introduction

City growth can be attributed to many things, and infrastructure development is a heavy indicator of such activity. Building Permit, and Water usage data were included to proxy infrastructure growth. In a similar light, the economic growth can be modelled using Employment, Housing Value, School Enrollment. General city growth may be attributed to the classic Population increase, although this type of growth alone doesn’t imply successful growth. Do note that the antecedent can be made for city shrinkage below:

  • Building Permits
    • Infrastructure growth: Good marker of city growth as they mean growth for commercial and residential communities
  • Employment Rate
    • Economic growth: This particular point describes the usage rate of the workforce -- more growth would incentive immigration and later tax revenue growth
  • Flight Destinations
    • May be a good indicator of a city’s growth as proxied by demand as a destination city (although this may be due to tourism)
  • Housing Values
    • Housing prices are a great indicator of a community's demand
  • Population
    • Classic descriptor of city growth, but can be used in congruence with other data points
  • School Enrollment
    • Economic growth: more students enrolled leads to a more highly educated community which can be a proxy for a more in-tune workforce in the future
  • Water Usage
    • Infrastructure growth: Ex) Can be used in association with Population data to proxy water efficiency

Requirements

All data in this directory was obtained using either Python scripts or wget from Linux’s command line. The following libraries will be needed for the scripts to function:

  • Python3
    • BeautifulSoup4
    • json
    • requests
    • time
  • Linux
    • wget

To do

Given more time, I would gather more data on infrastructure usage/spending over metropolitan cities (electricity, oil/gas, sewage). I would also want to know how much a city spends on their roads and the average commute time of its citizens. Another avenue to take is to use the urbansprawl package to quantify the physical expansion of cities into surrounding areas.

Technologically, I would like to expand on the Flight data scraping mechanism to get data all the way back to 2004 manipulating the body of a POST request.

Metadata

Building Permits

Obtained from the Census Bureau’s Building Permits Survey using the permits.py script. Building permit data was downloaded by value of property (tb3vXXXXYY.txt), and by number of units approved (tb3uXXXXYY), where XXXX is the year and YY is the month. The text files can be parsed using Python’s stripping functionality.

Time range: 2004-01 – 2019-03

Employment

Obtained from the Bureau of Labor Statistics using wget on Linux. This data set quantifies the Civilian Workforce, Employed, Unemployed, Unemployment Rate by major Metropolitan areas on a monthly-basis. The data is stored in a text file, which can later be parsed with Python’s stripping functionality for processing in a data frame.

Time range: 1990-01 – 2019-03

Flights

Flight data was scraped from the Bureau of Transportation Statistics database using wget. The data describes the monthly routes taken by airlines on a monthly basis. File type: CSV

Housing Values

Data acquired from Zillow on 2019-05-20 using the values.py Python script. Aggregated data on this page is made freely available by Zillow for non-commercial use. The data is stored in either csv or xlsx files, easily parsable formats.

Time range: 2010-01 – 2019-04

Population

Obtained from the Census Bureau using wget. This data set quantifies the population over metropolitan areas annually over an 8-year period (2010-2018). The data is in the csv format, which can easily be parsed into manipulatable data.

School

Obtained from the National Center for Education Statistics using the school.py script. The data shows school enrollment statistics from 1990 and projects figures until 2027 by State and Geographic region.

Water

Obtained from the U.S. Geological Survey using the water.py script. The data shows water usage statistics at a county-level from 1985 to 2015, every 5 years. The data is stored in text files, which can be parsed with Python’s strip function.

cw_city_growth's People

Contributors

adamburns avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.