GithubHelp home page GithubHelp logo

pysalscipy's Introduction

Geographic Data Science with PySAL and the pydata stack

Bio

Sergio Rey is professor of geographical sciences and core faculty member of the GeoDa Center for Geospatial Analysis and Computation at the Arizona State University. His research interests include open science, spatial and spatio-temporal data analysis, spatial econometrics, visualization, high performance geocomputation, spatial inequality dynamics, integrated multiregional modeling, and regional science. He co-founded the Python Spatial Analysis Library (PySAL) in 2007 and continues to direct the PySAL project. Rey is a fellow of the spatial econometrics association and editor of the journal Geographical Analysis.

Dani Arribas-Bel is Lecturer in Geographic Data Science and member of the Geographic Data Science Lab at the University of Liverpool (UK). Dani is interested in undestanding cities as well as in the quantitative and computational methods required to leverage the power of the large amount of urban data increasingly becoming available. He is also part of the team of core developers of PySAL, the open-source library written in Python for spatial analysis. Dani regularly teaches Geographic Data Science and Python courses at the University of Liverpool and has designed and developed several workshops at different levels on spatial analysis and econometrics, Python and open source scientific computing.

Prerequisite skills

The workshop is based on a courses taught to Geography undergraduates with no previous programming experience, but compressed into the appropriate length. Some statistical background and familiarity with Python are recommended to follow along comfortably, but there is no requirement to be knowledgeable in the Python geospatial stack.

Description of the tutorial

This two-part tutorial will first provide participants with a gentle introduction to Python for geospatial analysis, and an introduction to version PySAL 1.11 and the related eco-system of libraries to facilitate common tasks for Geographic Data Scientists. The first part will cover munging geo-data and exploring relations over space. This includes importing data in different formats (e.g. shapefile, GeoJSON), visualizing, combining and tidying them up for analysis, and will use libraries such as pandas, geopandas, PySAL, or rasterio. The second part will provide a gentle overview to demonstrate several techniques that allow to extract geospatial insight from the data. This includes spatial clustering and regression and point pattern analysis, and will use libraries such as PySAL, scikit-learn, or clusterpy. A particular emphasis will be set on presenting concepts through visualization, for which libraries such as matplotlib, seaborn, and folium will be used.

Detailed outline

Part I

  1. Software and Tools Installation (10 min)

  2. Spatial data processing with PySAL (45 min)

    a. Input-output

    b. Visualization and Mapping

    c. Spatial weights

  3. Exercise (10 min.)

  4. ESDA with PySAL (45 min)

    a. Global Autocorrelation

    b. Local Autocorrelation

    c. Space-Time exploratory analysis

  5. Exercise (10 min)

Part II

  1. Spatial clustering a (30 min)

    a. Geodemographic analysis

    b. Regionalization

  2. Exercise (10 min)

  3. Spatial Regression (30 min)

    a. Overview

    c. Basic spatial regression: spatial lag and error model

  4. Exercise (30 min)

  5. Point Patterns (30 min)

    a. Kernel Density Estimation visualization

    b. Centrography and distance based statistics

  6. Exercise (10 min)

Installation materials

Participants should have installed the following dependencies:

  • Anaconda or MiniConda Python distributions for Python 2.7. See installation instructions on the links.

  • git

  • A conda environment loaded with all the dependencies can be installed by running the pydata.sh script available as part of the envs repository (Github link). To install it, follow these instructions:

    • Clone the repository on your machine:

    > git clone https://github.com/darribas/envs.git

    • Navigate into the folder:

    > cd envs

    • Run the script:

    > bash pydata.sh

Background materials

An extensive collection of noteboks and slides related to this tutorial may be found in the following url's:

pysalscipy's People

Contributors

sjsrey avatar darribas avatar sjrey2 avatar

Stargazers

 avatar

Watchers

 avatar  avatar James Cloos avatar  avatar

Forkers

darribas

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.