GithubHelp home page GithubHelp logo

kmyles82 / stl-crime-data Goto Github PK

View Code? Open in Web Editor NEW

This project forked from openstl/stl-crime-data

0.0 0.0 0.0 70.37 MB

A repo to scrape and store the crime data from the St. Louis Metropolitan Police Departments website.

Jupyter Notebook 100.00%

stl-crime-data's Introduction

stl-crime-data

Overview

This repo serves as a place for the St. Louis Crime Data from the St. Louis Metropolitan Police Departments website. The data extends back to January 2008 and goes through to the present. There are scripts to reproduce our scraping process and cleaning process. While we provide a cleaned version of the data, the raw files are provided along with our scripts so anyone can reproduce the data set.

Files

raw_data: The raw data files scraped from the website. Raw data is comma seperated.

clean_data: The dataset cleaned and enriched with additional information from other data sets. Cleaned data is tab-delimited.

CrimeDataFrequentlyAskedQuestions.pdf: The FAQ from SLMPD's website describing the raw_data sets.

UCRCodes.csv: Lookup table for the 26 types of crimes as defined by the FBI using the Uniform Crime Repoting (UCR) Codes (mentioned in the FAQ above but completed with some digging through FBI's website and the raw_data).

neighborhood_lookup.csv: Lookup table for to enrich the data with names of neighborhoods found in the FAQ above.

scraping_stl_crimedata.ipynb: IPython notebook used to scrape the raw_data from the website.

cleaning_stl_crime_data.ipynb: IPython notebook used to transform the raw_data into clean_data.

Data Dictionary for clean_data (raw_data is described in FAQ above)

Column Format Description
FileName string File name correspoding to the raw_data file where the row originated from
CADAddress string Address number where incident occured as reported by 911 caller. Not as accurate as the corresponding ILEADSAddress
CADStreet string Stree name where incident occured as reported by 911 caller. Not as accurate as the corresponding ILEADSAddress
CodedMonth string YYYY-mm ; Year and month reported for some fields
Complaint string Number assigned to incident (possibly unique?)
Count int64 Integer representing how the crime affects the total crime number. Summing on this column will give you the total number of crimes
Crime string 6-digit crime code, as defined by Uniform Crime Reporting guidelines
ShortCrimeCode string First two digits in 'Crime'
UCRType int64 1 or 2, denotes the type of crime as denoted by the Uniform Crime Reporting guidelines. Type 1 crimes are typically much worse than Type 2 Crimes
UCRCrime string High Level crime categories
DateOccured string Date the incident was reported to have occured
Description string Description of the crime described by 'Crime' column
District int64 1-9; indicates which police district a crime was reported in
FlagAdministrative string Y/N flag to denote an administrative edit (usually to the crime code)
FlagCleanup string Y/N flag to denote some sort of cleanup (never been flagged as of October 2015)
FlagCrime string Y/N flag to denote the incident as a new crime (versus reporting a crime that happened in the past)
FlagUnfounded string Y/N flag to denote if the incident was unfounded
ILEADSAddress string Address number logged on official police report
ILEADSStreet string Street name logged on official police report
LocationComment string Additional comments about the location
LocationName string Additional information to help identify location (St. Louis Zoo, Scottrade Center, etc.)
Neighborhood int64 Number representing neighborhood incident was reported in
NeighborhoodName string Neighborhood name, matched from 'Neighborhood'
NeighborhoodPrimaryDistrict float64 Primary police district the neighborhood lies in
NeighborhoodAddlDistrict float64 Additional police districts that the neighborhood might be in
Latitude float64 Latitude coordinates in WSG84
Longitude float64 Latitude coordinates in WSG84

stl-crime-data's People

Contributors

kylesykes avatar mattpitlyk avatar dgarbuz avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.