Jeffrey Guntzel's Projects
Django app to consume and store 990 data and metadata
metadata describing the 990 xml release, to be used by 990-xml-reader and related projects
IRSx: Turn the IRS' versioned XML 990 nonprofit annual tax returns into standardized python objects, json, or human readable text with original line number and description.
A Python data analysis library that is optimized for humans instead of machines.
A script for Adobe Illustrator that converts your Illustrator artwork into an html page.
machine learning/artificial intelligence notes
Export Airtable data to YAML, JSON or SQLite files on disk
OS X menubar status indicator
An exhaustive reference to problems seen in real-world data along with suggestions on how to resolve them.
D-Lab's 3 hour introduction to basic Bash commands and using version control with Git and Github.
Download audio from youtube-dl sources and import into beets
Documentation repository for Bunch.app
Download U.S. census data and reformat it for humans
Customizable cheat sheet system for OS X
Themes compatible with MultiMarkdown Composer 5 and nvUltra
This course is a rigorous, year-long introduction to computational social science. We cover topics spanning reproducibility and collaboration, machine learning, natural language processing, and causal inference. This course has a strong applied focus with emphasis placed on doing computational social science.
A new version of the cook county jail scraper, inspired by the Supreme Chi-Town Coding Crew
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
Python CLI tool and library for diffing CSV and JSON files
Based on shell scripts authored by Jeff Severns Guntzel to help data journalists use Christopher Groskopf's 'csvkit' utility library to audit a csv file without opening it, and then backup and move the the csv file to a project directory before working on it.
A suite of utilities for converting to and working with CSV, the king of tabular file formats.
A collection of the work of Edward Estlin Cummings, as it enters the public domain.
This repository serves as a template for all D-Lab workshops stored on GitHub. Use this template if you're creating a new D-Lab workshop.
Data and code behind the articles and graphics at FiveThirtyEight
For students of https://projects.propublica.org/graphics/ida-propublica-data-institute
Materials for D-Lab / UC Berkeley Graduate Division's Data Science for Social Justice summer workshop. These materials provide an introduction to Python, natural language processing, text analysis, word embeddings, and network analysis. They also include discussions on critical approaches to data science to promote social justice.
An open source multi-tool for exploring and publishing data
A highly-opinionated Xcode project template to build a new macOS app with.