GithubHelp home page GithubHelp logo

mw10104587 / data2-philanthropy Goto Github PK

View Code? Open in Web Editor NEW
3.0 1.0 2.0 14.58 MB

A project working on NYC cultural institutions' philanthropy data

Python 0.14% Jupyter Notebook 99.68% HTML 0.07% PHP 0.01% JavaScript 0.10% CSS 0.01%

data2-philanthropy's Introduction

data2-Philanthropy

This is a project about 20 Cultural Institutions' philanthropy data in New York City. We tried to find out the cross giving, which a board member of museum A donates to museum B, and the board member of museum B donates back. The project is initiated in the class of Mark Hansen's data II in Columbia University.

Download Full Story Here

Data

We collected the data from the following 20 Institutions, said to be the richest institutions based on ranking by Crain’s New York.
  • Carnegie Hall
  • Lincoln Center
  • The Metropolitan Museum of Arts
  • The Metropolitan Opera
  • Museum of Modern Art
  • New York Philharmonic
  • American Museum of Natural History
  • Museum of Arts and Design
  • Brooklyn Public Library
  • Frick Collection
  • New York Public Library
  • 9/11 memorial
  • Cooper Hewitt/Smithsonian
  • New York Transit Museum
  • The Morgan Library and Museum
  • WNET (Thirteen, WLIW, etc.)
  • 92 Street Y
  • Roundabout Theatre Company
  • WNYC
  • Brooklyn Academy of Music

We extracted donor names and amounts from their annual report, by using an online service pdftable, and turned them into a 42,955 rows csv file.

Tools

- R - Python - d3.js - Gephi

Some Results

Percentage of Board Member's donation

Each bar represents the percentage of an institution's total donation income that was provided by other institution's board members. Lincoln Center benefited most from cross-giving.

Donors connections

Each cluster represents an institution. The inner group of circles at the center of each institution represents the institution's board members. Blue arrows represent donations. The outer circles of each cluster represent the institution's most powerful donors. The red arrows call attention to donors who appear in multiple clusters--donors who donate to multiple institutions. This network shows how big each institution's donor network is and which institution's share similar donor bases

Cross Giving Chart

Institutions are along the circumference according to the percentage of board members who donated to them. Colored chords connecting two institutions represent reciprocal relationships (the institutions have donated to each other through their board members). Gray chords connecting two institutions represent non-reciprocal relationships (one institution gave to the other, but the other did not give back). Lincoln Center, 9/11 Memorial and Mets are the largest player in the backscratching game

Institution Board Member Donation

*Each large orange circle represents an institution. The smaller circles clustered around it are that institution's board members. The blue arrows represent donations that board members made to institutions other than the one to which they belong. These donations are made in the board member's name. The green arrows represent donations from an organization, foundation, company or other group affiliated with a board member made to an institution to which the board member does not belong." *

###Code

#####Python/Parse pdf to csv File

python ParsePDF2CSV.py file.csv stackType Year Institution [Fund Type]

We extract donors data with pdf table, and download excel from the website. Some times when the names of donors are long, like: Jackie Harris Hochberg and Robert J Hochberg, it often turns into two row.

Donors
Jackie Harris Hochberg
and Robert J Hochberg

We manually move the second half of the data back to the first row, for all of these long donor names, and use this ParsePDF2CSV.py file to make it a csv file, adding year, institution and fund type into the data, trying reduce manual effort for journalists.

Year Donor Amount Type Institution
2013 The Starr Foundation "$25,000,000" 9/11 Memorial
2013 Bank of America "$20,000,000" 9/11 Memorial
2013 Bloomberg Philanthropies "$15,000,000" 9/11 Memorial
2013 JPMorgan Chase Co "$15,000,000" 9/11 Memorial

This is a command line tool that tackles with three kind of format.

  1. par: the amount donation of each donor is listed parrallel to the donors name
  2. stack: the amount of donation of each donors is stacked on the same column
  3. stack-with-fundtype: the amount of donation is stacked with donors, and the fund type is parrallel to donors list.

#####Python/Amount Parser File

Different institutions have different way of showing the amount of donation in their annual reports. We parse different format of donation amount into an upper bound and a lower bound (some of them are ranges). We also classify them into 5 classes according to their amoun of donation to simplify our visualization.

Class Amount
1 1,000,000+
2 750,000 +
3 500,000 +
4 250,000 +
5 249,999 -

#####Python/ Generate Node Graph Data File

python GenerateNodeGraphData.py donors-list.csv boards.csv smallest_donation_class>

This file takes the a donor list of 20 institutions and a list of all board members, and outputs two file for plotting a node graph, one node.csv and a edge.csv. The class for amount of donations to be taken into consideration should also be specified in the command. The classes are mentioned in the last paragraph.

Future Plan

  1. Currently there are a lot of donors who donate together, like Jackie Harris Hochberg and Robert J Hochberg, but we still take them as one person. We plan to parse them and split them into different rows in the future.
  2. Currently there are similar names for companies and people, we plan to implement Fuzzy String matching.
  3. Visualize data into web portable format. (Now on Gephi)
  4. We plan to interview members of several boards, and philanthropy experts to understand more insight about this phenomenon.

data2-philanthropy's People

Contributors

mw10104587 avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar

Forkers

eagles0607

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.