kmcelwee / fortune-100-blm-report

Analysis of Fortune 100 Twitter data

Home Page: https://kmcelwee.com/fortune-100-blm-report/site/

Python 2.29% HTML 40.55% JavaScript 2.76% CSS 2.05% Jupyter Notebook 52.34%
blm-dataset fortune-500 corporate-social-responsibility twitter-api blm blm-tweets

fortune-100-blm-report's Introduction

Hello! My name's Kevin 👨🏻‍💻

I earned a degree in Music at Princeton in 2018 and soon after discovered programming through data journalism. For two years, I worked for electric utilities, building models that increase efficiency on the grid; then for two years, I returned to Princeton as a programmer on staff, writing software for professors and students. I moved to Boston in 2022 and now work at AWS, helping build a cloud-based file system called FSx.

Check out my repos that explore...

⚡ The Energy Sector

Partnered with rural utilities, national labs, and the Department of Energy, I studied how deep learning can be used to predict energy consumption. Increasingly accurate energy predictions can help utilities save a lot of money through a process called "peak shaving." Read about my deep learning research.

After utilities showed more interest in my research, I partnered with Burt County Public Power and NRECA to build a web app that automatically draws in weather data and load data every day to generate a forecast. Check out the product we built.

📈 Data Journalism

🤖 Twitter bots

fortune-100-blm-report's People

Contributors

kmcelwee

Forkers

dzoladz

fortune-100-blm-report's Issues

Rounding errors found in IDs

The oEmbed process revealed IDs that had been rounded:

1265004698148495400
1265812305817862100
1266080125805834200

These rounded IDs are present in the dataset in fortune-100-blm-dataset. I believe I manually entered values for Lowe's because of API limits, so that might be the cause.

  • Request oembed for all tweets ending in 00 to double check that it's limited to these tweets.
  • Dig into fortune-100-blm-dataset repo and double check scripts.
  • Pandas automatically reads the ID column as an integer. Confirm that this doesn't cause issues.
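
The three IDs above all end in two zeros after 17 significant digits. That pattern is consistent with an ID that passed through a 64-bit float at some point (for example via JavaScript or a spreadsheet export), although the issue does not confirm that this is what happened. A minimal sketch of a check that goes a step beyond "ends in 00", using only the standard library; `looks_float_rounded` is a hypothetical helper name, not something from this repo:

```python
from decimal import Decimal

# Above 2**53, a 64-bit float can no longer represent every integer exactly.
MAX_SAFE_INTEGER = 2**53


def looks_float_rounded(tweet_id: int) -> bool:
    """Heuristic: True if tweet_id equals the digits a float64 would print.

    A genuine ID can match by coincidence, so treat a True result as
    "worth re-checking", not as proof of rounding.
    """
    if tweet_id <= MAX_SAFE_INTEGER:
        return False  # small integers survive a float round trip unchanged
    # repr() yields the shortest decimal string that round-trips the float;
    # expanding it back to an integer pads the lost digits with zeros.
    printed = int(Decimal(repr(float(tweet_id))))
    return printed == tweet_id


# The IDs quoted in this issue.
suspect_ids = [1265004698148495400, 1265812305817862100, 1266080125805834200]
flagged = [i for i in suspect_ids if looks_float_rounded(i)]
```

On the Pandas side, one option is to sidestep the question by keeping IDs as text: `pandas.read_csv` accepts a `dtype` argument such as `dtype={"id": str}`, where the column name `id` is an assumption about the dataset's layout.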

Improve OEmbed error handling

  • Create a file reviewed-oembed-errors.txt that lists IDs reviewed in #14.
  • Update collect-twitter-oembeds.py to print out errors that are new.
  • Update collect-twitter-oembeds.py to include returned messages with "error" to be included in the "error" field.
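
The first two items could fit together roughly as follows. This is a sketch under assumptions: that reviewed-oembed-errors.txt holds one tweet ID per line, and that the script collects errors in a dict mapping ID to message. Neither format is confirmed by the repo, the function names are hypothetical, and the IDs and messages below are placeholders.

```python
from pathlib import Path
from typing import Dict, Set


def parse_reviewed_ids(text: str) -> Set[str]:
    """Parse reviewed tweet IDs, one per line (assumed file format)."""
    # IDs stay as strings so they can never be rounded; blank lines are skipped.
    return {line.strip() for line in text.splitlines() if line.strip()}


def load_reviewed_ids(path: str) -> Set[str]:
    """Load reviewed IDs from disk; a missing file means nothing is reviewed yet."""
    file = Path(path)
    if not file.exists():
        return set()
    return parse_reviewed_ids(file.read_text(encoding="utf-8"))


def new_errors(errors: Dict[str, str], reviewed: Set[str]) -> Dict[str, str]:
    """Keep only the errors whose tweet ID has not been reviewed before."""
    return {
        tweet_id: message
        for tweet_id, message in errors.items()
        if tweet_id not in reviewed
    }


# Placeholder data, not real tweet IDs or real API messages.
reviewed = parse_reviewed_ids("111\n\n222\n")
errors = {"111": "Not found", "333": "Forbidden"}
unreviewed = new_errors(errors, reviewed)
for tweet_id, message in unreviewed.items():
    print(f"New oEmbed error for {tweet_id}: {message}")
```

Printing only the unreviewed entries keeps each run's output limited to errors that still need a manual look.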

Create legend

This isn't straightforward in D3. Not sure how best to move forward.

Highlight by tag

Create an interface that highlights (with a large yellow circle?) the tweets with the selected tags.

Twitter screenshots useful?

Some of the media within tweets could be useful, and it might be a last-ditch way to get Twitter-like tooltips.

Spike: How to make tweet histograms larger

We have both the static images and the interactive visualizations. For static images, we can place the legend above the histogram instead of to the right. For dynamic images, see how we can increase circle size, reduce margins, and ensure that the graph is taking up the full given width. Also consider widening default text block width.

Wrap report better

Add a container within the report to hold paragraphs and headers? Otherwise it's just explicitly declaring the width of headers, paragraphs, and images.

Spike on publishing options

Isn't combining markdown with charts what Jupyter was made for? But I want a friendlier medium to any reader... so a proper website? I don't want to invest so much time in that! I could generate a website from Markdown which points to the files generated here. The only concern is interactive graphs, but D3 has certainly been used in GitHub pages elsewhere.

Double check every tweet

After finishing the data visualization, run a final pass through all tweets to ensure consistency.
