GithubHelp home page GithubHelp logo

pecanstreet-load's Introduction

Dataport Pecan Street dataset

The provided scripts can be used to download and exploratively analyze the Dataport Pecan Street dataset. The data is provided by Dataport and Pecan Street Inc., and a key to access the data must be obtained from https://dataport.cloud/.

Python setup

To run the scripts successfully it is remommented to use anaconda to recreate the environment, as there have been issues with different version of the DB drivers.

See here, for more information about anaconda environments.

Run the following command after installing anaconda:

conda env create -f environment.yml

Then use either of the following commands to activate the environment:

On Windows:

activate pecanstreet 

On Linux:

source activate pecanstreet 

Then, run the following scripts within the environment.

download_pecanstreet.py

The script is currently setup to download the data for 2 years (2015-2016) for Boulder and Austin (the largest cities with available data) and also downloads the associated weather data. It fills holes of 3 hours by linear interpolation and holes larger than 3 hours to up a day by using the values of 24 hours before. If still N/A values are existing, the ID is dropped from the final results (resulting in 200 households for Austin and 20 households for Boulder).

Resulting files output in *.csv format.

Datetime format is YYYY-MM-DD hh:mm:ss±hh:mm, where the part "±hh:mm" is the offset of UTC, hence the time is local time.

Weather

Weather data is ouput per city and is of format:

  • temperature (in °C)
  • apparent_temperature (in °C)
  • dew_point
  • humidity
  • visibility
  • pressure
  • wind_speed
  • cloud_cover
  • wind_bearing
  • precip_intensity
  • precip_probability

Load

The output load files are one for each configured aggregation level. For single households one column per household is created and the id is the id used by the Pecanstreed database (see table university.metadata for properties of the households), for aggregations, which are created pseudo-randomly (with a seed), a increasing id is given. Load is in average kW per interval. The files are created for 15-min and hourly intervals.

analyze_pecan_data.py

For each created load file, the script creates the following plots:

  • Autocorrelation Plot (ACF)
  • Scatter plot showing the correlation of temperature and load
  • the trace of a week

pecanstreet-load's People

Watchers

James Cloos avatar

Forkers

somniferume

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.