
full-stack-data-engineer's Introduction

So You Want To Be A Data Engineer?

UPDATES COMING

With Codespaces available and my website coming along, I plan to take a different approach to this content. It is on hold for now, but I will revisit it as a series of codespaces that can be worked through to develop core DE skills. Codespaces and online data sources make it easy to practice on any machine, from anywhere.

If anyone is interested in helping me make this kind of content available, please feel free to ping me here. I welcome the support!


What does it mean to be a full stack data engineer? Just as a full stack developer handles both backend and frontend development for a website, a full stack data engineer develops for a data platform end to end, from data collection through to consumption.

This training material covers that work in four sections.

  1. Data collection, tools, and pipelines
  2. Storage systems and data modeling
  3. Algorithms for analytics and dimensional modeling
  4. Data consumption, reporting, and dashboarding

Data Sets

We will use a number of large data sets throughout this build. If you cannot use the full referenced data set, a sample will be included here. Simply copy it to your data folder and proceed accordingly.

Brent Ozar hosts a 10 GB Stack Overflow data set (1 GB download) at https://downloads.brentozar.com/StackOverflow2010.7z
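The copy step described above could be scripted along these lines. This is only a minimal sketch, not part of the training material itself: the function name `stage_sample`, the default folder name `data`, and the file name in the usage note are all assumptions made for illustration.

```python
import shutil
from pathlib import Path


def stage_sample(sample_path, data_dir="data"):
    """Copy a sample file into the data folder, creating the folder if needed.

    Both the function name and the default folder name "data" are
    hypothetical; adjust them to match your own project layout.
    """
    source = Path(sample_path)
    if not source.is_file():
        raise FileNotFoundError(f"sample file not found: {source}")

    target_dir = Path(data_dir)
    target_dir.mkdir(parents=True, exist_ok=True)

    # copy2 copies the file contents and preserves metadata such as timestamps
    return Path(shutil.copy2(source, target_dir / source.name))
```

Calling `stage_sample("samples/posts_sample.csv")` (a made-up path) would place a copy of that file under `data/` and return the new location.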

This is very much an MVP at the moment and will be updated regularly until version 1.0.

version 0.1.0

Contributors

bryangoodrich
