GithubHelp home page GithubHelp logo

hieutrluu / airflow-summit-airbyte-2022 Goto Github PK

View Code? Open in Web Editor NEW

This project forked from airbytehq/airflow-summit-airbyte-2022

0.0 0.0 0.0 36 KB

git push your data stack with Airbyte, Airflow, and dbt - 2022 Airflow Summit

Home Page: https://www.airbyte.com

Shell 58.26% Python 34.57% Dockerfile 7.17%

airflow-summit-airbyte-2022's Introduction

git push your data stack with Airbyte, Airflow and dbt

For Airflow Summit 2022, by @marcosmarcm and @evantahler from Airbyte

CI

What:

Links:

This project configures a sample data stack orchestrated by Airflow, using Airbyte to Extract and Load data, and dbt to Transform it.

Running Locally:

  1. Install Docker Desktop and Python 3 (if you are on MacOS, you already have Python 3).

  2. Create {HOME}/.octavia and add the following credentials for using a local postgres database managed by Docker:

POSTGRES_HOST=host.docker.internal
POSTGRES_PASSWORD=password
POSTGRES_USERNAME=demo_user
POSTGRES_DATABASE=postgres
  1. Create the profile dbt in {HOME}/.dbt/profiles.yaml
config:
  partial_parse: true
  printer_width: 120
  send_anonymous_usage_stats: false
  use_colors: true
normalize:
  outputs:
    prod:
      dbname: postgres
      host: host.docker.internal
      pass: password
      port: 5432
      schema: public
      threads: 8
      type: postgres
      user: demo_user
  target: prod
  1. Run the whole data stack using ./tools/start.sh. This will install local requirements (PyYAML) and run everything though Docker. The script will exit when complete, but the Docker containers will remain running.

In your browser:

  • Visit http://localhost:8080/ to see the Airflow UI (user: airflow, password: airflow) and your completed DAG.
  • Visit http://localhost:8000/ to see the Airbyte UI and your completed Sync.
  • Visit your local postgres database (localhost:5432) with the username=demo_user and password=password to see the staged and transformed data.

Shut it down

Run ./tools/stop.sh to stop the Docker containers.

Testing

This repository is tested using Github Actions.

airflow-summit-airbyte-2022's People

Contributors

evantahler avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.