Data lineage made simple. Grai makes it easy to understand how your data relates together across databases, warehouses, APIs and dashboards.
- Pre-built connectors. Automatically synchronize lineage from across the stack so your metadata is never out of date.
- Centralized data tests. Write data validation tests which run whenever upstream data sources change (coming soon).
- Integrated with git. Run data validation tasks as part of your CI/CD process to test changes everywhere you data is used. No coordination meetings required.
- Your data, your cloud. Grai is fully open sourced and self hosted. You maintain full control over your data and hosting environment.
git clone https://github.com/grai-io/grai-core
cd grai-core/grai-server
docker compose up
The server should now be available at http://localhost:8000/admin.
Default login credentials:
username: [email protected]
password: super_secret
You can also explore the API backend at http://localhost:8000/docs.
Check out this guide for a walk through populating lineage using the postgres connector.
Although you'll need to provide your own postgres container you can also get started with a basic image of Grai is
avaialable at ghcr.io/grai-io/grai-core/grai-server
.
Programmatically interact with your data lineage from the command line using the grai-cli.
pip install grai-cli
Provides programmatic access to the data lineage server.
pip install grai-client
Dynamically interact with the data lineage graph for your organization.
pip install grai-graph
Postgres:
pip install grai-source-postgres
DBT (coming soon):
pip install grai-source-dbt
More to come here but if you're interested in contributing or just want to chat, drop an email to [email protected].
Or join our discord!