GithubHelp home page GithubHelp logo

davidmr001 / wherehows Goto Github PK

View Code? Open in Web Editor NEW

This project forked from datahub-project/datahub

0.0 2.0 0.0 15.54 MB

Data Discovery and Lineage for Big Data Ecosystem

Home Page: https://github.com/linkedin/WhereHows/wiki

License: Apache License 2.0

Java 34.72% Shell 0.07% Dockerfile 0.07% Python 37.69% JavaScript 3.94% TypeScript 17.02% HTML 3.49% Nearley 0.08% CSS 2.92%

wherehows's Introduction

WhereHows Build Status latest Gitter PRs Welcome

WhereHows is a data discovery and lineage tool built at LinkedIn. It integrates with all the major data processing systems and collects both catalog and operational metadata from them.

Within the central metadata repository, WhereHows curates, associates, and surfaces the metadata information through two interfaces:

  • a web application that enables data & linage discovery, and community collaboration
  • an API endpoint that empowers automation of data processes/applications

WhereHows serves as the single platform that:

  • links data objects with people and processes
  • enables crowdsourcing for data knowledge
  • provides data governance and provenance based on ownership and lineage

Who Uses WhereHows?

Here is a list of companies known to use WhereHows. Let us know if we missed your company!

How Is WhereHows Used?

How WhereHows is used inside of LinkedIn and other potential use cases.

Documentation

The detailed information can be found in the Wiki

Examples in VM (Deprecated)

There is a pre-built vmware image (about 11GB) to quickly demonstrate the functionality of WhereHows. Check out the VM Guide

WhereHows Docker

Docker can provide configuration free dev/production setup quickly, please check out Docker Getting Start Guide

Getting Started

New to Wherehows? Check out the Getting Started Guide

Preparation

First, please setup the metadata repository in MySQL.

CREATE DATABASE wherehows
  DEFAULT CHARACTER SET utf8
  DEFAULT COLLATE utf8_general_ci;

CREATE USER 'wherehows';
SET PASSWORD FOR 'wherehows' = PASSWORD('wherehows');
GRANT ALL ON wherehows.* TO 'wherehows'

Execute the DDL files to create the required repository tables in wherehows database.

Build

  1. Get the source code: git clone https://github.com/linkedin/WhereHows.git
  2. Put a few 3rd-party jar files to wherehows-etl/extralibs directory. Some of these jar files may not be available in Maven Central or Artifactory. See the download instrucitons for more detail. cd WhereHows/wherehows-etl/extralibs
  3. From the WhereHows root directory and build all the modules: ./gradlew build
  4. Start the metadata ETL and API service: ./gradlew wherehows-backend:runPlayBinary
  5. In a new terminal, start the web front-end: ./gradlew wherehows-frontend:runPlayBinary. The WhereHows UI is available at http://localhost:9001 by default. You can change the port number by editing the value of project.ext.httpPort in wherehows-frontend/build.gradle.

Roadmap

Check out the current roadmap for WhereHows.

Contribute

Want to contribute? Check out the Contributors Guide

Community

Want help? Check out the Gitter chat room and Google Groups

wherehows's People

Contributors

alyiwang avatar arkadiuszosinski avatar camelliazhang avatar chrisjamesc avatar cptran777 avatar czbernard avatar dmoore247 avatar drgrove avatar ericsun2 avatar hangzhang925 avatar igbopie avatar jdaghlian-fit avatar jerrybai2009 avatar jywadhwani avatar kiranalda avatar mars-lan avatar msknapp avatar newsummit avatar nvemuri1 avatar richardxin avatar shpark76 avatar shridharsattur avatar simplesteph avatar sunzhaonan avatar theseyi avatar tsukaby avatar visokoo avatar wenhuaopenx avatar zhangloo333 avatar ziveo avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.