GithubHelp home page GithubHelp logo

arpendu11 / graph-based-data-lake Goto Github PK

View Code? Open in Web Editor NEW
7.0 1.0 2.0 58 KB

An ETL application which is written in Quarkus, Spark SQL Streaming, Neo4j and various types of Databases and stores. It also covers the devops frameworks like Jenkins CI/CD, docker and Kubernetes.

Java 88.32% HTML 11.68%
quarkus-maven quarkus-resteasy quarkus-kafka spark-sql spark-streaming-kafka reactive-kafka-programming quarkus-neo4j etl-application kubernetes docker neo4j-graph neo4j-search

graph-based-data-lake's Introduction

graph-based-data-lake project

An ETL application which is written in Quarkus, Spark SQL Streaming, Neo4j and various types of Databases and stores. It also covers the devops frameworks like docker and Kubernetes.

This project uses Quarkus, the Supersonic Subatomic Java Framework.

If you want to learn more about Quarkus, please visit its website: https://quarkus.io/.

Explanation

You can follow this series to know stage by stage implementation: Full Stack: Remastering Master Data Management into graph like data

You can find the repo being used for:

Running the application in dev mode

You can run your application in dev mode that enables live coding using:

./mvnw clean compile quarkus:dev

Packaging and running the application

The application can be packaged using ./mvnw clean package. It produces the graph-data-lake-1.0.0-SNAPSHOT-runner.jar file in the /target directory. Be aware that it’s not an über-jar as the dependencies are copied into the target/lib directory.

The application is now runnable using java -jar target/graph-data-lake-1.0.0-SNAPSHOT-runner.jar. It also creates automated Kuberenetes yaml and json in /target/kubernetes/kubernetes.yml or /target/kubernetes/kubernetes.json which defines the Deployment and Service component required to run the container.

Creating a native executable

You can create a native executable using: ./mvnw clean package -Pnative.

Or, if you don't have GraalVM installed, you can run the native executable build in a container using: ./mvnw clean package -Pnative -Dquarkus.native.container-build=true. You can then execute your native executable with: ./target/graph-data-lake-1.0.0-SNAPSHOT-runner

Or, if you want to run a minimal install docker build, the you can run the executable build using: ./mvnw clean package -Pnative -Dnative-image.docker-build=true.

If you want to learn more about building native executables, please consult https://quarkus.io/guides/building-native-image.

Build Docker image

You can build a docker image using: docker build -f src/main/docker/Dockerfile.jvm -t quarkus/graph-data-lake-jvm Then run the container using: docker run -i --rm -p 8080:8080 quarkus/graph-data-lake-jvm

If you want to include the debug port into your docker image you will have to expose the debug port (default 5005) like this : EXPOSE 8080 5050 Then run the container using : docker run -i --rm -p 8080:8080 -p 5005:5005 -e JAVA_ENABLE_DEBUG="true" quarkus/graph-data-lake-jvm

Build and Run Kubernetes cluster

If you have minikube installed, then check for minikube status Update the image name and tag after docker build in kubernetes.yml and kuberenets.json

Now run your cluster using: kubectl apply -f target/kubernetes/kubernetes.yml If you want to scale the application to multiple replicas, the you can use: kubectl scale --replicas=10 --deployment graph-data-lake

It takes only few seconds to spin up those containers.

graph-based-data-lake's People

Contributors

arpendu11 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.