
dataflow-flow-centric-poc's Introduction

 


Check the last build on Travis-CI



Flow Centric PoC

Complex JSON structures coming from different sources? We want to solve that problem. Take a look and get inspired!

You have small but complex data, and you want to get it ready for any analytics tool with as little effort and expense as possible. To that end we tried combining the BSON library with the Spring Cloud Dataflow framework to run a 'continuous streaming' flow.

First-level PoC development.

In a real-world deployment we would provide at least an architecture similar to this one:


As a PoC we start with a single Maven project, which covers only a small subset of the minimum viable architecture, as shown here:


Results

In the MongoDB instance you can find the flow-centric database.


After a cycle of data streaming we have two MongoDB collections for each category.


The data collections already contain some ready-made indexes:


Here is how the MongoDB collections appear.


Each element in any of the data collections tracks:

  • metadata partition

  • metadata document id

  • index

  • model name

All of this is shown in the following images:
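As a rough illustration of the tracked elements listed above, the sketch below models one data-collection element as a plain Java map. The key names (`metadataPartition`, `metadataDocumentId`, `index`, `modelName`, `payload`), the sample values, and the class itself are hypothetical. The real key names used by the PoC are not stated here, and the project stores BSON documents rather than a `Map`.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical shape of one element stored in a data collection.
// All key names and values are assumptions made for illustration only.
final class TrackedElementSketch {

    // Builds an element that carries the four tracked values next to the data itself.
    static Map<String, Object> trackedElement(String partition, String documentId,
                                              long index, String modelName,
                                              Map<String, Object> payload) {
        Map<String, Object> element = new LinkedHashMap<>();
        element.put("metadataPartition", partition);    // metadata partition
        element.put("metadataDocumentId", documentId);  // metadata document id
        element.put("index", index);                    // index
        element.put("modelName", modelName);            // model name
        element.put("payload", payload);                // the data itself (assumed key)
        return element;
    }

    public static void main(String[] args) {
        Map<String, Object> payload = new LinkedHashMap<>();
        payload.put("temperature", 21.5);
        System.out.println(
            trackedElement("partition-0", "doc-42", 7L, "sensorReading", payload));
    }
}
```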


Spring Cloud Configuration

The Spring Cloud Config server takes the configuration from a specific repository, as follows:

It provides the following profiles:

  • dev (source_dev, process_dev, sink_dev)

  • compose (source_compose, process_compose, sink_compose)

  • kubernetes (source_kubernetes, process_kubernetes, sink_kubernetes)

  • local (not ready)
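The profile names listed above appear to follow a `<role>_<environment>` pattern, where the role is one of `source`, `process`, or `sink`. The sketch below is a minimal illustration of that naming convention, assuming the pattern holds for every environment. The class and method names are hypothetical and not part of the project.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper that derives per-microservice profile names
// from an environment name, following the <role>_<environment> pattern.
final class ProfileNamesSketch {

    // The three microservice roles named in the profile list.
    static final List<String> ROLES = Arrays.asList("source", "process", "sink");

    static List<String> profilesFor(String environment) {
        List<String> names = new ArrayList<>();
        for (String role : ROLES) {
            names.add(role + "_" + environment);
        }
        return names;
    }

    public static void main(String[] args) {
        for (String env : Arrays.asList("dev", "compose", "kubernetes")) {
            System.out.println(env + " -> " + profilesFor(env));
        }
    }
}
```

A service would then presumably be started with the matching profile, for example by setting the standard Spring property `spring.profiles.active` to `source_dev`. How the PoC actually selects its profile is not stated here.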

Test locally without building the code

We provide a Docker Compose file that simulates the base environment.

Please visit the scripts folder.

Information about Docker Compose is available here, and the command line reference here.

Enjoy your journey in the Spring Cloud Dataflow framework.

Docker images repository

Here is the repository with the Docker image sources:

Coming soon

An upcoming branch will ship a compose file whose main additions are the Spring Cloud Dataflow Server and the Spring Cloud Skipper Server. You will be able to scale them as far as your system resources allow, and to scale individually each of the 3 microservices registered in the Server via the catalogue (source, process and sink). We will also provide the 3 analytics microservices, with some automation for defining and recognizing model types and indexes, plus some new spatial concepts. Indexes required by the analytics nodes will be placed automatically via the model database (missing in this release). Another streaming engine will push data into the channels of the metadata sourcing microservice, and will also be used to push back responses from the analytics sink microservice after the analytics group computation. So get ready for a more intensive experience in the dataflow universe...

License

The library is licensed under the CC0 v. 1.0 terms, subject to prior authorization from the author before any production or commercial use. Use of this library or any extension is prohibited due to the high risk of damage from improper use. No warranty is provided for improper or unauthorized use of this library or any implementation.

Any request can be addressed to the author, Fabrizio Torelli, at the following email address:

[email protected]
