wanshicheng / datacompare

This project forked from dromara/datacompare

A big data comparison and data profiling platform: low-code data comparison and data profiling.

Home Page: https://www.reddit.com/r/bigdata/comments/zlm2mw/design_and_practice_of_open_source_big_data/

License: Apache License 2.0

JavaScript 26.77% Java 43.55% CSS 4.76% ANTLR 0.95% HTML 23.97%

dataCompare

Introduction

dataCompare is a database comparison and profiling platform.

(1) It supports data comparison for Hive tables, MySQL, and Doris. Comparison tasks are configured automatically, so you do not have to keep writing SQL by hand.

(2) It supports easy configuration of data profiling.

Features

data-compare

(1) Comparison tasks are configured interactively in the UI: low code and a small amount of configuration are enough to generate a comparison task quickly

(2) Magnitude (row count) comparison, consistency comparison, and automatic discovery of difference cases

(3) JDBC data sources such as MySQL, Apache Hive, and Apache Doris are currently supported

(4) Comparison results can be sent automatically as email alert reports
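As a rough illustration of what magnitude comparison, consistency comparison, and difference case discovery mean, here is a minimal Java sketch. It is not the project's implementation: the class `CompareSketch`, its method names, and the in-memory row representation are all hypothetical, and the generated SQL has not been tested against Hive or Doris.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.Objects;
import java.util.TreeSet;

// Hypothetical sketch, not dataCompare's actual code.
final class CompareSketch {

    // Magnitude comparison: build one query that returns the row count of each table.
    // Table names are concatenated directly, so they must come from trusted configuration.
    static String countSql(String leftTable, String rightTable) {
        return "SELECT 'left' AS side, COUNT(*) AS cnt FROM " + leftTable
             + " UNION ALL SELECT 'right' AS side, COUNT(*) AS cnt FROM " + rightTable;
    }

    // The two tables have the same magnitude when their row counts match.
    static boolean sameMagnitude(long leftCount, long rightCount) {
        return leftCount == rightCount;
    }

    // Consistency comparison on rows already loaded into memory and keyed by primary key.
    // Returns the sorted keys that are missing on one side or whose column values differ.
    static List<String> differingKeys(Map<String, List<Object>> left,
                                      Map<String, List<Object>> right) {
        TreeSet<String> allKeys = new TreeSet<>(left.keySet());
        allKeys.addAll(right.keySet());
        List<String> diffs = new ArrayList<>();
        for (String key : allKeys) {
            // A missing key yields null on that side, which never equals a real row.
            if (!Objects.equals(left.get(key), right.get(key))) {
                diffs.add(key);
            }
        }
        return diffs;
    }
}
```

In a real deployment the comparison would more likely be pushed down to the database as SQL rather than done in memory; the in-memory version is used here only to keep the example self-contained.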

data-profiling

(1) Data profiling can be completed with low code and a small amount of configuration

(2) Primary key, enumeration value, and null value detection
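The three profiling checks named above can be sketched in a few lines of Java. This is only an illustration under stated assumptions: the class `ProfilingSketch`, its methods, and the representation of a table as a list of column-name-to-value maps are hypothetical and are not taken from the project.

```java
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;

// Hypothetical sketch, not dataCompare's actual code.
final class ProfilingSketch {

    // Null value detection: count the rows whose value in the column is null or absent.
    static long nullCount(List<Map<String, Object>> rows, String column) {
        long count = 0;
        for (Map<String, Object> row : rows) {
            if (row.get(column) == null) {
                count++;
            }
        }
        return count;
    }

    // Primary key detection: the column qualifies when it has no nulls and no duplicates.
    static boolean isPrimaryKey(List<Map<String, Object>> rows, String column) {
        Set<Object> seen = new HashSet<>();
        for (Map<String, Object> row : rows) {
            Object value = row.get(column);
            if (value == null || !seen.add(value)) {
                return false;
            }
        }
        return true;
    }

    // Enumeration value detection: each distinct non-null value with its occurrence count.
    static Map<String, Long> enumCounts(List<Map<String, Object>> rows, String column) {
        Map<String, Long> counts = new TreeMap<>();
        for (Map<String, Object> row : rows) {
            Object value = row.get(column);
            if (value != null) {
                counts.merge(String.valueOf(value), 1L, Long::sum);
            }
        }
        return counts;
    }
}
```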

Software Architecture

Technology stack:

Back end: Spring Boot + MyBatis

Database: MySQL

Parsing engine: ANTLR

Big data: Hive, Spark

System flowchart

[Figures: system flowcharts (img_1.png, img.png)]

Demonstration of system functionality

Home page: [screenshot]

data-compare:

DbConfig

MySQL config: [screenshot]

Hive config: [screenshot]

Job config: [screenshot]

Comparison results: [screenshots]

data-profiling:

Job config: [screenshot: img_2.png]

Profiling result: [screenshots: img_3.png, img_4.png]

System running environment

Java JDK 8

MySQL 5.7.36

Running configuration

(1) Run the SQL files in the SQL directory against your MySQL instance to create the database and tables

(2) Build the jar from the project source: mvn clean package -Dmaven.test.skip=true -Ptest

(3) Edit the database connection settings in application.yml

(4) Run java -jar -Dspring.config.location=application.yml dataCompare.jar (application.yml and the jar must be in the same directory)

(5) Visit http://127.0.0.1/ (username: admin, password: admin123)

Environment installation configuration

(1) To configure Hive data comparison, you first need a Hive environment. For a quick Docker-based installation of Hive, see https://blog.csdn.net/ifenggege/article/details/107860477

(2) After installation, create a new data source connection, select Hive as the type, and enter the address jdbc:hive2://ip:10000 (replace ip with the address of your Hive server)
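To check the Hive address outside the platform, a connection test along the following lines may help. This is a minimal sketch, not part of dataCompare: the class `HiveConnectionSketch` and its methods are hypothetical, and `open` assumes the Apache Hive JDBC driver (the `hive-jdbc` artifact) is on the classpath and that HiveServer2 is listening on the given port.

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.SQLException;

// Hypothetical sketch, not dataCompare's actual code.
final class HiveConnectionSketch {

    // Builds a HiveServer2 JDBC URL such as jdbc:hive2://127.0.0.1:10000.
    // The database suffix is optional; pass an empty string to omit it.
    static String jdbcUrl(String host, int port, String database) {
        String url = "jdbc:hive2://" + host + ":" + port;
        return database.isEmpty() ? url : url + "/" + database;
    }

    // Opens a connection on port 10000, the port used in the address above.
    // Throws SQLException if no suitable driver is found or the server is unreachable.
    static Connection open(String host, String user, String password) throws SQLException {
        return DriverManager.getConnection(jdbcUrl(host, 10000, ""), user, password);
    }
}
```

If `open` fails with a "No suitable driver" error, the driver jar is missing from the classpath; a connection refused error usually means HiveServer2 is not running or the port is not exposed.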

Thanks

Thanks to ruoyi, which provides the front-end services.

Contributors

zhugezifang
