oelesin / custom-etl-spark2es Goto Github PK
View Code? Open in Web Editor NEWThis repo contains a custom ETL pipeline from MSSQL database to Elasticsearch with Apache Spark. It also creates an Avro/Parquet file that is saved into HDFS for persistence
License: Apache License 2.0