This project is the solution for the Data Engineering assignment given in the below link: https://bitbucket.org/fetchrewards/data-engineering-take-home/src/master/
- Clone this repo.
git clone https://github.com/prasadashu/data-engineering-fetch-rewards.git
- Go into the cloned repo.
cd data-engineering-fetch-rewards
- Run
make
command to install dependencies.
make pip-install
- Run
make
command to configure aws shell.
make aws-configure
- Pull and start docker containers.
make start
- Run Python code to perform ETL process.
make perform-etl
- To validate the messages loaded in Postgres
psql -d postgres -U postgres -p 5432 -h localhost -W
-
Credentials and database information
- username=
postgres
- password=
postgres
- database=
postgres
- username=
-
If
psql
binary is not installed on Ubuntu based distros, install it using the below command.
apt install postgresql-client
- The
ip
anddevice_id
fields are masked using base64 encryption. - To recover the encrypted fields, we can use the below command.
echo -n "<sample_base64_encrypted_string>" | base64 --decrypt