This project requires Python 3.11 and the following Python libraries installed:
- PySpark 3.5
- pyspark.ml
- pyspark.sql
- SynapseML
- Pandas
- Seaborn
- Matplotlib
The SaaS business must manage churn. Getting subscriptions to continue is becoming central to business. In addition, we can stop services that should be unnecessary if we know the criteria for stopping unwanted subscriptions. I am very interested in how we deal with churn as I am also in this business.
Sparkify.ipynb
Sparkify.html
data
|- mini_sparkify_event_data.json
The main findings of the code can be found at the post available here.
- Udacity for providing an excellent Data Scientist training program and the project along with the data.