This dataset collects information from 100k medical appointments in Brazil and is focused on the question of whether or not patients show up for their appointment. A number of characteristics about the patients are included in each row.
ScheduledDay tells us on what day the patient set up their appointment. Neighborhood indicates the location of the hospital. Scholarship indicates whether or not the patient is enrolled in Brasilian welfare program Bolsa Familia. Be careful about the encoding of the last column: it says ‘No’ if the patient showed up to their appointment, and ‘Yes’ if they did not show up.
This project was written in python using Anaconda's jupyter notebook python 3.6 or higher version. The following following packages needs to be installed in other to effectively run:
pandas Numpy Matplotlib Csv seaborn
The projects goal is to wrangle the noshowappointment dataset and make Exploratory Data Analysis to discover insightful contents through asking questions.
This can be found at the end of the investigate_a_dataset.html file.