https://youtu.be/AdK1KyHy87o?si=1TKDGUKbD1gbV_r2
https://adult-censusincome-prediction.onrender.com
The Goal is to predict whether a person has an income of more than 50K a year or not. This is basically a binary classification problem where a person is classified into the
50K group or <=50K group.
The classical machine learning tasks like Data Exploration, Data Cleaning, Feature Engineering, Model Building and Model Testing. Try out different machine learning algorithms that’s best fit for the above case.
The dataset is taken from a Kaggle. You can download the dataset from here
Applying machine learing tasks like Data Exploration, Data Cleaning, Feature Engineering, Model Building and model testing to build a solution that should able to predict the premium of the personal for health insurance.
-
Data Exploration : Exploring the dataset using pandas, numpy, matplotlib, plotly and seaborn.
-
Exploratory Data Analysis : Plotted different graphs to get more insights about dependent and independent variables/features.
-
Feature Engineering : Numerical features scaled down and Categorical features encoded.
-
Model Building : In this step, first dataset Splitting is done. After that model is trained on different Machine Learning Algorithms such as:
- Logistic Regression
- Decision Tree Classification
- Random Forest Classification
- Gradient Boosting Classification
- XGBoost Classification
-
Model Selection : Tested all the models to check the RMSE & R-squared.
-
Pickle File : Selected model as per best RMSE score & R-squared and created pickle file using pickle library.
-
Webpage &Deployment : Created a web application that takes all the necessary inputs from the user & shows the output. Then deployed project on the AWS Platform. =======
-
Webpage &Deployment : Created a web application that takes all the necessary inputs from the user & shows the output. Then deployed project on the Render Platform.
bc6ac5c4f91f55061482a3a8b501cdb844a3cbdc
1) Pandas
2) Numpy
3) Matplotlib, Seaborn, Plotly
4) Scikit-Learn
5) Flask
6) HTML
7) CSS
1) Python
2) Front-end : HTML, CSS
3) Back-end : Flask
4) Deployment : Heruko
Input the characteristics of the diamond when prompted, such as 'age','workclass', 'fnlwgt', 'education', 'marital-status','occupation','relationship', 'race', 'sex', 'capital-gain','capital-loss', 'hours-per-week', 'country'.
Experiment with different machine learning algorithms to improve prediction accuracy. Perform feature engineering to extract more relevant information from the dataset. Collect additional data to further train and validate the model.
➡️ABHISHEK UPADHYAY ➡️Email id: [email protected]