shreyapatil1199 / data_science_salary_prediction

The objective of the project is to analyze a dataset of data science job postings, identify the factors that most influence salaries, and build predictive models that estimate salaries for data science professionals from factors such as experience level, education, and skills.

Jupyter Notebook 100.00%
adaboost-regressor data-cleaning decision-tree-regression exploratory-data-visualizations extreme-gradient-boosting-regressor gradient-boosting-regressor multiple-linear-regression random-forest-regressor ridge-regression

Data_Science_Salary_Prediction

License: CC0-1.0

Python 3

Objective

This project conducts a regression analysis of data science job salaries. It aims to:

1. Explore and Analyze Data: Collect and preprocess job salary data to gain insights into trends and patterns within the data science job market.

2. Build Regression Models: Develop regression models to predict salaries based on various features, such as job title, location, experience, and skills.

3. Evaluate Algorithms: Compare and evaluate different regression algorithms to identify the most effective models for salary prediction.

4. Provide Insights: Share meaningful insights and conclusions derived from the analysis, helping job seekers, employers, and policymakers make informed decisions.

By achieving these objectives, this project aims to empower stakeholders in the data science job market with valuable insights, enhance predictive modelling skills, and contribute to the broader data science community.
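As a rough illustration of the "Evaluate Algorithms" step, the sketch below scores predictions with three common regression metrics (MAE, RMSE, R²) and ranks models by RMSE. The helper names `regression_metrics` and `rank_models`, the model labels, and all salary figures are hypothetical and made up for this example; they are not results or code from this project.

```python
import math


def regression_metrics(y_true, y_pred):
    """Return MAE, RMSE and R^2 for two equal-length sequences of numbers."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and the same length")
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]
    ss_res = sum(e * e for e in errors)                 # residual sum of squares
    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)  # total sum of squares
    return {
        "mae": sum(abs(e) for e in errors) / n,
        "rmse": math.sqrt(ss_res / n),
        # R^2 is undefined when every true value is identical
        "r2": 1.0 - ss_res / ss_tot if ss_tot else float("nan"),
    }


def rank_models(y_true, predictions_by_model):
    """Return (name, metrics) pairs sorted by ascending RMSE (lower is better)."""
    scored = {
        name: regression_metrics(y_true, preds)
        for name, preds in predictions_by_model.items()
    }
    return sorted(scored.items(), key=lambda item: item[1]["rmse"])


# Made-up salaries in USD, for illustration only
actual = [100_000, 120_000, 80_000, 150_000]
predictions = {
    "model_a": [110_000, 115_000, 85_000, 140_000],
    "model_b": [90_000, 140_000, 60_000, 170_000],
}

ranking = rank_models(actual, predictions)
for name, metrics in ranking:
    print(f"{name}: MAE={metrics['mae']:.0f} RMSE={metrics['rmse']:.0f} R2={metrics['r2']:.3f}")
```

In practice the predictions would come from fitted models evaluated on held-out data; ranking by RMSE is just one reasonable choice, and MAE may be preferable when a few very high salaries should not dominate the comparison.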

Prerequisites

To run this analysis, you need:

Python 3

Jupyter Notebook (optional)

Pandas

Matplotlib (for data visualization)

Seaborn (for enhanced data visualization)
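One way to confirm the libraries listed above are available before opening the notebook is a small standard-library check like the sketch below. The helper name `missing_packages` is hypothetical, and `notebook` is assumed to be the import name of the Jupyter Notebook package.

```python
from importlib.util import find_spec

# Import names for the libraries listed above.
REQUIRED = ["pandas", "matplotlib", "seaborn"]
# Assumption: Jupyter Notebook is importable as "notebook"; it is optional.
OPTIONAL = ["notebook"]


def missing_packages(names):
    """Return the import names that cannot be found in the current environment."""
    return [name for name in names if find_spec(name) is None]


missing_required = missing_packages(REQUIRED)
missing_optional = missing_packages(OPTIONAL)

if missing_required:
    print("Missing required packages:", ", ".join(missing_required))
else:
    print("All required packages are installed.")
if missing_optional:
    print("Missing optional packages:", ", ".join(missing_optional))
```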

Data Description

1. Unnamed

Data Type: Integer (int64)

Description: An index or identifier for each data record.

2. work_year

Data Type: Integer (int64)

Description: The year in which the job information was recorded or applicable.

3. experience_level

Data Type: Object (String)

Description: The level of experience required or possessed for the job, categorized into different levels (e.g., Junior, Mid-Level, Senior).

4. employment_type

Data Type: Object (String)

Description: The type of employment associated with the job (e.g., Full-Time, Part-Time, Contract, etc.).

5. job_title

Data Type: Object (String)

Description: The title or name of the job position.

6. salary

Data Type: Integer (int64)

Description: The salary for the job position, denominated in the currency given by salary_currency.

7. salary_currency

Data Type: Object (String)

Description: The currency in which the salary is denominated.

8. salary_in_usd

Data Type: Integer (int64)

Description: The salary converted into United States Dollars (USD) for standardization and comparison.

9. employee_residence

Data Type: Object (String)

Description: The location or residence of the employee, often specified by country or region.

10. remote_ratio

Data Type: Integer (int64)

Description: The ratio or percentage of remote work allowed or expected for the job position.

11. company_location

Data Type: Object (String)

Description: The location of the company or employer, often specified by country or city.

12. company_size

Data Type: Object (String)

Description: The size category of the company, typically categorized by the number of employees (e.g., Small, Medium, Large).
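The column list above can be sketched as a small pandas DataFrame to show how the numeric and string columns separate and how the index-like first column might be dropped. Every row below is made up for illustration and is not taken from the project's dataset; the column names follow the list above, written in lowercase, and the category labels reuse the examples given in the descriptions.

```python
import pandas as pd

# Made-up rows for illustration only; not values from the project's dataset.
df = pd.DataFrame({
    "Unnamed": [0, 1, 2],
    "work_year": [2020, 2021, 2022],
    "experience_level": ["Junior", "Mid-Level", "Senior"],
    "employment_type": ["Full-Time", "Part-Time", "Contract"],
    "job_title": ["Data Analyst", "Data Scientist", "Data Engineer"],
    "salary": [50000, 70000, 120000],
    "salary_currency": ["EUR", "GBP", "USD"],
    "salary_in_usd": [55000, 90000, 120000],
    "employee_residence": ["DE", "GB", "US"],
    "remote_ratio": [0, 50, 100],
    "company_location": ["DE", "GB", "US"],
    "company_size": ["Small", "Medium", "Large"],
})

# The first column is only a record index, so drop it before analysis.
df = df.drop(columns=[c for c in df.columns if c.startswith("Unnamed")])

# Split columns by kind: integer columns versus string (categorical) columns.
numeric_cols = df.select_dtypes(include="number").columns.tolist()
categorical_cols = df.select_dtypes(exclude="number").columns.tolist()

print("Numeric:", numeric_cols)
print("Categorical:", categorical_cols)

# Example summary: median USD salary per experience level.
print(df.groupby("experience_level")["salary_in_usd"].median())
```

Using salary_in_usd rather than salary as the prediction target avoids mixing currencies, which is presumably why the dataset includes the converted column.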
