GithubHelp home page GithubHelp logo

shanuhalli / project-resume-classification Goto Github PK

View Code? Open in Web Editor NEW
2.0 1.0 1.0 14.42 MB

The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention.

Jupyter Notebook 99.87% Python 0.13%
lemmitization regular-expression text-mining count-vectorizer ner word-clouds stemming nltk resume-classification classification

project-resume-classification's Introduction

Project-Resume-Classification

Problem Statement & Business Objectives: The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention.

Abstract:

A resume is a brief summary of your skills and experience. Companies recruiters and HR teams have a tough time scanning thousands of qualified resumes. Spending too many labor hours segregating candidates resume's manually is a waste of a company's time, money, and productivity. Recruiters, therefore, use resume classification in order to streamline the resume and applicant screening process. NLP technology allows recruiters to electronically gather, store, and organize large quantities of resumes. Once acquired, the resume data can be easily searched through and analyzed.

Resumes are an ideal example of unstructured data. Since there is no widely accepted resume layout, each resume may have its own style of formatting, different text blocks and different category titles. Building a resume classification and gathering text from it is no easy task as there are so many kinds of layouts of resumes that you could imagine.

🔹The basic data analysis process performed such as data collection, text mining, data cleaning, exploratory data analysis, data visualization.

🔹Building a Machine learning model for Resume Classification using Python and basic Natural language processing techniques.

🔹Used Python's libraries to implement various NLP techniques like tokenization, lemmatization, parts of speech tagging, etc.

🔹A resume classification analyzes resume data and extracts the information into the machine-readable output. It helps automatically store, organize, and analyze the resume data to find out the candidate for the particular job position and requirements.

🔹The aim of this project is achieved by performing the various data analysis methods and using the Machine Learning models and Natural Language Processing which will help in classifying the categories of the resume and building the Resume Classification Model.

In this work, I compare different types of machine-learning algorithms.

  1. K-Nearest Neighbors
  2. Decision Tree
  3. Random Forest
  4. Support Vector Machine
  5. Logistic Regression
  6. Bagging Classifier
  7. Ada Boost Classifier
  8. Gradient Boosting
  9. Naive Bayes

All models above show the following train and test accuracy:

✍️ Author

Connect with me

LinkedInInstagramFacebookGmailWhatsapp

project-resume-classification's People

Contributors

shanuhalli avatar

Stargazers

 avatar  avatar

Watchers

 avatar

Forkers

kshitij-2107

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.