GithubHelp home page GithubHelp logo

portfolio's Introduction

AI IN THE WORKPLACE: ANALYZING EMPLOYEE SENTIMENTS

Jonathan Asanjarani
Department of Data Analysis and Visualization, CUNY Graduate Center
LLM Fall 2023: Large Language Models and ChatGPT
Professor Michelle McSweeney
December 18, 2023

Abstract

This project aims to employ BERT for classifying employee sentiments toward their company, utilizing natural language processing (NLP) and sentiment analysis. BERT, a language model hosted by Hugging Face, will be applied to a sentiment classification task using TensorFlow for fine-tuning. The dataset, sourced from Kaggle here, comprises job reviews from Glassdoor. The focus is on providing an overview of BERT model setup and tuning for employee sentiment analysis, along with exploring societal and ethical considerations tied to language model use in workplace sentiment monitoring.

Background

BERT

This project leverages BERT, a language model developed in 2018, utilizing neural networks for text representation. BERT understands word context by examining surrounding words on both sides, providing a more accurate representation of linguistic relationships. The architecture incorporates an “Attention Mechanism,” embedding weighted values for each word. The “Transformer” model, based on this mechanism, consists of an encoder with two layers. BERT, being an encoder-only model, stands for Bidirectional Encoder Representations from Transformers. It employs bidirectional training using a “masked language model” (MLM) pre-training objective. This allows BERT to develop accurate contextual relationships without rebuilding the entire model, making it versatile for specific tasks.

Sentiment Analysis

Sentiment analysis studies opinions, sentiments, and emotions using computational methods in the context of NLP. Large Language Models (LLMs) perform well in simple sentiment analysis tasks, classifying sentiment as positive or negative. However, for more complex tasks requiring a deep understanding of specific sentiments, LLMs may not outperform traditional models trained with specific, in-domain data.

Social & Ethical Implications

Social Implications

Leslie Hammer states that employees feeling uncared for or untrusted by employers can lead to lower commitment, perceived psychological safety, and higher stress levels. Companies using AI for workforce monitoring report negative mental health effects among employees. There is skepticism and concern among workers regarding potential misuse of collected information. While some anticipate improved workplace security, others express worries about inappropriate behavior and misuse.

Ethical Implications

Large Language Models and AI tools in workplace surveillance raise ethical concerns. Companies may monitor employees extensively, posing risks of micromanaging and potential AI training on private or unethically sourced data. Cases like Barclays Bank's fine for unauthorized employee monitoring underscore the need for clear regulations balancing surveillance and privacy. The increased demand for workplace monitoring solutions during the pandemic, without adequate regulations, emphasizes the importance of ethical guidelines as AI technologies advance.

Method

The BERT model was fine-tuned using a Kaggle dataset of employee reviews. The dataset, accessible here, consists of job reviews from Glassdoor. TensorFlow, an open-source machine learning platform, was employed for fine-tuning. The BERT Tokenizer was used for text tokenization, and the dataset was converted into a TensorFlow dataset. The model was compiled with an "Adam" optimizer for prediction enhancement and "sparse categorical crossentropy" for evaluation. Training involved two epochs, and the model was tested on a separate set of reviews from CareerBliss.

The code for this project is available on GitHub.

Results

During the evaluation phase, the employee sentiment analyzer accurately predicted sentiments in a sample of 10 employee reviews. The model demonstrated proficiency in discerning and categorizing sentiments, aligning with employees' expressions in all instances.

Conclusion

BERT effectively classified binary sentiment relationships, accurately categorizing basic employee sentiments. However, limitations exist in this binary classification approach, as it excludes nuanced or subtle employee reviews. Future research should incorporate a broader spectrum of sentiments, including neutral or mixed emotions, to provide a more comprehensive understanding of employee feedback and enhance sentiment classification.

References

portfolio's People

Contributors

jdasanja avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.