
rpavani1998 / nlp-techniques-stancedetection

This project forked from vidhishah22/nlp-techniques-stancedetection


NLP Techniques for stance detection to detect quality of the news source

Home Page: https://youtu.be/5H8WUdT_10c

Jupyter Notebook 100.00%

NLP-Techniques-StanceDetection

Objective:

Fake news is defined as a “made up story with an intention to deceive, geared towards getting clicks”. Stance detection is one of the factors that influence fake news detection. It is not practical for humans to fact-check every piece of information produced by the media. Hence, the goal of this project is to use natural language processing (NLP) techniques to automate stance detection and so help determine the quality of a news source. Stance detection considers what other organizations write about the same headline: each body text is classified as agreeing with, disagreeing with, discussing, or being unrelated to a headline.

Technical Skills: Jupyter Notebook, Google Colab, Python, NLP, NLTK, TF-IDF, Cosine Similarity, Data Cleaning, Machine Learning Classification Models, ROC Curves
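To illustrate how TF-IDF and cosine similarity can relate a headline to a body text, here is a minimal sketch using only the Python standard library. It is not the notebook's actual code: the helper names (`tokenize`, `tfidf_vectors`, `cosine_similarity`), the smoothed IDF formula, and the example sentences are all assumptions made for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vectors(documents):
    """Return one sparse TF-IDF vector (dict: term -> weight) per document."""
    tokenized = [tokenize(doc) for doc in documents]
    n_docs = len(tokenized)
    # Document frequency: number of documents in which each term occurs.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1
        vectors.append({
            # Smoothed IDF (an assumption) keeps every weight positive.
            term: (count / total)
            * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1)
            for term, count in counts.items()
        })
    return vectors


def cosine_similarity(vec_a, vec_b):
    """Cosine of the angle between two sparse vectors; 0.0 if either is empty."""
    dot = sum(weight * vec_b.get(term, 0.0) for term, weight in vec_a.items())
    norm_a = math.sqrt(sum(w * w for w in vec_a.values()))
    norm_b = math.sqrt(sum(w * w for w in vec_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Made-up example texts, not taken from the dataset.
headline = "Police find mass graves near the city"
related_body = "Police said on Monday that mass graves were found near the city."
unrelated_body = "A new smartphone goes on sale next week."

vectors = tfidf_vectors([headline, related_body, unrelated_body])
related_score = cosine_similarity(vectors[0], vectors[1])
unrelated_score = cosine_similarity(vectors[0], vectors[2])
print(f"related: {related_score:.3f}, unrelated: {unrelated_score:.3f}")
```

A low similarity score is a natural signal for the "unrelated" class, while separating agree, disagree, and discuss usually needs additional features and a trained classifier.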

Dataset(s):

The dataset comes from the Fake News Challenge (FakeNewsChallenge.org) and consists of headlines and body texts. The body text paired with a headline may come from a different article. The output of the system is the stance of the body text relative to the headline.

The two CSV files I used to implement the system are:

“Train_bodies.csv”: contains Body ID and Body Text (number of records: 2532)

“Train_stances.csv”: contains Headline, Body ID and Stance (number of records: 49972)
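Since the two files share a Body ID, each headline can be paired with its body text by joining on that column. The sketch below shows one way to do this with the standard `csv` module. The function name `join_stances_with_bodies` is hypothetical, the column names `Body ID`, `Headline`, `Stance` and `articleBody` are assumptions about the CSV headers, and the rows are made-up stand-ins for the real files.

```python
import csv
import io


def join_stances_with_bodies(bodies_file, stances_file, body_col="articleBody"):
    """Attach the body text to each stance row by matching on Body ID."""
    # Map each Body ID to its text so that lookups are constant time.
    bodies = {row["Body ID"]: row[body_col] for row in csv.DictReader(bodies_file)}
    joined = []
    for row in csv.DictReader(stances_file):
        joined.append({
            "headline": row["Headline"],
            "body": bodies[row["Body ID"]],
            "stance": row["Stance"],
        })
    return joined


# Tiny in-memory stand-ins for the two CSV files (made-up rows).
bodies_csv = io.StringIO(
    "Body ID,articleBody\n"
    '0,"Officials confirmed the report on Monday."\n'
    '1,"The team won the final after extra time."\n'
)
stances_csv = io.StringIO(
    "Headline,Body ID,Stance\n"
    "Officials confirm report,0,agree\n"
    "Report denied by officials,0,disagree\n"
    "Officials confirm report,1,unrelated\n"
)

pairs = join_stances_with_bodies(bodies_csv, stances_csv)
for pair in pairs:
    print(pair["stance"], "|", pair["headline"], "|", pair["body"])
```

With the real files, the two arguments would be file objects opened with `open(path, newline="", encoding="utf-8")`, and `body_col` should be set to whatever the body-text column is actually called.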

List of Files:

  1. Refer to the PDF document "NLTK_Basics.pdf" for the basic terminology used in NLTK.

  2. Refer to the research paper "Research_Paper_AlternusVera.pdf", which presents 4 different factors influencing fake news detection, including stance detection. The paper is the combined work of a team of 4.

  3. Refer to the YouTube video link below to get a quick tour:

    https://youtu.be/5H8WUdT_10c

Comparison of Algorithms

[Figure: comparison of algorithms]

I achieved a best accuracy of 89.88%, whereas the lower-performing models reached an accuracy of just 43%.
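For reference, accuracy is the fraction of headline/body pairs whose predicted stance matches the true stance. A minimal sketch follows; the `accuracy` helper and the label lists are made up for illustration and are not the project's results.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that exactly match the true labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("cannot compute accuracy of an empty list")
    correct = sum(true == pred for true, pred in zip(y_true, y_pred))
    return correct / len(y_true)


# Made-up labels: 4 of the 5 predictions are correct.
y_true = ["agree", "disagree", "discuss", "unrelated", "unrelated"]
y_pred = ["agree", "discuss", "discuss", "unrelated", "unrelated"]
score = accuracy(y_true, y_pred)
print(f"{score:.2%}")  # prints 80.00%
```

Because the stance classes may be unevenly represented, accuracy alone can be misleading, so it is worth reading alongside per-class metrics.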

ROC Curve

[Figure: ROC curve]
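A ROC curve plots the true positive rate against the false positive rate as the decision threshold is lowered. For a multi-class problem such as stance detection, a common approach is one-vs-rest: treat one stance as positive and all others as negative. The sketch below assumes that setup. The helper names `roc_points` and `auc`, the labels, and the scores are all made up for illustration, and this is not necessarily how the curve above was produced.

```python
def roc_points(labels, scores):
    """Return (false positive rate, true positive rate) points of a ROC curve.

    labels: 1 for the positive class, 0 for the rest.
    scores: classifier scores, where higher means "more likely positive".
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one positive and one negative example")
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    points = [(0.0, 0.0)]
    true_pos = false_pos = 0
    i = 0
    while i < len(ranked):
        threshold = ranked[i][0]
        # Handle tied scores together so that they produce a single point.
        while i < len(ranked) and ranked[i][0] == threshold:
            if ranked[i][1] == 1:
                true_pos += 1
            else:
                false_pos += 1
            i += 1
        points.append((false_pos / n_neg, true_pos / n_pos))
    return points


def auc(points):
    """Area under the curve, computed with the trapezoidal rule."""
    return sum(
        (x1 - x0) * (y0 + y1) / 2
        for (x0, y0), (x1, y1) in zip(points, points[1:])
    )


# Made-up one-vs-rest example: 1 = "agree", 0 = any other stance.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
curve = roc_points(labels, scores)
area = auc(curve)
print(curve)
print(f"AUC = {area:.4f}")
```

An AUC of 0.5 corresponds to random ranking and 1.0 to a perfect ranking of positives above negatives.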

Thank You,

Vidhi Shah
