GithubHelp home page GithubHelp logo

callatrumor's Introduction

CALL ATTENTION TO RUMORS: DEEP ATTENTION BASED RECURRENT NEURAL NETWORKS FOR EARLY RUMOR DETECTION

In Partial Fulfillment of the Requirements for the Final Project in Software Engineering (Course 61771)

Authors:

  • Shay Axelrod
  • Yana Mamedov

Supervisor:

  • Prof. Zeev Volkovich

Introduction

The rise in social media usage in recent years has made it a powerful platform for spreading rumors. The spread of rumors can pose a threat to cybersecurity, social, and economic stability worldwide. This project aims to detect the spread of rumors by identifying them on social networks in the early stages of their propagation in an automated way. RNNs have been proven to be effective in recent machine learning tasks for handling long sequential data. Three main challenges in early rumor detection must be addressed: (1) the system must adopt new features automatically and should not be hand-crafted; (2) the solution uses RNN. This algorithm has some well-known problems that prevent the processing of exceedingly long sequences of texts; (3) many duplications of posts with different contextual focuses must be handled. An attention-based RNN powered by long short-term memory (LSTM) with term frequency-inverse document frequency (tf-idf) mechanisms is proposed to overcome these challenges. In the project, the system detects rumors automatically using a deep attention model based on recurrent neural networks (RNN). For simplicity, the model is pre-trained using a dataset from social media sources. The model gets textual sequences of information from posts as input and constructs a series of feature matrices. Then, the RNN with an attention mechanism automatically learns new and hidden text representations. The attention mechanism is embedded in the system to help focus on specific words for capturing contextual variations of relevant posts over time. In the end, an additional hidden layer with a sigmoid activation function uses those text representations and predicts, as output, whether a text is a rumor or not. Furthermore, the system enables a trainer to train the algorithm with some new datasets.

For a better understanding of how this works, please read our research paper.

To see our GUI in action, please watch our short video example: video

UML

Our Use Case Diagram Use Case Diagram

Our Class Diagram Class Diagram

For more information, test results, and more, read our research paper.

callatrumor's People

Contributors

shayaxelrod avatar

Watchers

 avatar

Forkers

yanamamedov

callatrumor's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.