GithubHelp home page GithubHelp logo

grseb9s / uncertaintydetection Goto Github PK

View Code? Open in Web Editor NEW

This project forked from pajean/uncertaintydetection

0.0 1.0 0.0 35.65 MB

MUD (Multiple Uncertainty Detection) allows to detect uncertainty in natural language.

License: MIT License

Python 100.00%

uncertaintydetection's Introduction

MUD

Author:

  • Pierre-Antoine Jean

Co-authors:

  • Sebastien Harispe
  • Sylvie Ranwez
  • Patrice Bellot
  • Jacky Montmain

MUD (Multiple Uncertainty Detection) allows to detect linguistic uncertainty in natural language. It is based on the statistical analysis of multiple lexical and syntactic features used to characterize sentences through vector-based representations that can be analyzed by proven classification methods.

Dependences

MUD.py requires:

Folder

Data folder contains WikiWeasel, BioScope and SFU data (after the conversion of the XML format) and various files build by the program.

Scripts description

Binomial_law.py allows to calculate the binomial law for lemms, bigrams, trigrams and PoS patterns. Moreover, this script calculates p(U).

Binomial_law.py uses pattern_cue_pos.py to build files lemms, bigrams, trigrams and PoS patterns with their p(w) value.

Then, MUD.py allows to build features matrice, executes the SVM method and print precision, recall and F-measure in case of evaluation mode.

Run

Run MUD.py with at least one option:

  • Choose your training corpus

    . w = WikiWeasel - Wikipedia articles - (semantic uncertainty and discourse-level uncertainty)

    . b = BioScope - biomedical papers - (semantic uncertainty)

    . sfu = SFU corpus - gereral corpus - (semantic uncertainty)

  • (optional) your file with one sentence per line.

Examples :

  • python MUD.py w path_your_file
  • python MUD.py path_your_file in this case the training corpus by default is SFU.
  • python MUD.py w is the evaluation mode with WikiWeasel.

Your uncertain sentences are in Data/Results file.

References

  • Un modèle probabiliste pour la détection de l’incertitude dans le langage naturel. Pierre-Antoine Jean, Sebastien Harispe, Sylvie Ranwez, Patrice Bellot, Jacky Montmain. CORIA, 2016.
  • Uncertainty detection in natural language:a probabilistic model. Pierre-Antoine Jean, Sebastien Harispe, Sylvie Ranwez, Patrice Bellot, Jacky Montmain. WIMS, 2016.

uncertaintydetection's People

Contributors

pajean avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.