GithubHelp home page GithubHelp logo

mdineskumar / coling2020-tutorial Goto Github PK

View Code? Open in Web Editor NEW

This project forked from grammatical/coling2020-tutorial

0.0 0.0 0.0 2.62 MB

This repository contains materials for our tutorial on automatic grammatical error correction: R. Grundkiewicz, C. Bryant, M. Felice: A Crash Course in Automatic Grammatical Error Correction, COLING 2020.

TeX 100.00%

coling2020-tutorial's Introduction

A Crash Course in Automatic Grammatical Error Correction

This repository contains materials for our tutorial on automatic grammatical error correction: R. Grundkiewicz, C. Bryant, M. Felice: A Crash Course in Automatic Grammatical Error Correction, COLING 2020.

Links and materials:

Tutorial outline

Part I: Introduction

  • About the tutorial
  • Task definition
  • Challenges

Part II: Historical and recent approaches

  • Rule-based methods
  • Language models
  • Error-type classifiers
  • Statistical machine translation
  • Deep neural networks
  • Shared tasks

Part III: Data and evaluation

  • Data annotation
  • Error corpora
  • Evaluation metrics
  • Human evaluation

Part IV: Neural grammatical error correction

  • Neural approach to GEC
  • GEC as low-resource NMT
  • Data sparsity
  • Correction efficacy
  • Beyond the NMT framework

Part V: Recent and future work

  • Findings from the BEA-2019 shared task
  • Towards unsupervised GEC
  • Non-English languages
  • Future work

List of GEC resources

Data sets, evaluation scripts and other resources related to the field of automatic grammatical error correction.

Datasets

Publicly available error corpora for English:

System outputs:

  • System outputs from the CoNLL 2014 Shared Task [download]
  • System outputs from the BEA 2019 Shared Task [download]

Publicly available error corpora for other languages:

Metrics

Shared Tasks

  • BEA 2019 Shared Task: Grammatical Error Correction [website] [paper]
  • NLPCC 2018 Shared Task 2 - Grammatical Error Correction for Chinese [website] [paper]
  • Automated Evaluation of Scientific Writing Shared Task 2016 [website]
  • The Second QALB Shared Task on Automatic Text Correction for Arabic 2015 [paper]
  • CoNLL-2014 Shared Task: Grammatical Error Correction [website] [paper]
  • CoNLL-2013 Shared Task: Grammatical Error Correction [website] [paper]

Other materials

(This list is incomplete, please feel free to open a pull request if you would like to add something to the list)

Reference

@inproceedings{grundkiewicz-etal-2020-crash,
  title = "A Crash Course in Automatic Grammatical Error Correction",
  author = "Grundkiewicz, Roman and Bryant, Christopher and Felice, Mariano",
  booktitle = "Proceedings of the 28th International Conference
               on Computational Linguistics: Tutorial Abstracts",
  month = dec,
  year = "2020",
  address = "Barcelona, Spain (Online)",
  publisher = "International Committee for Computational Linguistics",
  url = "https://www.aclweb.org/anthology/2020.coling-tutorials.6",
  pages = "33--38",
}

coling2020-tutorial's People

Contributors

snukky avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.