GithubHelp home page GithubHelp logo

wmt21-qe-tudarmstadt's Introduction

WMT21 Quality Estimation Entry

Entry of students of the TU Darmstadt for the WMT21 quality estimation shared task (task 1 and 2).

Language adapters and tokenizers for mulitlingual BERT & Sinhala/ Khmer provided by Jonas Pfeiffer. Please cite UNKs Everywhere: Adapting Multilingual Language Models to New Scripts and MAD-X: An Adapter-Based Framework for Multi-Task Cross-Lingual Transfer if you use them.

Installation

Since our implementation relies on the use of adapters, the framework must be present on the system. It can be installed by running

pip install adapter-transformers

Further information on installing the adaper framework can be found on https://adapterhub.ml/.

Use Fine-Tuned Adapters

We release our trained adapters for XLM-R (base & large) and mBERT (trained with language adapters + additional embeddings) on AdapterHub and on Hugging Face Hub.

Usage

Our model can be run by passing a config.yaml file containing all hyperparameters to the run.py's main method.

python run.py configs/config.yaml

An exemplary config.yaml can be found below:

do_train: True
model: bert-base-multilingual-cased
output_dir: results/qe-da/testing
max_seq_len: 50
task: qe_da
report_to: none
madx2: True
architecture: base
reduction_factor: 8
dropout: 0.1
no_lang: False
predict: False
debug: False
boosting: False
train:
  train_batchsize: 8 # should evenly divide 7000 for multi pair training to work with the current hack
  eval_batchsize: 50 # must evenly divide 1000 for multi pair training to work with the current hack
  max_steps: 6000
  logging_steps: 10
  eval_steps: 250
  gradient_accumulation_steps: 1
  save_total_limit: 2
  amp: True
  epochs: 1
  pair: # list of pairs or just a pair
    - [en, de]
    - [en, zh]
    - [et, en]
    - [ne, en]
    - [ro, en]
    - [ru, en]
    - [si, en]
test:
  batchsize: 32
  pairs:
    - [en, de]
    - [en, zh]
    - [et, en]
    - [ne, en]
    - [ro, en]
    - [ru, en]
    - [si, en]

wmt21-qe-tudarmstadt's People

Contributors

eliasheltrop avatar

Stargazers

Himashi Rathnayake avatar Wei Zhao avatar

Watchers

Gregor Geigle avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.