Name: Language Technology at the University of Helsinki
Type: Organization
Bio: Projects and resources developed in the Language Technology Research Group at the University of Helsinki.
Twitter: HelsinkiNLP
Location: Helsinki, Finland
Blog: https://blogs.helsinki.fi/language-technology/
Language Technology at the University of Helsinki's Projects
Dialectologically annotated and normalized dataset of dialectal Finnish tweets
Word-aligned version of the Norwegian Dialect Corpus
Additional Notebooks for the Building NLP Applications course
Data and scripts for a diagnostics test suite which allows to assess whether an NLU dataset constitutes a good testbed for evaluating the models' meaning understanding capabilities.
This repository contains data and scripts to reproduce the results from our paper: How Does Data Corruption Affect Natural Language Understanding Models? A Study on GLUE datasets.
The Open Parallel Corpus
API for searching corpora from OPUS
OPUS-CAT is a collection of software which make it possible to OPUS-MT neural machine translation models in professional translation. OPUS-CAT includes a local offline MT engine and a collection of CAT tool plugins.
c++ mosestokenizer (OPUS fork)
Index of resources in OPUS
OPUS repository interface
Open neural machine translation models and web services
Fast and secure translation on your local machine, powered by marian and Bergamot.
Translation Bot between Ukrainian and Czech.
development data for OPUS-MT
Makefile recipes shared between all leaderboard repos
A map of available translation models
benchmarks for evaluating MT models
Training open neural machine translation models
Translation demonstrator
OPUS website files
OpusFilter - Parallel corpus processing toolkit
A hub of OpusFilter configurations