GithubHelp home page GithubHelp logo

versteisch-bahnhof's Introduction

versteisch-bahnhof

Find the full description of the hands-on task here: https://tiny.cc/versteisch-bahnhof

versteisch-bahnhof is a Swiss German dialect predictor using TF-IDF vector representations and a Random Forest classifier.

The evaluation is based on a publicly available Swiss German kaggle competition. This dataset is based on four different dialects:

BE Bernese
LU Lucerne
ZH Zurich
BS Basel

Whereby the training set consists of 15573 example sentences, wheres as the test set consists of 2499 example sentences.

Requirements

Python3 is required.

First, install pipenv using pip:

pip install --user pipenv

Installation

To load all dependencies into an own virtual environment:

pipenv install

Next, you can import the created virtual environment into your preferred IDE and activate it in your shell:

pipenv shell

Usage

You can train the model either by train_dialect (fixed parameter setting) or train_dialect_hyperparameter (grid search over different parameter settings). In both cases, the best parameters are logged to the console.

versteisch-bahnhof's People

Contributors

posedge avatar kutkopy avatar

Stargazers

 avatar  avatar

Watchers

James Cloos avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.