
kexuanzhang / chinese-idioms-translation-with-neural-network


A project that investigates more accurate English translation of Chinese idioms with neural networks

Jupyter Notebook 100.00%


Chinese-Idioms-Translation-with-Neural-Network

This is a project for the course CSC413 (Winter 2023) at the University of Toronto, taught by Jimmy Ba and Bo Wang. It was produced by Kexuan Zhang and William Zhang.

With the advent of large language models such as GPT-3 and BERT, natural language processing has achieved remarkable performance. However, on tasks that require picking up underlying meanings, most language models fail to maintain good accuracy. One example is the translation of Chinese idioms, which have a compact form (normally four characters) yet carry deep-seated metaphorical meanings. In this project, we therefore fine-tuned two pre-trained models, Davinci and M2M100, to produce translations that capture the underlying meanings of Chinese idioms, and then analyzed the performance of each using three metrics: BLEU, METEOR, and COMET.

Quick Guide

  • The Models folder contains the ipynb files that demonstrate the process of fine-tuning the Davinci and M2M100 models
  • The Evaluation folder contains the ipynb file that demonstrates the process of testing the two fine-tuned models and calculating the BLEU, METEOR, and COMET metrics
  • The dataset we used to train the models: PETCI data
  • The report file is the paper that summarizes this project
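
To give a feel for why surface-overlap metrics such as BLEU are hard on literal idiom translations, here is a minimal sketch of a sentence-level BLEU score in plain Python. It is not the code from the Evaluation notebook: the function names `ngrams` and `sentence_bleu`, the whitespace tokenization, the single reference, and the add-one smoothing for higher-order n-grams are all assumptions made for illustration, so its scores will not match those of a standard BLEU implementation.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def sentence_bleu(reference, hypothesis, max_n=4):
    """Simplified single-reference BLEU (illustrative, not the notebook's code)."""
    ref, hyp = reference.split(), hypothesis.split()
    if not hyp:
        return 0.0
    log_precisions = []
    for n in range(1, max_n + 1):
        hyp_counts, ref_counts = ngrams(hyp, n), ngrams(ref, n)
        # Clipped matches: an n-gram counts at most as often as it occurs in the reference.
        matches = sum(min(c, ref_counts[g]) for g, c in hyp_counts.items())
        total = sum(hyp_counts.values())
        if n == 1:
            if matches == 0:
                return 0.0  # no shared words at all
            precision = matches / total
        else:
            # Assumed add-one smoothing so one missing higher-order n-gram does not zero the score.
            precision = (matches + 1) / (total + 1)
        log_precisions.append(math.log(precision))
    # Brevity penalty: penalize hypotheses shorter than the reference.
    brevity_penalty = 1.0 if len(hyp) > len(ref) else math.exp(1 - len(ref) / len(hyp))
    return brevity_penalty * math.exp(sum(log_precisions) / max_n)


# Hypothetical example for the idiom 画蛇添足 (literally "draw a snake and add feet").
reference = "ruin the effect by adding something superfluous"
literal = "draw a snake and add feet to it"
figurative = "ruin the effect by adding too much"

print(sentence_bleu(reference, literal))
print(sentence_bleu(reference, figurative))
```

In this toy example the literal rendering shares no words with the figurative reference and scores zero, while a translation that captures the underlying meaning scores higher.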

