Official repository for the Opensource Text dataset for NMT for local languages in West Africa (EWE Corpus) and implement the Yodi model after hand.
Note: This repository will evolve into the official repository for the Yodi model, once the necessary data is gathered.
• Develop a Machine Translation Text and Speech Dataset NMT for local languages in West Africa (EWE Corpus)
-> Develop & Measure the accuracy or the performance of the Yodi model from this dataset for text-to-text translation.
Remark: Getting accurate data and labeled data from available sources online or in local written papers would be necessary for machine sentence translation.
####### -> Develop the metrics for the dataset, How many total sentences for each file, and how many words?
Feel free to add your labeled or unlabeled NMT data to the repository and perform the measurements as well.
Instructions are available in project_contributions_instructions.txt
Please register at https://sites.google.com/umbaji.org/yodi/home to build the largest NMT text Dataset for West Africa.