GithubHelp home page GithubHelp logo

moses-smt / giza-pp Goto Github PK

View Code? Open in Web Editor NEW
264.0 24.0 85.0 262 KB

GIZA++ is a statistical machine translation toolkit that is used to train IBM Models 1-5 and an HMM word alignment model. This package also contains the source for the mkcls tool which generates the word classes necessary for training some of the alignment models.

C++ 97.70% Makefile 0.78% C 1.14% Shell 0.38%

giza-pp's People

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

giza-pp's Issues

Installing Giza++ on Ubuntu 19.1

I entered the following commands and got the below error:

**git clone https://github.com/moses-smt/giza-pp.git

cd giza-pp

make**

/usr/bin/ld: HMMTables.cpp:(.text.startup+0x19): undefined reference to`std::ios_base::Init::~Init()'

collect2: error: ld returned 1 exit status

make[1]: *** [Makefile:37: GIZA++] Error 1

make[1]: Leaving directory '/home/user/giza-practice/giza-pp-master/GIZA++-v2'

make: *** [Makefile:7: gizapp] Error 2

segmentation-fault-core-dumped (died with signal 11, with coredump in moses)

I encountered the following error "died with signal 11, with coredump" when using giza, how can I solve it? my command is :

"./GIZA++ -S ../giza_example/news-commentary-v8.fr-en.clean.fr.vcb -T ../giza_example/news-commentary-v8.fr-en.clean.en.vcb -C ../giza_example/news-commentary-v8.fr-en.clean.fr_news-commentary-v8.fr-en.clean.en.snt -CoocurrenceFile ../giza_example/corp.cooc -outputpath ../giza_example"

Who can help me, thanks!

Null-Word Alignment Probabilities

hi, i want to know how to set the null-word probability in IBM Model 1, the probability of inserting a null after a source word. I have just seen the setting of IBM Model 3,
Hoping your response , thank you

installation problem

make -C GIZA++-v2
make[1]: Entering directory '/mnt/c/giza/giza-pp/GIZA++-v2'
mkdir optimized/
g++ -Wall -Wno-parentheses -O3 -funroll-loops -DNDEBUG -DWORDINDEX_WITH_4_BYTE -DBINARY_SEARCH_FOR_TTABLE -c Parameter.cpp -o optimized/Parameter.o
make[1]: g++: Permission denied
make[1]: *** [Makefile:92: optimized/Parameter.o] Error 127
make[1]: Leaving directory '/mnt/c/giza/giza-pp/GIZA++-v2'
make: *** [Makefile:7: gizapp] Error 2

Issues running plain2snt.out

I'm Trying to run plain2snt.out on two text files in colab, but I get the error /content/giza-pp/GIZA++-v2/plain2snt.out:/content/giza-pp/GIZA++-v2/plain2snt.out: cannot execute binary file.

I installed the library like this:
!git clone https://github.com/moses-smt/giza-pp.git
%cd /content/giza-pp/GIZA++-v2/ !make
and then ran plain2snt.out like this: !bash /content/giza-pp/GIZA++-v2/plain2snt.out /content/drive/MyDrive/big_corp_arabic.txt /content/drive/MyDrive/big_corp_english.txt.

I missing something? I'm working in Google Colab by the way.

why my *.A3.* output file looks like this?

1 1 8 100 1
5 2 8 100 1
4 3 8 100 1
2 4 8 100 1
5 5 8 100 1
4 6 8 100 1
4 7 8 100 1
0 8 8 100 1
3 9 8 100 1

should it be same sentense?
not sure if someone will reply

c++ library

hi guys, i'm trying to make "make" (haha lol), but i'm getting this error:

./mystl.h:28:10: fatal error: 'tr1/unordered_map' file not found

this is for the calling in line 28 in file mystl.h

please pin me if you can help me!.

Can giza++ work as train-test mode?

Assume that now I have lots of parallel sentences, I can use giza++ to obtain the parameters and the align result.
If I have other sentences in future, can I use the obtained parameters above to obtain the align result? (without retraining)

How can I solve"std::bad_aloc"

When I raise the number of parallel data to 30000+, the terminal shows"terminate called after throwing an instance of 'std::bad_alloc'
what(): std::bad_alloc".

installation problem

I want to use it on windows through Cygwin ,but i am unable to run makefile.

Please convey if any knowing.
Thanks in anticipation

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.