
Comments (6)

LiyuanLucasLiu commented on May 23, 2024

I'm not sure what happened, but it seems that fb_lstm_out may not have the expected shape. My guess is that it is caused by some kind of "illegal input format".

Hope this helps :-)

from lm-lstm-crf.

erayyildiz commented on May 23, 2024

Hi Liyuan, thanks for your reply, but I don't think the problem is with my data. I am using a sample POS tagging dataset in the CoNLL format you described in the README file (word tag\n).

The shape of fb_lstm_out is [13, 10, 600]. 600 is the size of the output vector of the bidirectional LSTM layer and 10 is the batch size I am using. I think 13 is the number of words. Is the shape of fb_lstm_out correct? Do you have any advice on what else I should check?
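One way to sanity-check a shape like this is sketched below. It is only an illustration, not code from lm-lstm-crf: the helper name `check_lstm_output_shape` is hypothetical, the hidden size of 300 per direction is an assumption inferred from 600 = 2 x 300, and the (seq_len, batch, features) layout is the default for PyTorch's `nn.LSTM` when `batch_first` is not set.

```python
def check_lstm_output_shape(shape, batch_size, hidden_dim, bidirectional=True):
    """Validate a (seq_len, batch, features) shape and return seq_len.

    Hypothetical helper: hidden_dim is the per-direction hidden size,
    which is an assumption here (the thread only reports the total, 600).
    """
    if len(shape) != 3:
        raise ValueError(f"expected 3 dimensions, got {len(shape)}")
    seq_len, batch, features = shape
    # A bidirectional LSTM concatenates the forward and backward states.
    expected_features = hidden_dim * (2 if bidirectional else 1)
    if batch != batch_size:
        raise ValueError(f"batch dimension is {batch}, expected {batch_size}")
    if features != expected_features:
        raise ValueError(f"feature dimension is {features}, expected {expected_features}")
    return seq_len


# Shape reported in this thread; hidden_dim=300 is assumed, not confirmed.
print(check_lstm_output_shape((13, 10, 600), batch_size=10, hidden_dim=300))
```

If the check passes, the first dimension is the (possibly padded) sequence length for that batch, which would match the guess that 13 is the number of words.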


LiyuanLucasLiu commented on May 23, 2024

It seems correct. Have you tried the default datasets? I mean WSJ / CoNLL03 / CoNLL00 (#9). I think the first thing we should do is make sure it's not caused by the runtime environment.


erayyildiz commented on May 23, 2024

I tried the default dataset and got the same error again. I think it is about my environment. I have an IBM PowerAI machine whose CPU architecture is powerpc64le, which is quite different from Intel CPU architectures. Although I installed PyTorch on Anaconda for PowerPC, it seems there are still some environment problems with PyTorch.
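When an environment problem is suspected, it can help to post the basic facts about the machine and the PyTorch install. The snippet below is a minimal sketch of that; `environment_report` is a hypothetical helper name, and it only relies on the standard library plus `torch.__version__` and `torch.cuda.is_available()` when PyTorch can be imported.

```python
import importlib
import platform
import sys


def environment_report():
    """Collect basic facts about the runtime (hypothetical helper)."""
    report = {
        "machine": platform.machine(),      # CPU architecture string
        "platform": platform.platform(),
        "python": sys.version.split()[0],
    }
    try:
        torch = importlib.import_module("torch")
    except Exception as exc:
        # A missing or broken install ends up here; keep the message.
        report["torch"] = None
        report["torch_error"] = repr(exc)
    else:
        report["torch"] = str(torch.__version__)
        report["cuda_available"] = bool(torch.cuda.is_available())
    return report


for key, value in environment_report().items():
    print(f"{key}: {value}")
```

Including this output in a bug report makes it easier to tell a data problem apart from a problem with the installed libraries.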


LiyuanLucasLiu commented on May 23, 2024

Are you using the CPU for training? If so, you should set --gpu -1.
I don't think the architecture itself would be a big issue, but the dependent libraries may have some problems.
Besides, I would recommend re-installing PyTorch or Anaconda.
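To illustrate the convention behind --gpu -1, here is a rough sketch of how such a flag could be mapped to a device name. This is not the actual argument handling in lm-lstm-crf: `build_parser`, `device_name`, and the default value of 0 are assumptions made for the example.

```python
import argparse


def build_parser():
    # Hypothetical parser; only the --gpu flag name comes from this thread.
    parser = argparse.ArgumentParser(description="hypothetical training entry point")
    parser.add_argument("--gpu", type=int, default=0,
                        help="GPU index; a negative value selects the CPU")
    return parser


def device_name(gpu_index):
    """Map a GPU index to a device string; negative means CPU."""
    return "cpu" if gpu_index < 0 else f"cuda:{gpu_index}"


args = build_parser().parse_args(["--gpu", "-1"])
print(device_name(args.gpu))
```

With a convention like this, passing a negative index keeps everything on the CPU, which removes CUDA from the list of possible causes while debugging.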


erayyildiz commented on May 23, 2024

I tried both CPU and GPU for training and it did not make any difference. I will try reinstalling PyTorch and Anaconda and will let you know the results. Thanks.

