Comments (6)
I'm not sure what happened. But it seems that the fb_lstm_out
may not be the expected shape. It could be (my guess) caused by somehow ''illegal input format''
Hope it can help you :-)
from lm-lstm-crf.
Hi Liyuan, thanks for your reply but I don't think so the problem is about my data. I am using a samle pos tagging dataset which is in conll format you described in readme file (word tag\n).
The shape of fb_lstm_out
is [13, 10, 600]
. 600 is the output vector of bidirectional lstm layer and 10 is batch size I am using. I think 13 is the number of words. Is the shape of fb_lstm_out
correct? Do you have any advice for me, what should I check else?
from lm-lstm-crf.
It seems correct. Have you tried the default dataset? i mean wsj / conll03 / conll00 #9 . I think the first thing we should do is to make sure it's not caused by runtime environments.
from lm-lstm-crf.
I tried the default dataset and the I got the same error again. I think it is about my environment. I have an IBM PowerAI machine and its cpu architecture is powerpc64le which is quite different than intel cpu architectures. Although I install pytorch on anaconda for powerpc, it seems there are still some environmental problems with pytorch.
from lm-lstm-crf.
Are you using cpu for the training? If so, you should set --gpu -1
.
I think the architecture would not be a big issue, but the dependent libraries may have some problems.
Besides, i would recommend you to re-install pytorch or anaconda.
from lm-lstm-crf.
I tried both cpu and gpu for training and it did not make any changes. I will try to reinstalling pytorch and anaconda. I will inform you about the results. Thanks.
from lm-lstm-crf.
Related Issues (20)
- train_w.py Error HOT 2
- Mismatch of performance between this repository and the paper HOT 6
- How can I include handcrafted features in NER ?
- evaluator.calc_score HOT 3
- where is the dictory './data'
- AttributeError: module 'torch._C' has no attribute '_cuda_setDevice' HOT 3
- KeyError in predictor.py class predict HOT 1
- Question about POS performance HOT 2
- How to deal with sentences with different lengths?
- train_w.py Error, TypeError: can't convert np.ndarray of type numpy.object_ HOT 2
- Is word level bi-lstm reflected in the code ? HOT 1
- About the score given a sequence and a target
- anyone can update to pytorch1.0
- Missing "eval_batch" in train_w.py line 163 HOT 1
- RuntimeError: HOT 1
- dropout
- RuntimeError: expand(torch.LongTensor{[50, 1]}, size=[50]): the number of sizes provided (1) must be greater or equal to the number of dimensions in the tensor (2)
- How do you tune the model to get a large # of keywords outputted by the CRF layer? HOT 1
- Incorrect Precision Output for test_rec HOT 1
- Line 530 in utils.py is too slow with huge datasets HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from lm-lstm-crf.