leviswind / pytorch-transformer Goto Github PK

View Code? Open in Web Editor NEW

236.0 3.0 57.0 254 KB

pytorch implementation of Attention is all you need

Python 100.00%

pytorch attention-is-all-you-need translation transformer

pytorch-transformer's Issues

How to add some functionalities to this code?

Hi. Please i would like to add some features to this code that i have read on beam search. Like coverage penalty and length normalization. But i don't know where to start. Can you help please?

there are many errors in this implementation, it's really difficult to run it.

1st, backend.Embedding is removed in pytorch.
2nd, the tensors in this repo are not in the same device type, this really messed up the code, I tried to modify this device the tensors in the code stored on, but the more I modify, the more errors it raised, finally I have no choice but giving up read and deploy this code. I suggest you try to define you code purely on cpu, not adding many codes in this repo to clarify than you run some operations on cuda, this really mess this repo up if you didn't make it clear that all the operation is compatible with the storing device type of all the variables.

Problem NotImplementedError？

When I run the train.py，report this error

How to solve this problem？THKS

Error whilte running train.py

Traceback (most recent call last):
File "train.py", line 90, in
train()
File "train.py", line 81, in train
writer.export_scalars_to_json(hp.model_dir + '/all_scalars.json')
AttributeError: 'SummaryWriter' object has no attribute 'export_scalars_to_json'

How can i correct it, please?

https://github.com/leviswind/pytorch-transformer/blob/master/modules.py#L69

https://github.com/Kyubyong/transformer/blob/master/modules.py#L36

AttributeError: Could not find function class for [Embedding]

Am I running the train file because of the pytorch version problem?

leviswind / pytorch-transformer Goto Github PK

pytorch-transformer's Issues

How to add some functionalities to this code?

Hi. Please i would like to add some features to this code that i have read on beam search. Like coverage penalty and length normalization. But i don't know where to start. Can you help please?

there are many errors in this implementation, it's really difficult to run it.

Problem NotImplementedError？

Error whilte running train.py

can eval.py infer on one sentence?

Question of the parameter 'sinusoid'

A little misktake in modules.py

A little difference between tf version and your code in layer_normalization

AttributeError: Could not find function class for [Embedding]

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs