adamklec / copynet
An implementation of CopyNet
Home Page: https://arxiv.org/abs/1603.06393
Sorry to bother you, but what is the "pt" file and where can I download it? Thank you very much!
Thanks for sharing the code. When I try to run it, I create the datasets following your description: each file should have two lines of text, the first being the input sequence and the second the target output sequence. However, I get an error saying that the file cleaned_first_names.txt is missing. Could you please provide the data files so that we can rerun the code? Thanks again!
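The data format described above (two lines per file: input sequence, then target output sequence) can be illustrated with a minimal sketch. The file name and sentences here are hypothetical examples, not files from the repository:

```python
# Hypothetical example of the described two-line data file format:
# line 1 = input sequence, line 2 = target output sequence.
sample = "the quick brown fox\nthe fox\n"
with open("example_pair.txt", "w") as f:
    f.write(sample)

# Reading it back recovers the (input, target) pair.
with open("example_pair.txt") as f:
    input_seq, target_seq = f.read().splitlines()
print(input_seq)   # the quick brown fox
print(target_seq)  # the fox
```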
Excuse me, after running your code on my own dataset, I found that the validation loss goes up while the training loss goes down. Have you ever encountered this situation when working with your own dataset?
Thanks for sharing
Code starts here
transformed_hidden2 = self.copy_W(output).view(batch_size, self.hidden_size, 1)
copy_score_seq = torch.bmm(encoder_outputs, transformed_hidden2)  # NOTE: this is linear; an activation function should be applied before multiplying
copy_scores = torch.bmm(torch.transpose(copy_score_seq, 1, 2), one_hot_input_seq).squeeze(1)  # [b, vocab_size + seq_length]
missing_token_mask = (one_hot_input_seq.sum(dim=1) == 0)  # tokens not present in the input sequence
missing_token_mask[:, 0] = 1  # token index 0 is not part of any sequence
copy_scores = copy_scores.masked_fill(missing_token_mask, -1000000.0)
gen_scores = self.out(output.squeeze(1))  # [b, vocab_size]
gen_scores[:, 0] = -1000000.0  # penalize token index 0 in generate mode too
I have some issues with your above computation of copy_scores and gen_scores. Please let me know if I am wrong anywhere.
1.) In the computation of copy_scores, the paper says to multiply encoder_outputs by a weight matrix, apply an activation function, and then multiply by the decoder RNN's hidden state. Your code does something quite different: it multiplies the weight matrix by the output of the decoder RNN and then multiplies the result with encoder_outputs. There is no non-linearity here.
2.) In the gen_scores computation, your code multiplies the output by a weight matrix, whereas the paper says to compute it the way it's done in the attentional RNN encoder-decoder, but between the one-hot encoding of the word and the decoder RNN's hidden state. This is quite different from your implementation.
Can you please let me know if I misunderstood anything?
Thanks in advance!
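For reference, the paper's copy-score formulation described in point 1 (project the encoder states, apply a tanh non-linearity, then take the dot product with the decoder state) can be sketched roughly as follows. All shapes and tensor names here are hypothetical stand-ins, not the repo's actual variables:

```python
import torch

torch.manual_seed(0)
batch_size, seq_len, hidden_size = 2, 5, 8

# Hypothetical stand-ins for the model's activations:
encoder_outputs = torch.randn(batch_size, seq_len, hidden_size)  # h_j for each source position
decoder_state = torch.randn(batch_size, hidden_size, 1)          # s_t, the decoder hidden state

copy_W = torch.nn.Linear(hidden_size, hidden_size, bias=False)   # W_c in the paper

# Paper's order of operations: tanh(h_j W_c) . s_t
# -- the non-linearity is applied to the projected encoder states
#    *before* the dot product with the decoder state.
projected = torch.tanh(copy_W(encoder_outputs))                  # [b, seq_len, hidden]
copy_scores = torch.bmm(projected, decoder_state).squeeze(2)     # [b, seq_len]
print(copy_scores.shape)  # torch.Size([2, 5])
```

This yields one copy score per source position, which would then be scattered into the extended vocabulary, as in the snippet above.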
Hi
The code is not working at all; I get several errors in evaluate. Could you please provide some data and put together a minimal working example?
thanks
Hi
I am training this model and I see the BLEU score going up and down all the time, and the validation loss as well. Does this code work at all? Thanks.
Hi, thanks for your code. Have you ever tested the model on any dataset and achieved the same performance as the paper?
Emmm, thank you.
I wonder if this implementation is intended for text summarization? Could you share a toy dataset so that I can build my own dataset, or just a data example?