qywu / ardm Goto Github PK
View Code? Open in Web Editor NEWAlternate Recurrent Dialog Model
Alternate Recurrent Dialog Model
in your "multiwoz/MultiWOZ Multi-Turn Train.ipynb" have read some processed multiwoz data, as
`with open("../yichi_data/clean_train_data.json") as f:
train_data = json.load(f)
with open("../yichi_data/val_data.json") as f:
val_data = json.load(f)
with open("../yichi_data/test_data.json") as f:
test_data = json.load(f)`
what do you do to get this data and you used multiwoz2.1 or multiwoz2.0 ? thanks
I know the context supported by GPT2 is 1024, but I assume there's some technique they utilized to train and generate dialogues longer than that in their results. Also, I saw many gpt2-based repos training text with length longer than 1024. Can you please explain what's necessary to train longer dialogues? And, would you consider implementing that?
I want to do some training in chinese and find your python lib "chinese-gpt", but I can't find any user guide about it. https://github.com/qywu/Chinese-GPT/tree/master/tutorials seems had be removed by you.
Hello! Just read your paper. Very interesting. When will it be ready for testing? I would like to try your model on open-domain chit-chat data. Do you think it would work well on it?
Keep up the good work.
if sum([len(item) for item in batch[0][1]]) > 1024:
train_data = [data[idx] for idx in indices[100:]]
val_data = [data[idx] for idx in indices[:100]]
self.tokenizer.max_len = 1500
# tokenizer weird behavior
I am faces few issues in ARDM Inference file of colab
I am getting error while using
model_A.load_state_dict(model_A_states)
model_B.load_state_dict(model_B_states).
Its unable to match all the keys.
I tried adding strict = False
model_A.load_state_dict(model_A_states,strict =False)
model_B.load_state_dict(model_B_states , strict = False)
The code runs but few keys don't match.
On proceeding in this manner I am facing error here
logits, past = model_A(prev_input, past=past)
error : past argument not supported. I tried removing past argument
but it gives error for the next line of code.
logits = logits[:, -1, :] / temperature
Can you please help me fix this code.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.