GithubHelp home page GithubHelp logo

mgalley / dstc7-end-to-end-conversation-modeling Goto Github PK

View Code? Open in Web Editor NEW
174.0 14.0 31.0 33.79 MB

Grounded conversational dataset for end-to-end conversational AI (official DSTC7 data)

Home Page: http://workshop.colips.org/dstc7/

Python 92.91% Shell 1.61% Makefile 5.48%
dialogue conversational-ai dataset end-to-end-machine-learning dialogue-systems

dstc7-end-to-end-conversation-modeling's Introduction

DSTC7: End-to-End Conversation Modeling

DSTC7 has ended on January 27, 2019. This github project is still available 'as is', but we unfortunately no longer have time to maintain the code or to provide assistance with this project.

News

  • 10/29/2018: Spreadsheet containing indivdual judgments used for human evaluation.
  • 10/23/2018 and 10/15/2018: Automatic and human evaluation results posted. The code to reproduce the automatic evaluation and get the same scores can be found here.
  • 10/8/2018: Participants submitted system outputs.
  • 9/10/2018-10/8/2018: Evaluation phase, instructions here.
  • 7/11/2018: An FAQ section has been added to the data extraction page.
  • 7/1/2018: Official training data is up.
  • 6/18/2018: Trial data is up.
  • 6/1/2018: Task description is up.
  • 6/1/2018: Registration for DSTC7 is now open.

Registration

Please register [here] Registration has now closed.

Task

This DSTC7 track presents an end-to-end conversational modeling task, in which the goal is to generate conversational responses that go beyond trivial chitchat by injecting informative responses that are grounded in external knowledge. This task is distinct from what is commonly thought of as goal-oriented, task-oriented, or task-completion dialog in that there is no specific or predefined goal (e.g., booking a flight, or reserving a table at a restaurant). Instead, it targets human-like interactions where the underlying goal is often ill-defined or not known in advance, of the kind seen, for example, in work and other productive environments (e.g.,brainstorming meetings) where people share information.

Please check this description for more details about the task, which follows our previous work "A Knowledge-Grounded Neural Conversation Model" and our original task proposal.

Data

We extend the knowledge-grounded setting, with each system input consisting of two parts:

  • Conversational data from Reddit.
  • Contextually-relevant “facts”, taken from the website that started the (Reddit) conversation.

Please check the data extraction for the input data pipeline. Note: We are providing scripts to extract the data from a Reddit dump, as we are unable to release the data directly ourselves.

Evaluation

As described in the task description (Section 4), We will evaluate response quality using both automatic and human evaluations on two criteria.

  • Appropriateness;
  • Informativeness.

We will use automatic evaluation metrics such as BLEU and METEOR to have preliminary score for each submission prior to the human evaluation. Participants can also use these metrics for their own evaluations during the development phase. We will allow participants to submit multiple system outputs with one system marked as “primary” for human evaluation. We will provide a BLEU scoring script to help participants decide which system they want to select as primary.

We will use crowdsourcing for human evaluation. For each response, we ask humans if it is an (1) appropriate and (2) informative response, on a scale from 1 to 5. The system with best average Appropriateness and Informativeness will be determined the winner.

Baseline

A standard seq2seq baseline model will be provided soon.

Timeline

Phase Dates
1. Development Phase June 1 – September 9
      1.1 Code (data extraction code, seq2seq baseline) June 1
      1.2 "Trial" data made available June 18
      1.3 Official training data made available July 1
2. Evaluation Phase September 10 – October 8
      2.1 Test data made available September 10
      2.2 Participants submit their system outputs October 8
3. Results are released October
      3.1 Automatic scores (BLEU, etc.) October 16
      3.2 Human evaluation October 23

Organizers

Reference

If you submit any system to DSTC7-Task2 or publish any other work making use of the resources provided on this project, we ask you to cite the following task description paper:

Michel Galley, Chris Brockett, Xiang Gao, Bill Dolan, Jianfeng Gao. End-to-End conversation Modeling: DSTC7 Task 2 Description. In DSTC7 workshop (forthcoming).

Contact Information

  • For questions specific to Task 2, you can contact us at [email protected]. (No longer maintained.)
  • You can get the latest updates and participate in discussions on DSTC mailing list.

dstc7-end-to-end-conversation-modeling's People

Contributors

dstc-msr-nlp avatar mgalley avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

dstc7-end-to-end-conversation-modeling's Issues

error "http.client.RemoteDisconnected: Remote end closed connection without response"

many errors in xxx.err files:

Traceback (most recent call last):
File "/home/ypruan/local_work/DSTC7-End-to-End-Conversation-Modeling/data_extraction_official/src/commoncrawl.py", line 74, in download
u = urllib.request.urlopen(iurl)
File "/home/ypruan/anaconda3/lib/python3.6/urllib/request.py", line 223, in urlopen
return opener.open(url, data, timeout)
File "/home/ypruan/anaconda3/lib/python3.6/urllib/request.py", line 526, in open
response = self._open(req, data)
File "/home/ypruan/anaconda3/lib/python3.6/urllib/request.py", line 544, in _open
'_open', req)
File "/home/ypruan/anaconda3/lib/python3.6/urllib/request.py", line 504, in _call_chain
result = func(*args)
File "/home/ypruan/anaconda3/lib/python3.6/urllib/request.py", line 1346, in http_open
return self.do_open(http.client.HTTPConnection, req)
File "/home/ypruan/anaconda3/lib/python3.6/urllib/request.py", line 1321, in do_open
r = h.getresponse()
File "/home/ypruan/anaconda3/lib/python3.6/http/client.py", line 1331, in getresponse
response.begin()
File "/home/ypruan/anaconda3/lib/python3.6/http/client.py", line 297, in begin
version, status, reason = self._read_status()
File "/home/ypruan/anaconda3/lib/python3.6/http/client.py", line 266, in _read_status
raise RemoteDisconnected("Remote end closed connection without"
http.client.RemoteDisconnected: Remote end closed connection without response
During handling of the above exception, another exception occurred:
Traceback
(most recent call last):
File "src/create_official_data.py", line 246, in get_submissions
s = add_webpage(s, year, month)
File "src/create_official_data.py", line 151, in add_webpage
return add_cc_webpage(submission, year, month)
File "src/create_official_data.py", line 156, in add_cc_webpage
src, date = cc.download(url, year, month, False)
File "/home/ypruan/local_work/DSTC7-End-to-End-Conversation-Modeling/data_extraction_official/src/commoncrawl.py", line 121, in download
traceback.print_exc()
NameError: name 'traceback' is not defined

It seems that remote server reject the local request, how to solve this?

Common Crawl error code 503/ 502

Hi,

Thank you for releasing the codes for data extraction. I am extracting the data based on your scripts and I noted some errors in the log file. Most of them are Common Crawl error code 502/503 and there seems to be 5 retry attempts.

Will this affect the quality of my dataset? Do I need to run the scripts again?

A sample logs are show below:
Common Crawl error code 502, waiting 3 seconds... (retry attempt 1/5), url: http://index.commoncrawl.org/CC-MAIN-2013-20-index?url=http%3A%2F%2Fwikipedia.org%2Fwiki%2FErnest_Hemingway%23Cuba_and_the_Nobel_Prize%2F&output=json Common Crawl error code 503, waiting 3 seconds... (retry attempt 1/5), url: http://index.commoncrawl.org/CC-MAIN-2015-22-index?url=http%3A%2F%2Fwww.dailymotion.com%2Fvideo%2Fxx2dlk_y2-2yyyyyy_lifestyle%23from%3Dembediframe%2F&output=json Common Crawl error code 503, waiting 3 seconds... (retry attempt 1/5), url: http://index.commoncrawl.org/CC-MAIN-2015-27-index?url=http%3A%2F%2Fwww.dailymotion.com%2Fvideo%2Fxx2dlk_y2-2yyyyyy_lifestyle%23from%3Dembediframe%2F&output=json Common Crawl error code 503, waiting 3 seconds... (retry attempt 1/5), url: http://index.commoncrawl.org/CC-MAIN-2016-40-index?url=http%3A%2F%2Fwww.dailymotion.com%2Fvideo%2Fxx2dlk_y2-2yyyyyy_lifestyle%23from%3Dembediframe%2F&output=json Common Crawl error code 503, waiting 3 seconds... (retry attempt 1/5), url: http://index.commoncrawl.org/CC-MAIN-2017-09-index?url=http%3A%2F%2Fwww.dailymotion.com%2Fvideo%2Fxx2dlk_y2-2yyyyyy_lifestyle%23from%3Dembediframe%2F&output=json Common Crawl error code 503, waiting 3 seconds... (retry attempt 1/5), url: http://index.commoncrawl.org/CC-MAIN-2017-13-index?url=http%3A%2F%2Fwww.dailymotion.com%2Fvideo%2Fxx2dlk_y2-2yyyyyy_lifestyle%23from%3Dembediframe%2F&output=json Common Crawl error code 503, waiting 3 seconds... (retry attempt 1/5), url: http://index.commoncrawl.org/CC-MAIN-2017-17-index?url=http%3A%2F%2Fwww.dailymotion.com%2Fvideo%2Fxx2dlk_y2-2yyyyyy_lifestyle%23from%3Dembediframe%2F&output=json Common Crawl error code 503, waiting 3 seconds... (retry attempt 1/5), url: http://index.commoncrawl.org/CC-MAIN-2017-51-index?url=http%3A%2F%2Fwww.dailymotion.com%2Fvideo%2Fxx2dlk_y2-2yyyyyy_lifestyle%23from%3Dembediframe%2F&output=json Common Crawl error code 503, waiting 3 seconds... (retry attempt 1/5), url: http://index.commoncrawl.org/CC-MAIN-2018-17-index?url=http%3A%2F%2Fwww.dailymotion.com%2Fvideo%2Fxx2dlk_y2-2yyyyyy_lifestyle%23from%3Dembediframe%2F&output=json Common Crawl error code 503, waiting 3 seconds... (retry attempt 1/5), url: http://index.commoncrawl.org/CC-MAIN-2014-10-index?url=http%3A%2F%2Fwww.dailymotion.com%2Fvideo%2Fxx2dlk_y2-2yyyyyy_lifestyle%23from%3Dembediframe%2F&output=json Common Crawl error code 502, waiting 3 seconds... (retry attempt 1/5), url: http://index.commoncrawl.org/CC-MAIN-2014-10-index?url=http%3A%2F%2Fen.wikipedia.org%2Fwiki%2FHaggis%23Outside_Scotland%2F&output=json

ERROR 404: Not Found

Hi,
I am currently collecting the DSTC7 training dataset.
However, there is a problem in DSTC7 collecting code so that I cannot collect 2015~2016 data with the instructions according to the DSTC7 website.
I think it is because the filename extensions, which were .bz2, have been modified to .zst in 2019.
file link
RS_2015-01 error log.txt
It would be appreciated if you could tell me how to collect those files into my dataset.
Thank you.

Best regards,
Jieun Han

"can't fetch: [http//:xxxxxx]"

Can't fetch: [http://www.skysports.com/iframe/widget/video/4zczVidzpB39HP6au2nXU-nGqXbxvow1#ooid=4zczVidzpB39HP6au2nXU-nGqXbxvow1] submission month: [2015-09]

in xxx.log files.

does this mean there is no such history in remote server?

how the "makefile" generate 'dev' file?

Hi, I have looked into "makefile " and "src/makefile.offitial.targets", and I found it can generate 'train' files , but i cannot find any statement in these two files that used to derive 'dev' files. I'm not familiar with "make", can you make some explanation for me?

error "Makefile:16: recipe for target 'data-official/2011-09.convos.txt' failed"

when I ran the "make -j4", the following error rised:
Makefile:16: recipe for target 'data-official/2011-09.convos.txt' failed make: *** [data-official/2011-09.convos.txt] Hangup Makefile:16: recipe for target 'data-official/2011-10.convos.txt' failed make: *** [data-official/2011-10.convos.txt] Hangup Makefile:16: recipe for target 'data-official/2011-11.convos.txt' failed make: *** [data-official/2011-11.convos.txt] Hangup Makefile:16: recipe for target 'data-official/2011-12.convos.txt' failed make: *** [data-official/2011-12.convos.txt] Hangup
then the crawling process exit. Is this due to the unstable internet connection?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.