GithubHelp home page GithubHelp logo

cord19's Introduction

CORD-19 Information Aggregator

by Graham Neubig, Emma Strubell, Zhengbao Jiang, Zi-Yi Dou and others at the Carnegie Mellon University Language Technologies Institute

View the extracted information here

This is a tool to browse answers the scientific literature may provide regarding various questions about the novel coronavirus and COVID-19. Click the questions below to see a list of answers with links to the sources that provided them.

We are looking for help improving this tool! If you are familiar with reading the medical literature and could give fine-grained feedback please contact us at [email protected]. Or, if you are a programmer and could help please feel free to contribute to this repository!

Information extraction script

See the extraction directory.

Docs

Data extraction

Our data is based on the CORD-19 dataset. Ask the authors of this repository for access if you're interested.

  • text-only/*.oie

    Each line has <sub, rel, obj> triplets extracted from the corresponding sentence in text-only/*.sent. Triplets are separated by \t and each triplet is of the format subject|||relation|||object, e.g. he#0,2|||'s on#3,8|||outside#13,20. Subjects, relations, and objects are continuous spans of tokens in the format of text#start_char,end_char, e.g. he#0,2. Note that text of the span might only be a substring of the string spanning from start_char to end_char.

cord19's People

Contributors

jzbjyb avatar neubig avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

cord19's Issues

Better display of extracted information

A few improvements to the display of extracted information may be warranted:

  1. Make it clear what keywords are being searched for each of the questions.
  2. Better explanation of the OpenIE extractions. Maybe highlight the words in the extracted sentences.

Make it possible to do regexes over noun phrases in text

Currently regexes in text can't be on the borders of the template. For example the following is not allowed:

[X] is a risk factor of [Y]

It would be nice if we could do regexes that matched NPs, so we would be able to solve this issue.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.