Scientific Named Entity Recognition (NER) Dataset
Judged dataset for NER in scientific documents
The dataset contains the following files:
SIGIR_judged.csv
- Judged n-grams for the SIGIR 2012 collection, in the format n-gram,paper title,IS_VALID, where IS_VALID is either 0 or 1.
Physics_judged.csv
- Judged n-grams for the Physics (arXiv HEP-PH) collection, in the format n-gram,arxiv ID,IS_VALID, where IS_VALID is either 0 or 1.
maxent_last20.csv
- Entities extracted from the SIGIR collection by a Maximum Entropy classifier, without judgements. The judgements are located in SIGIR_judged.csv.
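
The judged files can be read with a few lines of Python. The snippet below is a minimal sketch, not part of the dataset: the names `JudgedNgram` and `read_judged` are hypothetical, and it assumes that the first field is the n-gram, the last field is IS_VALID, and everything in between belongs to the paper title or arXiv ID (so an unquoted comma inside a title does not break parsing). Whether the files have a header row is not stated here, so rows whose last field is not 0 or 1 are skipped.

```python
import csv
from typing import Iterable, List, NamedTuple


class JudgedNgram(NamedTuple):
    ngram: str
    source: str      # paper title (SIGIR) or arXiv ID (Physics)
    is_valid: bool   # True when IS_VALID is 1


def read_judged(lines: Iterable[str]) -> List[JudgedNgram]:
    """Parse lines in the format n-gram,<title or ID>,IS_VALID."""
    rows = []
    for fields in csv.reader(lines):
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        label = fields[-1].strip()
        if label not in ("0", "1"):
            continue  # e.g. a header row, if the file has one
        # Assumption: extra commas belong to the middle (title) field.
        source = ",".join(fields[1:-1]).strip()
        rows.append(JudgedNgram(fields[0].strip(), source, label == "1"))
    return rows


# Hypothetical rows, made up for illustration only (not taken from the dataset).
sample = [
    "query expansion,An Example Paper Title,1",
    "in this paper,Another Title, With a Comma,0",
]
judged = read_judged(sample)
valid = [row.ngram for row in judged if row.is_valid]
```

To read a real file, pass an open file handle instead of the sample list, for example `with open("SIGIR_judged.csv", newline="", encoding="utf-8") as f: judged = read_judged(f)`. The UTF-8 encoding is an assumption and may need adjusting.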