This dataset includes:
- A Gazetteer of tokens and NE tags annotated by 3 domain experts
- A Corpus of 192,000 job titles crawled from LinkedIn, with NE tags prefixed using the BIOES scheme
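
BIOES prefixes mark where each entity starts and ends: B (begin), I (inside), E (end), S (single-token entity), and O (outside any entity). As a minimal sketch of how such tags could be read back into entity spans, the function below groups tokens by their prefixes. The function name `bioes_to_spans`, the `PREFIX-LABEL` tag layout, and the labels `RES` and `FUN` in the example are assumptions for illustration, not the documented format of the IPOD files.

```python
from typing import List, Tuple


def bioes_to_spans(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Group BIOES-tagged tokens into (label, text) entity spans.

    Assumes each tag looks like "B-LABEL", "I-LABEL", "E-LABEL",
    "S-LABEL", or "O"; the actual IPOD tag layout may differ.
    """
    spans: List[Tuple[str, str]] = []
    current_tokens: List[str] = []
    current_label = None

    for token, tag in zip(tokens, tags):
        prefix, _, label = tag.partition("-")
        if prefix == "S":
            # Single-token entity: emit it immediately.
            spans.append((label, token))
            current_tokens, current_label = [], None
        elif prefix == "B":
            # Start of a multi-token entity.
            current_tokens, current_label = [token], label
        elif prefix == "I" and current_label == label:
            # Continuation of the open entity.
            current_tokens.append(token)
        elif prefix == "E" and current_label == label:
            # End of the open entity: emit the joined tokens.
            current_tokens.append(token)
            spans.append((label, " ".join(current_tokens)))
            current_tokens, current_label = [], None
        else:
            # "O" or an inconsistent tag: drop any partially built span.
            current_tokens, current_label = [], None

    return spans


# Hypothetical job title and tags, for illustration only.
example_tokens = ["senior", "software", "engineer"]
example_tags = ["S-RES", "B-FUN", "E-FUN"]
example_spans = bioes_to_spans(example_tokens, example_tags)
```

Inconsistent sequences, such as an `I-` tag with no preceding `B-` tag, are silently discarded here; a stricter reader might raise an error instead.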
Please cite the following paper when using IPOD:
@misc{liu2019ipod,
    title={IPOD: Corpus of 190,000 Industrial Occupations},
    author={Junhua Liu and Chu Guo and Yung Chuen Ng and Kristin L. Wood and Kwan Hui Lim},
    year={2019},
    eprint={1910.10495},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}