GithubHelp home page GithubHelp logo

Comments (7)

kewlcoder avatar kewlcoder commented on June 17, 2024 1

@glample - Sir, if I am not wrong, the dataset you provided,namely, eng.train, eng.testa and eng.testb is free but copywrited by Reuters.
I would like to suggest you to add this warning to that particular commit so that people take proper precautions before using it.

from tagger.

zero76114 avatar zero76114 commented on June 17, 2024

@glample : Can you help us?
Your work is completely new and interesting. I am really new with Python and Theano, so I don't know how I can rerun your work. Please help me more detail.
Thanks

from tagger.

a455bcd9 avatar a455bcd9 commented on June 17, 2024

@leefionglee I think it's this data set: http://www.cnts.ua.ac.be/conll2003/ner/

The English data is a collection of news wire articles from the Reuters Corpus. The annotation has been done by people of the University of Antwerp. Because of copyright reasons we only make available the annotations. In order to build the complete data sets you will need access to the Reuters Corpus. It can be obtained for research purposes without any charge from NIST.

from tagger.

zero76114 avatar zero76114 commented on June 17, 2024

@a455bcd9 Thank you. I try to download and preprocessing data Conll2003 and after I get 3 file for English Reuter: eng.train, eng.testa and eng.testb.
@glample can you show me how i can change train.txt , dev.test and test.txt;

from tagger.

prashant-puri avatar prashant-puri commented on June 17, 2024

@glample Hey can you help me with creation of train.txt , dev.test and test.txt.
@zero76114 Hey can you help me with creation of train.txt , dev.test and test.txt.

from tagger.

Zhangzirui avatar Zhangzirui commented on June 17, 2024

@zero76114 Hi, can you help me with creation of train.txt , dev.test and test.txt.
@prashant-puri Hi, can you help me with creation of train.txt , dev.test and test.txt.

from tagger.

glample avatar glample commented on June 17, 2024

The dataset is now available on the repo.

from tagger.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.