Comments (7)
Hi,
I tried once to add gazetteers, and it was improving the score a little (not too much in English, but I think it was significant for some of the 4 CoNLL languages). However, I tried to simplify the code here and I didn't include this optional feature...
from tagger.
Thanks for the reply @glample . However for my task, I am planning on adding the gazetteer features as well as I believe it could improve my results. If you still have the gazetteers code, let me know if you could provide it(Atleast something to start off). Else, if I plan to make changes to the repo to add gazetteers features as well and send a PR, so that if somebody else needs to use it, they can.
from tagger.
Unfortunately I won't have much time this summer to work on this, but if you want to update the code and send a PR that would be great. I can help you with that. I can also send you a version of the code I used when I was using gazetteers, it's kind of dirty, but the code related to the gazetteers part is pretty short and should be easy to add to the code of this repo.
from tagger.
@glample . If you could send me that dirty bit of code for gazetteers that would be helpful. I can make changes to them and send a PR. Should not a problem from my end at all.
from tagger.
Dear @glample, I guess the update for the gazetters didnt happen. could you send me the code as well as a starting point? thx
from tagger.
from tagger.
Hi @metpallyv, uh nice thx. Since I am very new to the whole topci, could you add a brief description what kind of file is needed for the gazetters?
from tagger.
Related Issues (20)
- utils.py issue line 303 list index out of range HOT 8
- Why my program goes into infinite loop? HOT 2
- Cuda and Theano Version HOT 4
- Can you
- Can my Chinese data be used in this program?(character-level) HOT 3
- Which tokenizer did you use? HOT 11
- Script for training embeddings HOT 9
- IOError "No such file or directory: './evaluation/temp/eval.1181043.scores " HOT 3
- SGD x Adam HOT 2
- Are you planning to release models in German, Spanish and Dutch as well? HOT 1
- Confidence score for the predicted entity HOT 4
- Data size and decoding time
- Pretrained word embedding HOT 5
- Confusion about lable conversion. HOT 2
- Inconsistent conversion for IOBES to IOB HOT 1
- Token level or Entity level? HOT 2
- Equations of LSTM
- transition scores HOT 1
- How to set the parameters of a small dataset? HOT 1
- . HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tagger.