Comments (5)
Hey @usptact,
CRFsuite doesn't currently support n-best tagging.
It seems the relevant code is
crfsuite/lib/crf/src/crf1d_context.c
Line 466 in 5566039
from crfsuite.
kmike,
Thank you a lot for a pointer! I read elsewhere that for Vitterbi based algorithms one needs to increase the beam size. I am not sure what it means.
from crfsuite.
Currently the function computes a single max_score, stores a single backward link at each j, and finds a single best label sequence using these backwads links.
If I'm not mistaken, for n-best parsing you need to keep top-n max_score values, n best backward links at each position j and use them to compute n best label sequences.
There are also more efficient algorithms for n-best decoding, see e.g. http://www.keerthis.com/P12-1064.pdf for an overview.
from crfsuite.
As a side not, Wapiti CRF toolkit supports n-best decoding.
Implementation is not optimal though (see Jekub/Wapiti#2).
from crfsuite.
Thank you very much, kmike! I am playing with Wapiti right now and trying to assess the top n-best results. Up to this moment I was always relying on the top-1 result which was not the best in all the cases. I am curios whether good tagging is in the n-best results list.
from crfsuite.
Related Issues (20)
- Exclude sentence with only O HOT 1
- Character n-grams HOT 2
- R wrapper available at https://github.com/bnosac/crfsuite HOT 1
- lib/cqdb/src/cqdb.c and Wstringop-truncation HOT 3
- mersenne twister HOT 1
- meaning of min_freq HOT 3
- Old lookup3.c file, `k8` undeclared
- Deprecate Python SWIG binding and make python-crfsuite the canonical binding?
- Unable to compile a very simple Tagger with the C++ API HOT 2
- Comparison with SimString
- How do I use glove on crfsuite with python? HOT 2
- Different results across platforms (Windows, Ubuntu, etc...) HOT 2
- Why are my results so different on identical runs?
- *deleted*
- Are transition features conditioned on observations supported?
- How to install lib on the M1 MacBooks? HOT 1
- Forced decoding support for partial labelled sequence ? HOT 2
- Interpreting the CRFsuite Model File
- Hindi Language NER Training format HOT 15
- start with CRF suite in windows HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from crfsuite.