Comments (7)
Interesting bug :)
Frog relies on Ucto to handle BOM markers (that those are evil goes without saying), so I assume the bug is more of an Ucto issue. Will move the issue to Ucto.
from ucto.
Interestingly, this bug is hard to reproduce in Ucto itself, as Frog uses the Ucto API slightly differently than Ucto itself does.
Still, it is an Ucto problem.
I committed a fix in Ucto. This should solve the problem. Please test.
I tried to update lamachine but I got an error about installing Aptitude (even though I installed from scratch yesterday and everything worked smoothly). Maybe I can try updating just ucto using lamachine-update --only, which packages should I specify to get the new ucto only?
@marijnschraagen you have to wait until @proycon updates LaMachine (in this case the development version, until the bug fix is approved and officially released).
Hope @proycon reacts soon....
Sorry for the delay! Thanks for the fix @kosloot! I'm testing it right now and will do a release straight away if this indeed fixes it.
> I tried to update lamachine but I got an error about installing Aptitude (even though I installed from scratch yesterday and everything worked smoothly).
That is strange, can you create an issue if it persists?
> Maybe I can try updating just ucto using lamachine-update --only, which packages should I specify to get the new ucto only?
You can do lamachine-update --only languagemachines-basic, which includes frog and ucto. But it will only work if you're on the development version. Or just hold on until I publish the release and then it'll work in the stable LaMachine too.
The fix works and I have now released ucto v0.22; it should be available in LaMachine after a lamachine-update (or a fresh installation).