GithubHelp home page GithubHelp logo

Comments (10)

kosloot avatar kosloot commented on May 23, 2024 1

OK, this seems very reasonable.
This option is now implemented. But with a default of 'full' for now, because i want to investigate the impact of changing that first.

from ucto.

kosloot avatar kosloot commented on May 23, 2024

seems reasonable, and easy to do

from ucto.

kosloot avatar kosloot commented on May 23, 2024

I added a --noredundanttext option to ucto.
a better name is welcome
please test!

from ucto.

proycon avatar proycon commented on May 23, 2024

Test seems to work fine.

from ucto.

kosloot avatar kosloot commented on May 23, 2024

One important remark about this option:
When an incoming FoLiA already has some text on <p> or <s> nodes this text is left 'as-is' AND
from FoLiA v1.5 text compatibility rules will be enforced. (the normalized deeper text must equal the normalized text of the nodes)
We could decide on REMOVING the text on <p> or <s> nodes on the fly....

from ucto.

proycon avatar proycon commented on May 23, 2024

Perhaps change it into a --textredundancy/-T parameter that takes one of three modes?

  • full - Add full text redundancy (currently the default)
  • minimal - Do not introduce extra redundancy, but do not remove any input text
  • none - No text redundancy (actively strip and only keep the deepest level (w by definition)

I'd also propose changing the default to minimal then. Full text redundancy is not necessary usually.

from ucto.

proycon avatar proycon commented on May 23, 2024

Is this option accessible already from the frog configuration?

from ucto.

kosloot avatar kosloot commented on May 23, 2024

no.
I assume this is a wish :P

from ucto.

proycon avatar proycon commented on May 23, 2024

yep ;)

from ucto.

kosloot avatar kosloot commented on May 23, 2024

your wish is our command. So a -T / --textredundancy option is added now.
Also the default is set to 'minimal'

from ucto.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.