GithubHelp home page GithubHelp logo

Comments (3)

jbarlow83 avatar jbarlow83 commented on July 1, 2024

Sadly, Ghostscript converts CCITT to JBIG2 silently and without warning, even if the JBIG2 encoding was produced by another program. JBIG2 encoding is patented (I think). I use Ghostscript to produce the final output PDF, and found out about this misfeature of theirs only after I implemented monochrome to JBIG2 encoding for ocrmypdf in an unreleased branch.

I use Ghostscript to do PDF/A conversion. Would you give up PDF/A for JBIG2?

(Aside: Make sure you are using JBIG2 in lossless mode. Lossy mode is dangerous because it can cause character substitution errors.)

from ocrmypdf.

ppaspp avatar ppaspp commented on July 1, 2024

Personally, I would prefer JBIG2 over PDF/A because the compression differences to CCITT are big. But I can image that others prefer PDF/A ...

from ocrmypdf.

jbarlow83 avatar jbarlow83 commented on July 1, 2024

The new option in v4.2 --output-type pdf lets you skip PDF/A and use PDF. If you don't specify any settings that alter the image it will pass JBIG2 through without modification.

(Any settings will cause the image to be modified and trigger CCITT transcoding: --deskew --clean-final --oversample --pdf-renderer tesseract --output-type pdfa --force-ocr)

from ocrmypdf.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.