GithubHelp home page GithubHelp logo

Unable to read in PDF about pdfio.jl HOT 6 CLOSED

sambitdash avatar sambitdash commented on May 28, 2024
Unable to read in PDF

from pdfio.jl.

Comments (6)

sambitdash avatar sambitdash commented on May 28, 2024

The file has an unusual PS definition in the CMap file:

/CIDSystemInfo <<
                   /Registry (PBPLFD+Courier+0) def
                   /Ordering (T1UV) def
                   /Supplement 0 def
>>

In this format of CMaps typically the def keyword is not used.

/CIDSystemInfo <<
       /Registry (F15+0) 
       /Ordering (T1UV) 
      /Supplement 0 >> def

The parse_dict can be made more tolerant. But I think the CMaps in these files are essentially corrupt.

from pdfio.jl.

sambitdash avatar sambitdash commented on May 28, 2024

Closing as now fixed.

from pdfio.jl.

sambitdash avatar sambitdash commented on May 28, 2024

@jacobcvt12 As you can see I have checked-in the changes and tested in the local system. However, I did not check-in test case as I don't own the file to share. Since, you own the file and if you think you can contribute to the test cases feel free to write a test case and upload the file to the test files collection as a PR so that this issue can be tracked during every build.

Thank you again for your submission and using PDFIO.

from pdfio.jl.

jacobcvt12 avatar jacobcvt12 commented on May 28, 2024

Thanks for this update. With regard to the file, it is a transcription of a oral arguments in the supreme court of the united states (accessible here). I don't own this file, but I imagine that using it in a unit test classifies as fair use. I'll work on a unit test and contribute in the near future.

from pdfio.jl.

sambitdash avatar sambitdash commented on May 28, 2024

Thanks for your help @jacobcvt12. Will look forwards to it.

from pdfio.jl.

sambitdash avatar sambitdash commented on May 28, 2024

Test case added with fc3e92c

from pdfio.jl.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.