Comments (6)
The file has an unusual PS definition in the CMap file:
/CIDSystemInfo <<
/Registry (PBPLFD+Courier+0) def
/Ordering (T1UV) def
/Supplement 0 def
>>
In this format of CMaps typically the def
keyword is not used.
/CIDSystemInfo <<
/Registry (F15+0)
/Ordering (T1UV)
/Supplement 0 >> def
The parse_dict
can be made more tolerant. But I think the CMaps in these files are essentially corrupt.
from pdfio.jl.
Closing as now fixed.
from pdfio.jl.
@jacobcvt12 As you can see I have checked-in the changes and tested in the local system. However, I did not check-in test case as I don't own the file to share. Since, you own the file and if you think you can contribute to the test cases feel free to write a test case and upload the file to the test files collection as a PR so that this issue can be tracked during every build.
Thank you again for your submission and using PDFIO.
from pdfio.jl.
Thanks for this update. With regard to the file, it is a transcription of a oral arguments in the supreme court of the united states (accessible here). I don't own this file, but I imagine that using it in a unit test classifies as fair use. I'll work on a unit test and contribute in the near future.
from pdfio.jl.
Thanks for your help @jacobcvt12. Will look forwards to it.
from pdfio.jl.
Test case added with fc3e92c
from pdfio.jl.
Related Issues (20)
- RomanNumerals fixed
- Add test cases for AGL. HOT 4
- Error in `merge_encoding!` when extracting text HOT 12
- Text extraction ignores different kinds of white spaces
- Error on open a file: ArgumentError: extra characters after whitespace in "4970\n396" HOT 1
- `zlib1` error in Windows HOT 3
- assert error reading a pdf HOT 7
- problem extracting text on a two columns layout HOT 6
- invalid or incomplete deflate data HOT 2
- Bug: error reading attached PDF, works with other PDFs. HOT 6
- Bug: no method matching read_internal_stream_data(::IOStream, ::CosDict, ::Base.GenericIOBuffer{Array{UInt8,1}}) HOT 6
- OpenSSL warning upon `using PDFIO` HOT 8
- ERROR: MethodError: no method matching String(::PDFIO.Cos.CosIndirectObject{CosName}) HOT 1
- New release? HOT 2
- ERROR: Found 'j(106)' Expected 'x' here HOT 6
- Precompilation issue with Julia v1.6 HOT 4
- TagBot trigger issue HOT 7
- Bad PDF files which have junk characters before header and after EOF marker error unexpected character. HOT 5
- Support for AbstractTrees v0.3 HOT 3
- AssertionError: Invalid interval HOT 8
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pdfio.jl.