Comments (4)
Hi @ryankilroy , This issue is fixed in the new release (v3.60.0
) which can be found here https://github.com/unidoc/unipdf/releases/tag/v3.60.0. Closing this ticket as fixed.
from unipdf.
Hi @ryankilroy, after some investigation, we found out that the issue is in the ToUnicode
map provided in the document. It has an invalid code point for the character code that represented the missing letter (l
). But the reason other tools were able to extract the correct character is that they resorted to the Replacement Text
data provoded as part of the marked content. Currently, our extractor doesn't implement this feature, which is why it just took the invalid code point (which is by the way in the Private Use Area of Unicode ) and extracted it as valid text. We plan to incorporate this feature in the future and provide an update on this ticket upon its release.
Regarding your second issue, i.e., font extraction, the reason for the font extraction failure is that there is no font in pages 3 and beyond (because the pages are scanned). But the error message is not informative enough to convey this. We will update this one too.
from unipdf.
Welcome! Thanks for posting your first issue. The way things work here is that while customer issues are prioritized, other issues go into our backlog where they are assessed and fitted into the roadmap when suitable. If you need to get this done, consider buying a license which also enables you to use it in your commercial products. More information can be found on https://unidoc.io/
from unipdf.
Hi @ryankilroy , thank you for reporiting this issue. We were able to reproduce it using the sample code and sample file you provided and we are currently investigating the cause of it. We will write an update as soon as we identify the source of the issue and the fixes.
from unipdf.
Related Issues (20)
- [BUG] Proxy is not working HOT 2
- [BUG] Page break between subchapters HOT 1
- [BUG] Writing to buffer with nonroot user HOT 6
- Incorrect table display occurs when multiple rows across pages HOT 3
- [FEATURE] Add more presets and initializer for PageSize HOT 2
- [BUG] PDF Creation not working as expected HOT 7
- [BUG] Incorrect table display occurs when multiple rows across pages HOT 3
- [BUG] failed to initialize unidoc: license key already set HOT 4
- [BUG] incorrect object position when draw svg graphics HOT 7
- [BUG] Huge memory consumption when writing images to PDF HOT 6
- Render method produces poor quality images. HOT 3
- [BUG] device.Render does not handle transparency HOT 3
- [BUG] no matching font found in the system HOT 3
- [BUG] Font objects syntax error while merging a PDF document with another HOT 17
- [BUG] Unipdf is messing up the unicode while merging Searchable PDF documents and applying PDF/1-a HOT 9
- [BUG] Accessibility tags missing on download HOT 8
- [BUG] miss makezero in slice init HOT 1
- [BUG] Extracted text from table is reversed when text is styled with an underline HOT 3
- [BUG]: Not able to change text color of form fields in the PDF HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from unipdf.