
pdf2jpg's People

Contributors

pankajr141


pdf2jpg's Issues

Java heap size error

Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
at org.apache.pdfbox.io.ScratchFileBuffer.addPage(ScratchFileBuffer.java:132)
at org.apache.pdfbox.io.ScratchFileBuffer.ensureAvailableBytesInPage(ScratchFileBuffer.java:184)
at org.apache.pdfbox.io.ScratchFileBuffer.write(ScratchFileBuffer.java:236)
at org.apache.pdfbox.io.ScratchFileBuffer.write(ScratchFileBuffer.java:220)
at org.apache.pdfbox.io.RandomAccessOutputStream.write(RandomAccessOutputStream.java:52)
at org.apache.pdfbox.filter.DCTFilter.decode(DCTFilter.java:147)
at org.apache.pdfbox.cos.COSInputStream.create(COSInputStream.java:69)
at org.apache.pdfbox.cos.COSStream.createInputStream(COSStream.java:162)
at org.apache.pdfbox.pdmodel.common.PDStream.createInputStream(PDStream.java:235)
at org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject.<init>(PDImageXObject.java:125)
at org.apache.pdfbox.pdmodel.graphics.PDXObject.createXObject(PDXObject.java:70)
at org.apache.pdfbox.pdmodel.PDResources.getXObject(PDResources.java:409)
at org.apache.pdfbox.contentstream.operator.graphics.DrawObject.process(DrawObject.java:53)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:838)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:495)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:469)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:150)
at org.apache.pdfbox.rendering.PageDrawer.drawPage(PageDrawer.java:206)
at org.apache.pdfbox.rendering.PDFRenderer.renderImage(PDFRenderer.java:145)
at org.apache.pdfbox.rendering.PDFRenderer.renderImageWithDPI(PDFRenderer.java:94)
at hd.pdf2jpg.PDF2JPGConverter.convertMulti(PDF2JPGConverter.java:76)
at hd.pdf2jpg.App.main(App.java:102)
maj 03, 2020 9:43:58 PM org.apache.pdfbox.cos.COSDocument finalize
WARNING: Warning: You did not close a PDF Document

Can this be solved somehow?
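
One thing that may help with an `OutOfMemoryError` like this is giving the JVM a larger heap through the standard `-Xmx` option, or rendering at a lower DPI. The sketch below assumes the jar can be invoked directly with the `-i`, `-o`, `-d`, and `-p` flags that appear in another report on this page. `build_java_command` and its parameters are hypothetical names, not part of pdf2jpg's API, and the paths are placeholders.

```python
def build_java_command(jar_path, pdf_path, out_dir, dpi=300, pages="ALL", max_heap="2g"):
    """Build an argument list that runs the jar with a larger JVM heap.

    -Xmx is a standard JVM option and must appear before -jar.
    """
    return [
        "java", f"-Xmx{max_heap}",
        "-jar", jar_path,
        "-i", pdf_path,
        "-o", out_dir,
        "-d", str(dpi),
        "-p", pages,
    ]


# Placeholder paths; a lower DPI also reduces the memory needed per page.
cmd = build_java_command("pdf2jpg.jar", "input.pdf", "out", dpi=150, max_heap="4g")
print(cmd)
```

The resulting list can be passed to `subprocess.run(cmd)`. Whether a larger heap is enough depends on the images embedded in the PDF.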

Returning non-zero exit status 1

Hi @pankajr141 ,

I am currently running into an unusual error when running your package on AWS Lambda. 99.9% of the time it successfully converts PDFs to JPG, which is amazing! However, on the odd one or two occasions it returns a limited error message:

Command '['java', '-jar', '/opt/python/lib/python3.6/site-packages/pdf2jpg/pdf2jpg.jar', '-i', '/tmp/my_pdf.pdf', '-o', '/tmp/', '-d', '300', '-p', 'ALL']' returned non-zero exit status 1.

Would you know why this is happening? Any help around this would be much appreciated. Sadly I can't share the pdf as it is linked to my company.
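
A "non-zero exit status" message hides whatever the Java process printed before exiting. One way to see more is to rerun the same command and capture its standard error. This is a minimal sketch: `run_and_report` is a hypothetical helper, not part of pdf2jpg, and the demo command is a stand-in child process rather than the real jar.

```python
import subprocess
import sys


def run_and_report(cmd):
    """Run cmd (a list of arguments) and return (exit_code, stdout, stderr) as text."""
    proc = subprocess.run(cmd, stdout=subprocess.PIPE, stderr=subprocess.PIPE)
    out = proc.stdout.decode("utf-8", errors="replace")
    err = proc.stderr.decode("utf-8", errors="replace")
    return proc.returncode, out, err


# Stand-in for the failing java command: writes to stderr and exits with status 1.
demo_cmd = [sys.executable, "-c", "import sys; sys.stderr.write('boom'); sys.exit(1)"]
code, out, err = run_and_report(demo_cmd)
print(code, repr(err))
```

Replacing `demo_cmd` with the argument list from the error message should surface the Java stack trace, if there is one, in `err`.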

no module named command

When I run the code, it fails with the error "no module named command". After I replaced command with subprocess, it worked.
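
If the missing module is the Python 2 `commands` module (an assumption, since the report writes `command`), it was removed in Python 3 and its `getstatusoutput` function now lives in `subprocess`. A minimal sketch of the replacement, with `run_shell` as a hypothetical wrapper name:

```python
import subprocess


def run_shell(cmd):
    """Python 3 stand-in for the old commands.getstatusoutput(cmd)."""
    status, output = subprocess.getstatusoutput(cmd)
    return status, output


status, output = run_shell("echo hello")
print(status, output)
```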

Error if spaces in script path

I noticed that the script does not work if it is on a path that includes spaces (C:\Users\user\test pdf2jpg\test.py). I am using Windows.

I fixed this inside the library by adding quotes to jarPath right after it is defined inside the convert_pdf2jpg function.

jarPath = f'"{jarPath}"'
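
An alternative to quoting paths by hand is to pass the command to `subprocess` as a list of arguments, so no shell parsing is involved and spaces need no escaping. The sketch below is not the library's code: `run_script` is a hypothetical helper, and it runs a small Python script instead of the jar so that it works without Java installed.

```python
import os
import subprocess
import sys
import tempfile


def run_script(script_path):
    """Run a Python script, passing arguments as a list (no shell, no manual quoting)."""
    proc = subprocess.run([sys.executable, script_path], stdout=subprocess.PIPE)
    return proc.stdout.decode().strip()


# Create a script inside a directory whose name contains a space.
with tempfile.TemporaryDirectory() as tmp:
    spaced_dir = os.path.join(tmp, "test pdf2jpg")
    os.mkdir(spaced_dir)
    script = os.path.join(spaced_dir, "hello.py")
    with open(script, "w") as fh:
        fh.write("print('ok')")
    result = run_script(script)

print(result)
```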

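Font-related EOFException

Thank you for developing the pdf2jpg library. I ran into a small problem while using it: when executing pdf2jpg.convert_pdf2jpg(inputpath, outputpath, dpi=300, pages="ALL"), the error shown in the screenshot appears. It seems to be font-related, but every page of the PDF was still converted to JPG successfully; the error is just reported. Is there a way to fix this? Thank you!
[Screenshot: snipaste20190531_234220]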

Execution error with cyrillic filename

Executing:

from pdf2jpg import pdf2jpg
pdf = r'C:\pdf\г. Советский Картопья вода.pdf'
outputpath = r'C:\pdf\images'
results = pdf2jpg.convert_pdf2jpg(pdf, outputpath, pages="ALL")
print('results=', results)

Gives error:

'utf-8' codec can't decode byte 0xe3 in position 65: invalid continuation byte
results= False
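
The message suggests that some bytes were decoded as UTF-8 when they were actually in a legacy Windows code page. Where that happens inside the library is not shown in the report, so the following is only a sketch of a tolerant decoder, assuming the bytes come from the Java process output and that the code page is cp1251. `decode_output` is a hypothetical helper.

```python
def decode_output(raw, fallback="cp1251"):
    """Decode bytes as UTF-8, falling back to a legacy code page if that fails."""
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        # errors="replace" ensures decoding never raises, even for unmapped bytes.
        return raw.decode(fallback, errors="replace")


# In cp1251 the letter 'г' is the single byte 0xe3, which is not valid UTF-8 here.
sample = "г. Советский".encode("cp1251")
text = decode_output(sample)
print(text)
```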

[WinError 2] The system cannot find the file specified

Executing the following:

from pdf2jpg import pdf2jpg
inputpath = r'D:\sample.pdf'
outputpath = r'D:\image.jpg'

# To convert single page
result = pdf2jpg.convert_pdf2jpg(inputpath, outputpath, dpi=300, pages="ALL")
print(result)

Error I get:
[WinError 2] The system cannot find the file specified
False
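
On Windows, `[WinError 2]` raised while launching a subprocess commonly means the executable itself could not be found. Assuming that the executable here is `java`, a quick pre-flight check might look like the sketch below; `find_problems` is a hypothetical helper, and the names passed to it are placeholders chosen so that both checks fail.

```python
import os
import shutil


def find_problems(executable, inputpath):
    """Return a list of human-readable problems found before running a conversion."""
    problems = []
    if shutil.which(executable) is None:
        problems.append(f"'{executable}' was not found on PATH")
    if not os.path.isfile(inputpath):
        problems.append(f"input file does not exist: {inputpath}")
    return problems


# Placeholder names that should not exist on any machine.
problems = find_problems("no-such-binary-pdf2jpg-demo", "no-such-file-pdf2jpg-demo.pdf")
for problem in problems:
    print(problem)
```

In practice the call would be `find_problems("java", inputpath)`. Note also that other reports on this page pass a directory, not a .jpg file name, as `outputpath`.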

Error on CentOS 7

I use this module on CentOS 7, but I get the error "expected str, bytes or os.PathLike object, not int". What can I do to run it on CentOS 7?

pdf2jpg.convert_pdf2jpg fails to convert to JPG

The following error occurs when the java command is called:
java.io.IOException: XREF for 2748:0 points to wrong object: 2749:0 at offset 1243393#################################

No such file or directory error

Has anyone tried this on macOS? I keep getting a "No such file or directory" error. I am not sure what the correct path format is on macOS. Below is the error message:

[Errno 2] No such file or directory: 'java -jar /Users/vincent/miniconda3/envs/MM_tuition/lib/python3.6/site-packages/pdf2jpg/pdf2jpg.jar -i "/Users/vincent/Documents/AUS/Australian National University/test.pdf" -o "/Users/vincent/Documents/AUS/Australian National University" -d 300 -p ALL': 'java -jar
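
In this message the whole command line is reported as the missing file, which is what `subprocess` does on POSIX systems when it receives a single string without `shell=True`: it treats the entire string as the program name. One way around it is to split the string into an argument list first. A sketch with `shlex.split`, using a shortened, made-up command with placeholder paths:

```python
import shlex

# Placeholder command with quoted paths that contain spaces.
command = 'java -jar pdf2jpg.jar -i "/Users/me/My Docs/test.pdf" -o "/Users/me/My Docs" -d 300 -p ALL'

# shlex.split honours the quotes, so each path stays a single argument.
args = shlex.split(command)
print(args)
```

The resulting list can be handed to `subprocess.run(args)` directly.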

AttributeError: 'function' object has no attribute 'convert_pdf2jpg'

My code is:

`def pdf2jpg():
    inputpath = "File.pdf"
    outputpath = myfile  # Don't worry about it, I put the path there

    # to convert all pages
    result = pdf2jpg.convert_pdf2jpg(inputpath, outputpath, pages="ALL")
    print(result)

if __name__ == "__main__":
    pdf2jpg()
`
Then I got this error:
AttributeError: 'function' object has no attribute 'convert_pdf2jpg'
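
This error is what Python reports when a function is given the same name as an imported module: the `def` rebinds the name, so `pdf2jpg.convert_pdf2jpg` is looked up on the function, not on the module. The sketch below reproduces the effect with the standard `math` module standing in for pdf2jpg, so it runs without the library installed. `shadowing_demo` and `convert` are hypothetical names.

```python
import math  # stands in for the imported pdf2jpg module


def shadowing_demo():
    # Defining a function named like the module rebinds that name,
    # so 'math' inside the body refers to the function itself.
    def math():
        return math.sqrt(4)

    try:
        return math()
    except AttributeError as exc:
        return str(exc)


def convert():
    # A distinct function name leaves the module name intact.
    return math.sqrt(4)


print(shadowing_demo())
print(convert())
```

Applied to the report above, renaming the function to something other than `pdf2jpg` should let the module attribute resolve again.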
