GithubHelp home page GithubHelp logo

lambda-poppler-precompiled's Introduction

Lambda-poppler-precompiled

pdf to img, pdf to text.. and so on!!

I have created this repo out of furstration by the lack of documantation and general help in concern to working with pdf documents on aws lambda.

Getting Started

These instructions will get you a copy of the project up to run it on lambda

Installing

A step by step on how to setup using my sample script witch uses poppler to create thumbnails for pdf files.

  1. Clone the project
git clone https://github.com/johanub/Lambda-poppler-precompiled
  1. Edit the index.py file to use your bucket
s3_bucket = s3.Bucket("<your-bucket>")
  1. Navigate into the project directorty and zip the files using this command (command only works on unix based systems)
zip -r -X "app.zip" *
  1. Now go to the aws lambda console and and go into layer

  2. Make a new layer with the poppler.zip file. For runtime just choose all the python runtimes.

  1. Create a new lambda function and upload the fileapp.zip

  2. Select the layer witch we just made.

  1. Setup a trigger on the s3 bucket where the pdf's will be uploaded

  2. Go to the bucket and make a directory called previews Explain how to run the automated tests for this system

  3. Upload a pdf to your s3 and see the magic

Based on

  • Pavinthan - Poppler for aws lambda - Pavinthan

lambda-poppler-precompiled's People

Contributors

johanub avatar

Stargazers

 avatar  avatar

Watchers

 avatar

lambda-poppler-precompiled's Issues

Dependencies ?

Not an issue about the repo but need your help

I just want to use pdftotext in my firebase functions so while files are required to run pdftotext ?

For example total size of zip is 29 mb but that contains other utils also which i don't need.

Can please tell me dependencies of pdftotext so i can only keep them to reduce the size of functions.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.