GithubHelp home page GithubHelp logo

sergiocorreia / quipucamayoc Goto Github PK

View Code? Open in Web Editor NEW
24.0 5.0 5.0 31.05 MB

dev repo for article

License: GNU Affero General Public License v3.0

TeX 0.59% Python 99.41%
ocr ocr-post-processing ocr-python poppler table-extraction table-ocr textract

quipucamayoc's People

Contributors

a-mahr avatar sergiocorreia avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

quipucamayoc's Issues

setup problems

Hi there,
I'm trying to follow the setup instructions, and come across the following problems:
Following either set of instructions (pip or git), I end up at:
git:
-setup.py file is missing, none of other files seem to have the expected setup stuff under a different name
pip3:
quipu install aws
command not found: quipu
I'm not certain the pip3 problem isn't a path issue of my own, but I believe there are missing files necessary for the git installation

aws install issue

Hello,

I am having trouble using the command quipu aws install. I have installed quipucamayoc and downloaded the aws cli and configured my credentials. I have redacted my account id below, but I have verified that it matches my account. Any help would be appreciated.

Thanks!

Setting up AWS for quipucamayoc:
11:17:26.1 | INFO | Parameters:
11:17:26.1 | INFO | - region = us-east-1
11:17:26.1 | INFO | - account_id = [redacted]
11:17:26.1 | INFO | - topic_arn = arn:aws:sns:us-east-1:[redacted]:AmazonTextractQuipu
11:17:26.1 | INFO | - role_arn = arn:aws:iam::[redacted]:role/quipu-textract
11:17:26.1 | INFO | - queue_url = https://queue.amazonaws.com/[redacted]/quipu-queue
11:17:26.1 | INFO | - queue_arn = None
11:17:26.1 | INFO | - subscription_arn = None
11:17:26.1 | INFO | - bucket = quipu-[redacted]
11:17:26.1 | INFO | Current AWS user: [redacted]
11:17:26.1 | INFO | Creating clients...
11:17:26.9 | INFO | Creating bucket "quipu-[redacted]"
11:17:27.7 | INFO | - Bucket "quipu-[redacted]" exists; deleting it
11:17:29.9 | INFO | - Making bucket private
11:17:30.1 | INFO | Creating SNS topic "AmazonTextractQuipu"...
11:17:30.8 | INFO | - Deleted SNS topic
11:17:31.2 | INFO | - SNS topic ARN = "arn:aws:sns:us-east-1:[redacted]:AmazonTextractQuipu"
11:17:31.2 | INFO | Creating SQS standard queue "quipu-queue"...
11:17:31.2 | INFO | - Retrieving list of existing queues
Traceback (most recent call last):
File "", line 198, in _run_module_as_main
File "", line 88, in run_code
File "D:\Anaconda\envs\py311\Scripts\quipu.exe_main
.py", line 7, in
File "D:\Anaconda\envs\py311\Lib\site-packages\click\core.py", line 1130, in call
return self.main(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^
File "D:\Anaconda\envs\py311\Lib\site-packages\click\core.py", line 1055, in main
rv = self.invoke(ctx)
^^^^^^^^^^^^^^^^
File "D:\Anaconda\envs\py311\Lib\site-packages\click\core.py", line 1657, in invoke
return _process_result(sub_ctx.command.invoke(sub_ctx))
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "D:\Anaconda\envs\py311\Lib\site-packages\click\core.py", line 1404, in invoke
return ctx.invoke(self.callback, **ctx.params)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "D:\Anaconda\envs\py311\Lib\site-packages\click\core.py", line 760, in invoke
return __callback(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "D:\Anaconda\envs\py311\Lib\site-packages\quipucamayoc\cli.py", line 42, in aws
install_aws()
File "D:\Anaconda\envs\py311\Lib\site-packages\quipucamayoc\aws_setup.py", line 408, in install_aws
create_sqs_queue(quipu, sqs_client, logger)
File "D:\Anaconda\envs\py311\Lib\site-packages\quipucamayoc\aws_setup.py", line 270, in create_sqs_queue
assert quipu.queue_url == r['QueueUrl']
AssertionError

Document Class: AttributeError

I'm trying to follow the instructions in the paper and Github. I successfully install and load quipucamayoc using the pip install. However, when I run doc=quipucamayoc.Document(cache_path='./temp', clear_cache=False) I get AttributeError: module 'quipucamayoc' has no attribute 'Document'.

I'm running on a Google Colab notebook (linux), with only default installs, and poppler-utils. It's Python 3.10.12. My quipucamayoc version is 0.1.2.

When I print the dirs of the module I get: ['__builtins__', '__cached__', '__doc__', '__file__', '__loader__', '__name__', '__package__', '__path__', '__spec__', '__version__', 'aws_extract_tables', 'aws_setup', 'cli', 'install_aws', 'uninstall_aws', 'version'].

It works when I install directly from Github.

Working with Federation

This appears to not work with Federated AWS accounts. Is there anyway to work around that ?
An error occurred when calling the GetUser operation: Must specify userName when calling with non-User credentials.

I'm assuming it's looking for a local IAM user that does not exist.

quipu aws install fails

Hi,

After setting up aws via aws configure, setting region to us-east-1 (I have also tried us-east-2) and default output to JSON, I get the following error and am unable to proceed. Please see the output below. Thank you!

1:09:07.8 | INFO | - Bucket "quipu-179710001707" exists; deleting it
11:09:10.5 | INFO | - Making bucket private
11:09:10.8 | INFO | Creating SNS topic "AmazonTextractQuipu"...
11:09:11.8 | INFO | - Deleted SNS topic
11:09:12.5 | INFO | - Deleted 1 SNS topic subscriptions
11:09:14.0 | INFO | - SNS topic ARN = "arn:aws:sns:us-east-1:179710001707:AmazonTextractQuipu"
11:09:14.0 | INFO | Creating SQS standard queue "quipu-queue"...
11:09:14.0 | INFO | - Retrieving list of existing queues
Traceback (most recent call last):
File "/Users/puppy/anaconda3/bin/quipu", line 8, in
sys.exit(cli())
^^^^^
File "/Users/puppy/anaconda3/lib/python3.11/site-packages/click/core.py", line 1128, in call
return self.main(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/Users/puppy/anaconda3/lib/python3.11/site-packages/click/core.py", line 1053, in main
rv = self.invoke(ctx)
^^^^^^^^^^^^^^^^
File "/Users/puppy/anaconda3/lib/python3.11/site-packages/click/core.py", line 1659, in invoke
return _process_result(sub_ctx.command.invoke(sub_ctx))
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/Users/puppy/anaconda3/lib/python3.11/site-packages/click/core.py", line 1395, in invoke
return ctx.invoke(self.callback, **ctx.params)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/Users/puppy/anaconda3/lib/python3.11/site-packages/click/core.py", line 754, in invoke
return __callback(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/Users/puppy/anaconda3/lib/python3.11/site-packages/quipucamayoc/cli.py", line 42, in aws
install_aws()
File "/Users/puppy/anaconda3/lib/python3.11/site-packages/quipucamayoc/aws_setup.py", line 408, in install_aws
create_sqs_queue(quipu, sqs_client, logger)
File "/Users/puppy/anaconda3/lib/python3.11/site-packages/quipucamayoc/aws_setup.py", line 270, in create_sqs_queue
assert quipu.queue_url == r['QueueUrl']
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
AssertionError

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.