GithubHelp home page GithubHelp logo

target-s3-parquet's People

Contributors

avadag avatar gabbytam avatar lorransr avatar marcos314 avatar ndrluis avatar thiagosdantas avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

target-s3-parquet's Issues

Remove hard requirement for aws access keys in config

Since this project uses awswrangler to manage creds that means it follows the default boto3 session procedure to find creds. See below

boto3 (by relation awswrangler) looks through the following locations to find credentials

  1. Passing credentials as parameters in the boto.client() method
  2. Passing credentials as parameters when creating a Session object
  3. Environment variables
  4. Shared credential file (~/.aws/credentials)
  5. AWS config file (~/.aws/config)
  6. Assume Role provider
  7. Boto2 config file (/etc/boto.cfg and ~/.boto)
  8. Instance metadata service on an Amazon EC2 instance that has an IAM role configured.

My use cases are all below #3 in that list. So they should be configurable, but not required if the machine I am running this on has other means of getting credentials.

Resources:
https://boto3.amazonaws.com/v1/documentation/api/latest/guide/credentials.html#configuring-credentials
https://github.com/aws/aws-sdk-pandas/blob/main/tutorials/002%20-%20Sessions.ipynb
https://boto3.amazonaws.com/v1/documentation/api/latest/reference/core/session.html

Is this target only for Athena?

Hey @ndrluis @lorransr @Marcos314 - this is more of a question that an issue but I was just checking out this target and based on the requirement to have athena_database and using aws wrangler I was wondering if this is meant only to be used specifically with Athena or is it intending to support writing plain parquet files to S3 as well. Similar to what https://github.com/transferwise/pipelinewise-target-s3-csv does with CSVs but in Parquet format instead. No problem either way but wanted to clarify the use cases for this target ๐Ÿ˜„ .

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.