target-s3-parquet's People
Forkers
vzzarr rabidaudio johnny-nicola dantwi-cuda avadag resulyrt93 srpwnd abhii-singh gabbytam saasworksinctarget-s3-parquet's Issues
Move to_parquet partition_cols parameter to configuration
- Make it possible to define a partition columns from configuration (verifying if the column exists)
- Remove partition key from the schema definition.
Remove hard requirement for aws access keys in config
Since this project uses awswrangler to manage creds that means it follows the default boto3 session procedure to find creds. See below
boto3 (by relation awswrangler) looks through the following locations to find credentials
- Passing credentials as parameters in the boto.client() method
- Passing credentials as parameters when creating a Session object
- Environment variables
- Shared credential file (~/.aws/credentials)
- AWS config file (~/.aws/config)
- Assume Role provider
- Boto2 config file (/etc/boto.cfg and ~/.boto)
- Instance metadata service on an Amazon EC2 instance that has an IAM role configured.
My use cases are all below #3 in that list. So they should be configurable, but not required if the machine I am running this on has other means of getting credentials.
Resources:
https://boto3.amazonaws.com/v1/documentation/api/latest/guide/credentials.html#configuring-credentials
https://github.com/aws/aws-sdk-pandas/blob/main/tutorials/002%20-%20Sessions.ipynb
https://boto3.amazonaws.com/v1/documentation/api/latest/reference/core/session.html
Add pull request and issue templates
Is this target only for Athena?
Hey @ndrluis @lorransr @Marcos314 - this is more of a question that an issue but I was just checking out this target and based on the requirement to have athena_database
and using aws wrangler I was wondering if this is meant only to be used specifically with Athena or is it intending to support writing plain parquet files to S3 as well. Similar to what https://github.com/transferwise/pipelinewise-target-s3-csv does with CSVs but in Parquet format instead. No problem either way but wanted to clarify the use cases for this target ๐ .
Add release and packaging process using conventional commits to generate the changelog
Add contribution instructions to README
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. ๐๐๐
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google โค๏ธ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.