GithubHelp home page GithubHelp logo

Comments (11)

philss avatar philss commented on August 16, 2024 1

Yeah, I changed my mind. I think it worth to add this feature 👍

from explorer.

kimjoaoun avatar kimjoaoun commented on August 16, 2024

You mean reading a dataframe using something like this example below?

Explorer.DataFrame.from_bucket("arn:aws:s3:::bucket_name/key_name")

from explorer.

isaacsanders avatar isaacsanders commented on August 16, 2024

Or via URL, s3://bucket-name/data/date=2022-03-01/

from explorer.

isaacsanders avatar isaacsanders commented on August 16, 2024

Arrow defines “file system” types that enable this behavior.

from explorer.

philss avatar philss commented on August 16, 2024

I like the idea but I would suggest to not include such feature in the project right now because of the dependencies we would need to require.
Maybe a good approach would be to create such functionality in a separated project first. WDYT?

from explorer.

isaacsanders avatar isaacsanders commented on August 16, 2024

It’s a first class activity in Apache Arrow, so I didn’t think it would be unusual to include here. It might be the case that implementing a package with Apache Arrow bindings and using the from_arrow function could work.

from explorer.

isaacsanders avatar isaacsanders commented on August 16, 2024

This whole issue seems like it isn’t far off of what #91 is proposing, right?

from explorer.

cigrainger avatar cigrainger commented on August 16, 2024

I agree. I think the example here should give us a good start. I'll look into this.

from explorer.

josevalim avatar josevalim commented on August 16, 2024

This will be closed as part of #190.

from explorer.

isaacsanders avatar isaacsanders commented on August 16, 2024

As written, I don't see #190 as actually closing this issue.

from explorer.

josevalim avatar josevalim commented on August 16, 2024

The idea is that you would read from S3, using something like Req, and then call DF.from_parquet(...) or whatever format it is on S3.

from explorer.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.