Comments (7)
Ah.. Polars doesn't have that distinction no. So the IPC
is the stream and the .arrow
is the feather
file as the IPC data + additional headers?
Then we must add this.
from nodejs-polars.
- .arrow file that works correctly
- .ipc file that doesn't work
from nodejs-polars.
The IPC readers are implemented upstream. Could you make this issue here? https://github.com/jorgecarleitao/arrow2
from nodejs-polars.
I am a bit surprised about pl.readIPC(fromArrow)
and pl.readIPC(fromIPC)
: shouldn't these be two different signatures? One thing is to read a stream (.ipc
), the other is a file (.arrow
). I think that we are just missing a readIPCStream
in Polars' API that can read arrow streams (as opposed to arrow files).
from nodejs-polars.
Hi!
I'm keen to get this into polars, as Snowflake uses this for their response format and would be awesome to get it in for reading data straight from SF into Polars.
Here is a quick primer about the streaming files from Arrow: https://arrow.apache.org/docs/python/ipc.html
And the guide here from arrow2 about reading the stream: https://jorgecarleitao.github.io/arrow2/io/ipc_stream_read.html
IMHO, supporting files initially is fine, later can do other streaming support.
I've started looking into this, and the major blocker I can see is projections.
In arrow2, projections are not supported here: https://github.com/jorgecarleitao/arrow2/blob/main/src/io/ipc/read/stream.rs#L185
So we will need to build the projection from the chunks.
Thoughts?
from nodejs-polars.
Transfering this to the NodeJS repo as I have no way to reproduce this using Python/Rust. Not sure if this is still relevant.
from nodejs-polars.
@stinodego
Python Polars 0.19.2 throws the same error on this file: exceptions.ArrowErrorException: OutOfSpec("InvalidHeader")
df = pl.read_ipc('https://paste.c-net.org/ViperMoronic')
It seems that .ipc
file needs to start and end with ARROW1
for Polars to work
from nodejs-polars.
Related Issues (20)
- pl.readCSV fail with tab separator HOT 5
- filtering by equals doesn't work HOT 3
- `polars.SQLContext` like feature for `node-polars `
- Add `DataFrame.upsample`
- error with filter when trying documentation example HOT 1
- ```valueCounts()``` function always returns ```null```
- DataFrame - `schema_overrides` param is missing HOT 3
- Pivot not working as expected HOT 5
- Add support for reading Delta tables HOT 5
- separator in pivot function HOT 2
- pivot function documentation error HOT 2
- Series constructor ignores dtype HOT 2
- groupBy on a single column DF produces no result
- Transpose option `columnNames` is not working
- Expanded expression support for nodejs-polars HOT 4
- Panic when passing a schema that includes a `String` column HOT 1
- `Series.toTypedArrays()` fails after `.dropNulls()`
- Add support for Decimal DataType HOT 2
- Issue with `dlopen` in version 0.12.0 HOT 3
- Join suffix has no impact HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from nodejs-polars.