Comments (3)
Do you mean that your original input datasets are already in zarr format? You could technically generate JSONs for each zarr input dataset and pass these to MultiZarrToZarr.
I don't think you could currently do multiple zarrs-in-zip, because ReferenceFileSystem requires a single file system to work on, and each zip counts as one.
A set of zarr datasets should really be the very simple case, since the directory structure is already of the right form, and we only ever need whole chunks. The implementation we have now is, if anything, too complicated for this case.
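To illustrate why a plain zarr directory is the simple case, here is a minimal hand-rolled sketch, not kerchunk's own implementation. It assumes a zarr v2 directory store on local disk and the version-1 reference layout, in which each value under `refs` is either an inline string or a `[url]` list pointing at a whole file. The function name `zarr_dir_to_refs` and the `META_NAMES` set are hypothetical, chosen for this example only.

```python
import os

# Zarr v2 metadata file names; these are small JSON documents, so inline them.
META_NAMES = {".zgroup", ".zarray", ".zattrs", ".zmetadata"}


def zarr_dir_to_refs(root):
    """Build a version-1 reference dict for a zarr v2 directory store.

    Hypothetical helper for illustration; assumes every non-metadata
    file under ``root`` is one complete chunk.
    """
    root = os.path.abspath(root)
    refs = {}
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in sorted(filenames):
            full = os.path.join(dirpath, name)
            # Zarr keys always use "/" regardless of the OS path separator.
            key = os.path.relpath(full, root).replace(os.sep, "/")
            if name in META_NAMES:
                # Inline the metadata text directly in the reference set.
                with open(full, encoding="utf-8") as f:
                    refs[key] = f.read()
            else:
                # Whole-chunk reference: a one-element list, no offset or length.
                refs[key] = [full]
    return {"version": 1, "refs": refs}
```

One such dict per input dataset, dumped to JSON, would be the kind of per-dataset reference file that could then be handed to MultiZarrToZarr for combining.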
from kerchunk.
Kerchunk also includes SingleHdf5ToZarr, which creates the references: https://fsspec.github.io/kerchunk/reference.html#kerchunk.hdf.SingleHdf5ToZarr
An example of this can be found in the docs here: https://fsspec.github.io/kerchunk/test_example.html#single-file-jsons
And in a blog post here (a bit out of date now, but it should still work; fsspec-reference-maker was recently renamed to kerchunk)
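A rough sketch of the single-file pattern described above, assuming `kerchunk`, `fsspec` and `h5py` are installed. The wrapper name `hdf5_to_reference_json`, its parameters, and the `inline_threshold` value are illustrative assumptions; only `SingleHdf5ToZarr` and its `translate()` method come from kerchunk itself.

```python
import json


def hdf5_to_reference_json(url, out_path, storage_options=None):
    """Scan one HDF5 file and write its reference set to a JSON file.

    Hypothetical wrapper for illustration only.
    """
    # Imported lazily so this module still loads without the optional packages.
    import fsspec
    from kerchunk.hdf import SingleHdf5ToZarr

    with fsspec.open(url, "rb", **(storage_options or {})) as f:
        # translate() returns the reference set as a plain dict.
        refs = SingleHdf5ToZarr(f, url, inline_threshold=300).translate()

    with open(out_path, "w", encoding="utf-8") as out:
        json.dump(refs, out)
    return refs
```

Calling this once per input file would give one JSON per file, which is the input MultiZarrToZarr expects when combining.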
from kerchunk.
My bad, I didn't read the original post closely enough 😉
from kerchunk.