agstephens / can-it-kerchunk

Testing kerchunk with some of our data and services

License: BSD 2-Clause "Simplified" License

Python 96.22% Shell 3.78%

can-it-kerchunk's People

Contributors

agstephens, nmassey001

Forkers

leosiqueira

can-it-kerchunk's Issues

can it?

:)

Please let me know how testing has gone for you.

Note that CF times are now handled more carefully in both the single-HDF and dataset-combine steps.

Workflow for experimenting with kerchunk and read-only Quobyte on JASMIN

Proposed method of demonstrating kerchunk usage with Quobyte on JASMIN

Exposing Quobyte mounts as S3

We have a demonstration Quobyte volume (the acclim Group Workspace - /gws/nopw/j04/acclim/outputs) mounted on a JASMIN server. We are able to interact with it using existing S3-compliant libraries, including s3fs and xarray.
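As an illustration of that kind of access, here is a minimal sketch using s3fs and xarray. The endpoint URL, bucket and object names are placeholders (none are given above), the helper names `s3_options` and `open_remote_netcdf` are invented for this example, and it assumes the h5netcdf engine is installed.

```python
def s3_options(endpoint_url, key=None, secret=None):
    """Build keyword arguments for s3fs.S3FileSystem.

    Falls back to anonymous access when no credentials are supplied.
    """
    options = {"client_kwargs": {"endpoint_url": endpoint_url}}
    if key and secret:
        options["key"] = key
        options["secret"] = secret
    else:
        options["anon"] = True
    return options


def open_remote_netcdf(url, options):
    """Open one NetCDF4 object through the S3 interface and load it."""
    # Third-party imports are local so s3_options() works without them.
    import s3fs
    import xarray as xr

    fs = s3fs.S3FileSystem(**options)
    with fs.open(url, "rb") as f:
        # load() pulls the data into memory before the file is closed.
        return xr.open_dataset(f, engine="h5netcdf").load()


# Hypothetical usage; replace the endpoint and object path with real ones:
# opts = s3_options("https://s3.example.invalid")
# ds = open_remote_netcdf("some-bucket/some-file.nc", opts)
```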

Creating kerchunk "index" files for NetCDF

The README in this repository demonstrates how to generate a kerchunk index file for (1) a single file, and for (2) multiple files.

NOTE: kerchunk has been updated since the scripts were created, so you can now create the aggregated files directly (rather than in a 2-stage process).

That example reads the files from object store, rather than as POSIX files.
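For orientation, a single-pass aggregation might look roughly like the sketch below. This is not the README's script: it assumes local NetCDF4/HDF5 files, a concatenation dimension called `time`, and a kerchunk version in which `MultiZarrToZarr` accepts in-memory reference dicts. The function names `build_index` and `write_index` are invented for this example.

```python
import json


def build_index(nc_paths, concat_dim="time"):
    """Scan local NetCDF4/HDF5 files and combine them into one reference set."""
    # Third-party imports are local so write_index() works without them.
    from kerchunk.combine import MultiZarrToZarr
    from kerchunk.hdf import SingleHdf5ToZarr

    single_refs = []
    for path in nc_paths:
        with open(path, "rb") as f:
            # One reference set per file, kept in memory rather than on disk.
            single_refs.append(SingleHdf5ToZarr(f, path).translate())

    # Combine along the assumed concatenation dimension.
    combined = MultiZarrToZarr(single_refs, concat_dims=[concat_dim])
    return combined.translate()


def write_index(refs, out_path):
    """Write a reference set to a JSON index file."""
    with open(out_path, "w") as f:
        json.dump(refs, f)


# Hypothetical usage:
# refs = build_index(["file_1.nc", "file_2.nc"])
# write_index(refs, "aggregated.json")
```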

Creating kerchunk files, on POSIX, but for object store

During early experimentation with the Quobyte S3 interface, we have not managed to work out how to configure the authentication components of the s3fs and kerchunk libraries so that the kerchunk scripts can run against it.

Our recommended solution to overcome these problems is:

  1. Create the aggregated kerchunk index file on POSIX (reading the NetCDF files and writing the output JSON file via POSIX paths)
  2. Post-process the index file to convert the local POSIX paths to object store URLs
  3. Write the JSON file to the GWS
  4. Test reading the JSON file via the object store interface
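Step 2 can be sketched with the standard library alone. The function below assumes the usual kerchunk JSON layout, in which chunk references are lists of the form `[path]` or `[path, offset, length]` and inlined content is stored as plain strings. The bucket name is a placeholder and `rewrite_refs` is a name invented for this example. It does not handle indexes that use a `templates` section.

```python
import json


def rewrite_refs(index, posix_prefix, url_prefix):
    """Return a copy of a kerchunk index with chunk paths that start with
    posix_prefix rewritten to start with url_prefix instead."""
    # Version 1 indexes keep references under "refs"; older ones are flat.
    wrapped = "refs" in index
    refs = index["refs"] if wrapped else index

    new_refs = {}
    for key, value in refs.items():
        # Only list entries point at files; strings are inlined content.
        if (
            isinstance(value, list)
            and value
            and isinstance(value[0], str)
            and value[0].startswith(posix_prefix)
        ):
            value = [url_prefix + value[0][len(posix_prefix):]] + value[1:]
        new_refs[key] = value

    if wrapped:
        out = dict(index)
        out["refs"] = new_refs
        return out
    return new_refs


# Hypothetical usage; the bucket name is a placeholder:
# with open("aggregated.json") as f:
#     index = json.load(f)
# fixed = rewrite_refs(index, "/gws/nopw/j04/acclim/outputs/", "s3://some-bucket/")
# with open("aggregated_s3.json", "w") as f:
#     json.dump(fixed, f)
```

The input index is left unmodified, so the POSIX and object store versions can be kept side by side while testing.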

Validate the outputs look like the inputs

Need to think about:

  • Check that CF decoding of times works; this probably just involves passing kwargs through.
  • Comparison of metadata, bounds, coordinates and values between original and new data.
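
One possible shape for such a comparison, as a rough sketch only: it assumes both datasets have been opened with the same decoding kwargs and converted to plain dictionaries with xarray's `Dataset.to_dict()`, so the check itself needs only the standard library. The function name `compare_dataset_dicts` and the tolerance are choices made for this example.

```python
import math


def _flatten(values):
    """Yield scalars from arbitrarily nested lists or tuples."""
    if isinstance(values, (list, tuple)):
        for item in values:
            yield from _flatten(item)
    else:
        yield values


def _same_values(a, b, rel_tol):
    """Compare two nested value lists, treating NaN as equal to NaN."""
    flat_a, flat_b = list(_flatten(a)), list(_flatten(b))
    if len(flat_a) != len(flat_b):
        return False
    for x, y in zip(flat_a, flat_b):
        if isinstance(x, float) and isinstance(y, float):
            if math.isnan(x) and math.isnan(y):
                continue
            if not math.isclose(x, y, rel_tol=rel_tol):
                return False
        elif x != y:
            return False
    return True


def compare_dataset_dicts(original, candidate, rel_tol=1e-9):
    """Return a list of differences between two Dataset.to_dict() results."""
    diffs = []
    if original.get("attrs", {}) != candidate.get("attrs", {}):
        diffs.append("global attrs differ")
    for section in ("coords", "data_vars"):
        a = original.get(section, {})
        b = candidate.get(section, {})
        for name in sorted(set(a) | set(b)):
            label = f"{section}/{name}"
            if name not in a or name not in b:
                diffs.append(f"{label}: present in only one dataset")
                continue
            if tuple(a[name].get("dims", ())) != tuple(b[name].get("dims", ())):
                diffs.append(f"{label}: dims differ")
            if a[name].get("attrs", {}) != b[name].get("attrs", {}):
                diffs.append(f"{label}: attrs differ")
            if not _same_values(a[name].get("data"), b[name].get("data"), rel_tol):
                diffs.append(f"{label}: values differ")
    return diffs
```

An empty list means no differences were found. Bounds variables are covered wherever xarray exposes them as coordinates or data variables.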
