GithubHelp home page GithubHelp logo

datma-health / tiledb Goto Github PK

View Code? Open in Web Editor NEW
2.0 4.0 1.0 13.06 MB

TileDB

License: MIT License

CMake 2.43% C 1.55% C++ 96.02%
storage-manager arrays posix hdfs emrfs gcs cloud mpi cloud-storage azure-storage

tiledb's Introduction

tiledb's People

Contributors

andreasnoack avatar dhutchis avatar eneskuluk avatar gitmach avatar jakebolewski avatar jeffhammond avatar joshblum avatar kdatta avatar kgururaj avatar luszczek avatar mlathara avatar nalinigans avatar stavrospapadopoulos avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

Forkers

kgururaj

tiledb's Issues

Allow for a force delete of TileDB elements

GenomicsDB is adding a bunch of non-TileDB elements into TileDB storage. And we see this when GenomicsDB tries deleting an existing workspace -

10:05:08.612 INFO  GenomicsDBImport - Done initializing engine
[TileDB::StorageManager] Error: Cannot delete non TileDB related element '/home/vagrant/gatk/gendb/22$4514841$4617450/genomicsdb_meta_dir'.
[TileDB::FileSystem] Error: posix: Cannot delete file; Directory not empty 
10:05:08.841 INFO  GenomicsDBImport - Shutting down engine
[October 17, 2018 10:05:08 AM PDT] org.broadinstitute.hellbender.tools.genomicsdb.GenomicsDBImport done. Elapsed time: 0.04 minutes.

Allow an option to force delete TileDB workspaces.

Allow for multiple azure credential scenarios

Support azure managed identities. @mlathara mentioned -
I think this is a good overview. In addition to the AZ CLI call I mention above, we could also make a REST call.

The use case I am thinking of here is where a user may be using a node or cluster they don't directly have control over, and might be relying on managed identity to provide credential info.

azb:// URI does not work well with vcf2genomicsdb_init in GenomicsDB

Not sure I am constructing the URI correctly. But the following does not work

% ./vcf2genomicsdb_init -w ws -o -S azb://<container>/vcfs
[E::hts_open_format] Failed to open file "azb://<container>/vcfs/t0.vcf.gz" : Protocol not supported
16:54:14.024 ERROR vcf2genomicsdb_init - pid=10937 tid=6368523 Could not open sample azb://<container>/vcfs/t0.vcf.gz with hts_open Protocol not supported
% ./vcf2genomicsdb_init -w ws -o -S azb://nalini/vcfs?endpoint=<account>.blob.core.windows.net
[E::hts_open_format] Failed to open file "azb://nalini/vcfs/t0.vcf.gz?endpoint=<account>.blob.core.windows.net" : Protocol not supported
16:55:41.334 ERROR vcf2genomicsdb_init - pid=10943 tid=6369128 Could not open sample azb://<container>/vcfs/t0.vcf.gz?endpoint=oda.blob.core.windows.net with hts_open Protocol not supported

whereas this works!

% ./vcf2genomicsdb_init -w ws -o -S az://<container>@<account>.blob.core.windows.net/vcfs

tiledb_array_overflow not working well

Two issues here -

  1. The api in tiledb.h states that tiledb_array_overflow uses the attribute id based on what was passed to tiledb_array_init. But, it actually requires the id to correspond to that in the schema. There is no direct api to map the id to that in the schema making tiledb_array_overflow rather cumbersome to use.
  2. tiledb_array_overflow does not return correct values when tiledb_array_read is called repeatedly to refresh exhausted buffers. Need to have a bunch of test cases to demonstrate and fix this issue.

Embedded copy of catch2 is too old and buggy

Hello,

The code embeds a copy of catch2, but it lies several versions behind upstream which has solved some issues, e.g. catchorg/Catch2#2421 which showed up for me when using the tip of master of your TileDB to build GenomicsDB.

In Debian unstable, linking to the Debian-packaged catch2 instead has solved this problem.

Cheers,
Pierre

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.