
Add argument for data cache (phenosv) CLOSED

wglab commented on August 30, 2024
Add argument for data cache

from phenosv.

Comments (3)

Karenxzr commented on August 30, 2024

Hi, there is a CSV file that stores all the feature paths and summary statistics the PhenoSV code uses to extract features before the inference step. Users need to update the corresponding paths before running PhenoSV. To avoid confusion, this is handled automatically during setup (setup.sh) by running: python3 $current_dir/phenosv/setup.py --path "$fpath/data"

Because the feature files are quite large (~40G for the light mode), I don't expect users to change the data location often. To change the location, users can edit 'PhenoSV/lib/fpath.config' and run phenosv/setup.py again.

Maybe I can update setup.sh so that the file location can be changed by running something like bash setup.sh --update_path /path/to/folder. Is this what you want?


kaichop commented on August 30, 2024

I think it would be more flexible to specify the data location with a command-line argument that overrides the information in the config file. If the argument is not specified, the location in the config file can still be used.
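For illustration, the precedence described here (command-line argument first, config file as fallback) could be sketched roughly as follows. This is not PhenoSV's actual code: the --data_dir and --config flags and both helper functions are hypothetical, and the sketch assumes the config file holds a single path as plain text, which may not match the real format of fpath.config.

```python
import argparse
from pathlib import Path
from typing import Optional, Sequence


def read_config_path(config_file: Path) -> Optional[str]:
    """Return the first non-empty line of the config file, or None.

    Assumption: the config file stores one path as plain text.
    """
    if not config_file.is_file():
        return None
    for line in config_file.read_text().splitlines():
        line = line.strip()
        if line:
            return line
    return None


def resolve_data_dir(cli_value: Optional[str], config_file: Path) -> Path:
    """Prefer the command-line value; fall back to the config file."""
    chosen = cli_value if cli_value is not None else read_config_path(config_file)
    if chosen is None:
        raise ValueError(
            "no data directory given on the command line or in the config file"
        )
    return Path(chosen).expanduser()


def parse_args(argv: Optional[Sequence[str]] = None) -> argparse.Namespace:
    parser = argparse.ArgumentParser(description="data-location override sketch")
    # Hypothetical flag names; a default of None means "not specified".
    parser.add_argument("--data_dir", default=None,
                        help="data location; overrides the config file")
    parser.add_argument("--config", default="fpath.config",
                        help="config file holding the default data location")
    return parser.parse_args(argv)


if __name__ == "__main__":
    args = parse_args()
    print(resolve_data_dir(args.data_dir, Path(args.config)))
```

Resolving the location once at startup and passing the resulting Path down to the code that needs it keeps the override logic in one place, rather than having each module read the config file on its own.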


JakeHagen commented on August 30, 2024

Thank you both for responding. I've looked into making this change, and it is a little more complicated than I first assumed. The cache directory needs to be propagated to many places in the codebase. I have opened a PR to try to accomplish this, but I fear that once I start testing it, I will find many places I missed.
