GithubHelp home page GithubHelp logo

saphir746 / biobankread-bash Goto Github PK

View Code? Open in Web Editor NEW
30.0 30.0 8.0 6.69 MB

Python scripts to extract and pre-process UKB data

License: GNU General Public License v3.0

Python 96.85% Shell 3.15%

biobankread-bash's People

Contributors

bungeebill avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

biobankread-bash's Issues

Scaling, hierarchical tree parsing, general questions

Hi there! Thanks so much for the awesome package. I was in the process of writing my own phenotype parser when I found out about this, it saved me a lot of time and also provided guidance for the use cases that are specific to me.

I have some questions about the package:

  1. I was wondering if you can comment on the scaling ability of the package? I see that the package mostly uses numpy and pandas, which I assume loads all the data into memory. Will this be a problem when the dataframe queried is very big (a large number of phenotypes at a time), or when the UKB add more phenotype and more people? Are there any cases where you see the package takes performance hits or results in out of memory error?
  2. Is there currently a functionality that, for a hierarchical categorical attribute, grabs all the levels below a specific attribute? For example, if I put White for ethnic background, it would give all people who are either "White", "British", "Irish", and "Other white background"?
  3. Do you have a way of saving the newly-created, complex, phenotype definitions and/or filters for later quick reference/reproducibility?
  4. I see that you parse the html file for the field-related information. Aside from the html being UKB data access application specific, is there a reason why the data dictionary csv was not used? I'm currently using it and wonder if you avoided it because of a specific reason.
  5. Are you currently working on adding to the documentation and use cases? I'd be more than happy to document and write up my use of the package as part of the example for other people to use.

Thanks!

installation instructions, release, pip package

Hello!
Looks great, you may want to provide instructions for installation, make it pip install-able and add a release in GitHub?
It already has a setup.py so might be that the instructions are incomplete at the moment?
Best,
Antonio

ImportError: UKBr could not be loaded properly

Hi! thank you so much for sharing! this is super helpful !

I am very new to git and command line. I have tried to use BiobankRead, but I kept on getting this error message:

Traceback (most recent call last):
  File "extract_variables.py", line 406, in <module>
    raise ImportError('UKBr could not be loaded properly')
ImportError: UKBr could not be loaded properly

it will be great if you can help me with it! Appreciated!
Hannah

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.