russhyde / coxpresdbr Goto Github PK
View Code? Open in Web Editor NEWParses and summarises data from coexpresdb.jp
License: GNU General Public License v3.0
Parses and summarises data from coexpresdb.jp
License: GNU General Public License v3.0
dplyr 0.8.3 throws deprecation warnings for many NSE functions from earlier versions of dplyr
This should be really fast.
Could be implemented as:
extract_dataframe(importer) %>%
group_by(source_id) %>%
top_n(-how_many_are_required, mutual_rank) %>%
ungroup()
It's currently mapping over each source gene, extracting rows for that gene, filtering to the best hits, then running bind_rows() [ensure it still respects mr_threshold
etc]
Add the following arguments for pass-through to Rutils::bunzip2:
overwrite # overwrite the extracted archive if it already exists
remove # remove the compressed archive once it's extracted
skip # if already extracted, use the existing extracted archive
Or, pass arguments along as dots or provide bunzip2_args
covr
does not run the code for S4-object validity test functions when they are added in this format:
my_validity_fn <- function(object) {
# blajh blah blah
}
methods::setClass("className", slots = ..., validity = my_validity_fn)
I think the discussion of function-factories here explains the issue
To ensure that S4-object validity functions are ran by covr when instantiating CoxpresDbPartners and CoxpresDbAccessor objects, use the following syntax:
methods::setClass("className", slots = ..., validity = function(x) my_validity_fn(x))
setClass
for CoxpresDbPartners
setClass
for CoxpresDbDataframeAccessor
Suspect there's a bug in the .summarise_neighbours internal function in evaluate_coex_partners
We only use purrr::map
twice in R/
; so could easily rewrite to not import purrr (would probably still need to use purrr in tests; so move to suggests from imports)
Work out the origin of this warning; can the input statistics be modified to circumvent the warning being thrown?
... since the coxpresdb.jp database is now released using .zip files.
[overwrite/rewrite]_in_bunzip2
with overwrite
and rewrite
argumentsIn the current version, the coxpresdb files look like
# file for target gene_a
gene_b MR_ab COR_ab
gene_c MR_ac COR_ac
gene_d MR_ad COR_ad
...
This leads to inefficient sampling - for each gene sampled you have to re-read it's coexpression data
It would be more efficient to read in the data for all genes at one time from a single file
The file should look like
gene_a gene_b MR_ab COR_ab
gene_a gene_c MR_ac COR_ac
gene_a gene_d MR_ad COR_ad
...
gene_b ...
...
Where no correlation coefficient is available for a dataset, it shouldnt be possible to use a correlation threshold in get_coex_partners
. An error should be thrown when correlation threshold is set to non-NULL
Either
? Find out alternatives for meta-analysis of correlation scores
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.