Comments (15)
Hi Dave,
Any advance on uploading a params_file we can start to play with? The paper doesn't mention how many filters etc were used
@vagarwal87 - if you use the tensor2tensor release they have data relating to this paper (although their implementation is currently broken)
Thanks!
Jeev
from basenji.
Hey all,
I've made some decent progress on these. And I will release models soon. The biggest hold up at this point is that we're working to integrate these architectures better with the TensorFlow input data format TFRecords and the new Dataset API. You can expect that stuff soon.
Best,
Dave
from basenji.
Hi all, the tutorials and both errors Hirak encountered should be fixed and working now.
from basenji.
Hi Vikram,
Thanks for your interest, and sorry for the incomplete status here. The tutorials, data, and pre-trained models are a work in progress that I'll be fleshing out throughout the summer. I'll leave comments here as I make progress.
Best,
Dave
from basenji.
Hi Dave,
Adding my voice to the chorus here for people wanting an updated tutorial. Also, I'm wondering if it wouldn't be too much trouble to provide data used to make figures / conclusions in biorxiv paper in easily accessible form such as Google Drive link? Or perhaps whatever form a quickstart to basenji may be. Looking forward to further developments - awesome work!
Best,
Dylan
from basenji.
Hey Dave, could you post the umap_macro.bed file that you refer to in https://github.com/calico/basenji/blob/master/tutorials/preprocess.ipynb?
Or post a link to where it could be downloaded from?
from basenji.
Sorry, that was a typo. It's referring to this file: https://github.com/calico/basenji/blob/master/tutorials/data/unmap_macro.bed
from basenji.
Ah got it - I should have looking more closely for that. Thanks!
from basenji.
Hi Dave,
A quick question related to the discussion above. I'm trying to work through the gene expression tutorial (and do some further work with pre-trained models) and am having trouble loading the pre-trained models from the .tf files in basenji_test_genes.py -o data/gencode_chr9_test --rc -s --table models/params_small.txt models/gm12878_best.tf data/gencode_chr9_l262k_w128.h5
. Even after replacing _best
with _d10
and downloading the .tf.data-00000-of-00001
, .tf.meta
, and .tf.index
associated with the gm12878_d10
model, I receive the following error: tensorflow.python.framework.errors_impl.NotFoundError: Key cnn5/batch_normalization/renorm_mean_weight not found in checkpoint
. I assume that this is because I am missing a checkpoint file, which I cannot manage to find on this page. Any pointers? Thanks in advance.
Best,
Michael
from basenji.
HI Michael,
Sorry about that--I have a bit of an incompatibility right now between the master branch, and the pre-trained models. I'll clean that up next week. In the meantime, the safest bet is to work off the release branch here: https://github.com/calico/basenji/releases/tag/0.2
Best,
Dave
from basenji.
Hi Dave,
Thanks for working on this and pointing me to the release branch. I've setup the 0.2 release, but I am unfortunately still running into some issues with missing information from the checkpoint. Similar to the above error, the gene expression prediction from the tutorial fails with NotFoundError (see above for traceback): Key cnn0/batch_normalization/beta/Adam not found in checkpoint
. Any tips? Thanks in advance, and please let me know if you need any more traceback.
Best,
Michael
from basenji.
Hi Michael,
You're right, the parameters file doesn't match the pre-trained model. I will clean it up this week. In the meantime, that tutorial can still demonstrate the steps involved. And if you'd like to see the output, you can train a model for a few epochs yourself with those parameters and ignore models/gm12878_d10.tf.
Best,
Dave
from basenji.
Hi Dave,
Awesome package, I am trying to create the files according to tutorial, but failed while generating data/heart_l262k.h5
It seems due to some update basenji currently does not have genome
module
Traceback (most recent call last):
File "/home/hirak/Projects/basenji/bin/basenji_hdf5_single.py", line 918, in <module>
main()
File "/home/hirak/Projects/basenji/bin/basenji_hdf5_single.py", line 205, in main
chrom_segments = basenji.genome.load_chromosomes(fasta_file)
AttributeError: module 'basenji' has no attribute 'genome'
from basenji.
Hi Dave,
I guess it was an import issue, figured it out. The tutorial should have been run within the root directory of basenji
.
from basenji.
Hi Dave,
The package dependencies are resolved, but I am stuck with another problem in the basenji_hdf5_single.py
script
Traceback (most recent call last):
File "/home/hirak/Projects/basenji/bin/basenji_hdf5_single.py", line 924, in <module>
main()
File "/home/hirak/Projects/basenji/bin/basenji_hdf5_single.py", line 477, in main
data=seqs_na[train_indexes],
UnboundLocalError: local variable 'seqs_na' referenced before assignment
from basenji.
Related Issues (20)
- Pre-process tutorial file not working HOT 2
- tutorial training with --sample 0.05 HOT 3
- Making Akita predictions for different bin size HOT 2
- Attention layer HOT 4
- Seeking Guidance on bam_cov.py Parameters HOT 2
- When using Akita for single-cell prediction on the HCT116 cell line, the Pearson correlation coefficient is much lower. HOT 8
- Akita: Loading weights for newer models results in mismatch HOT 2
- Issue with final matrix?
- OOM issue
- Reproduce the Enformer's input sequences split HOT 1
- Prepare data HOT 1
- Problems during model training. HOT 7
- Tensorflow / Traceback error. HOT 2
- Basenji installation issues HOT 4
- Question about the preprocessing script bam_cov.py HOT 1
- basenji_read overflow HOT 1
- Error from basenji import genome HOT 1
- Question of expected data HOT 1
- AssertionError, Basenji_test.py 'Conv1D' object has no attribute 'input_shape' HOT 1
- akita train error
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from basenji.