GithubHelp home page GithubHelp logo

helmholtzai-consultants-munich / oligo-designer-toolsuite Goto Github PK

View Code? Open in Web Editor NEW
12.0 12.0 2.0 234.41 MB

Develop custom oligo design pipelines.

Home Page: https://oligo-designer-toolsuite.readthedocs.io/en/latest/

Python 100.00%
bioinformatics oligo-design probe-design spatial-transcriptomics

oligo-designer-toolsuite's People

Contributors

francesco-campi avatar isramekki0 avatar lisa-sousa avatar louisk92 avatar merdivane avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

oligo-designer-toolsuite's Issues

SCRINSHOT pipeline

Adapt scrinshot pipeline to new toolsuite structure

  • check if SCRINSHOT pipeline has all required property filters
  • check if SCRINSHOT pipeline has correct blast specificity parameters, e.g. ligation region filter
  • check if SCRINSHOT pipeline has correct blast corss-hybridization parameters
  • check if SCRINSHOT pipeline has correct set scoring parameters
  • split config into user and dev config
  • define default parameters for each config files

Oligo Scoring Functions

  • add transcript score:
    • transcript score = 1 - norm transcripts
    • number of transcripts that are targeted by the probe
    • normalize to 0-1 with 1=max transcripts per gene
  • add score

Exact Match Filter

Fix issue with exact match filter:

  • Don't use eaxct macth filter for cross hybridization because we would need to compare to the reverse complement sequence, which cannot be done with the current implementation that makes the filter very fast. Instead one can simply use the Bowtie Filter.
  • Only provide one apply function for the exact match filter, which takes a filter policy as input by which the duplicated oligos are removed, i.e. don't just remove all of them but have options on how to remove (e.g. by larger number of oligos)

Some tests are not running

  • test_sequence_generator_and_database:: test_oligo_database
  • test_sequence_generator_and_database::test_ftp_loader_ensemble
  • test_sequence_generator_and_database::test_region_generator_ncbi
  • test_sequence_generator_and_database::test_region_generator_ensemble
  • test_oligo_selection.py::test_oligosets_generation
  • test_utils.py::test_data_parser FAILED
  • test_utils.py::test_GFF_parser FAILED
  • Need to add additionally: test_fasta_parser in test_utils

Filename for regions_with_insufficient_oligos

When we create mutile databases the filename for regions with insufficiemnt oligos will always be the same and mixed up for different databases โ†’ what happens when we define two database objects?
Solution: give it the name of the database variable โ†’ Create different names depending on database variable names

Oligo Database Attributes Class

we start having many "calculate" functions in the database class which repeatedly add new attributes to the database. I would suggest to move the calculations of those attributes to a seperate class and then just call those classes in the "calculate" fucntions in the database. I would also suggest to create an attribute aclculator for each attruibute we add in the property filter class. In the property filters we can then call those attribute calculation classes which allows us to also compute attributes without running the filters. Suggestion of attributes to move into this class:

  • GC content
  • melting temp
  • padlock arms
  • isoform consensus
  • number of transcripts
  • oligo length
  • seed region
  • secondary structure
  • homodimer length

Add codecov

Use Codecov to automatically collect code coverage reports

Multiple identical(?) entries in start and end

Sometimes in the generated probes txt files I encounter multiple entries in the columns start and end, e.g. for TTLL10 (source=ncbi):

start end
1180535;1180535 1180580;1180580
1180536;1180536 1180581;1180581
... ...

I've only seen multiple identical entries so far. Can there also be different ones? If not I assume it's not the intended behaviour, right?

When running get_overlap_matrix I also get an error probably because of this.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.