GithubHelp home page GithubHelp logo

laurentnoe / yass Goto Github PK

View Code? Open in Web Editor NEW
19.0 2.0 6.0 520 KB

genomic alignment similarity search tool

Home Page: https://bioinfo.univ-lille.fr/yass/

License: GNU General Public License v3.0

Makefile 5.81% M4 0.66% C 82.55% Shell 6.65% Perl 2.22% PHP 2.12%
sequence-alignment spaced-seed dna-sequences fasta-sequences fasta genomic-data-analysis blastn blast-search sequence-alignments align-dna-sequences

yass's Introduction

Build Status Website

yass

(more at http://bioinfo.univ-lille.fr/yass/)

yass is a genomic similarity seach tool for nucleic (and only nucleic) sequences in (multi)fasta or plain text format. yass produces local pairwise alignments in yass format, blast tabular format, or PSL format.

The associated tool yass2blast.pl may be used to convert the default yass output into blast full output, into fasta alignments, or into AXT format.

The associated tool yass2dotplot.php could also be used to convert the default yass output into png or svg dotplots.

Installation

(more at http://bioinfo.univ-lille.fr/yass/download.php)

You need a C compiler and the autotools. On Linux, you can install gcc, autoconf, automake. On Mac, you can install xcode, or the command line developer tools (or you can use macports to install gcc for example).

Using the command line, type:

git clone https://github.com/laurentnoe/yass.git
cd yass
./configure --with-threads
make

or:

git clone https://github.com/laurentnoe/yass.git
cd yass
autoreconf
./configure --with-threads
automake
make

you can install yass to a standard /local/bin directory:

sudo make install

or copy the binary directly to your homedir:

cp src/yass ~/.

Command-line

(more at http://bioinfo.univ-lille.fr/yass/help.php)

common usage

-d <N> where N = [0..5], to select the output format (default is 1)
-r <N> where N = [0..2] to select the forward, reverse, or both sense on the first sequence (default is both)
-S <N>

to select only one sequence in the first multifasta file (give a number between 1 and nbparts).

By default all the sequences are processed.

scoring system

-C <N,...>

with 2,3,4 or 16 parameters to give the:

  • Match/Mismatch scores,
  • Match/Transition/Transversion scores,
  • Match/Transition/Transversion/Other IUPAC scores,
  • 4x4 ACGT matrix (and disable scoring correction algorithm).
-G <No,Ne>

with two parameters to change the cost for:

  • the very first gap opening No,
  • the subsequent extension costs Ne.
-E <N> to set the E-value threshold N (default 10).
-X <N> to set the X-drop threshold score N (default 25).

and

-L <Nl,Nk> to possibly change the Lambda Nl and K Nk values if the one computed do not correspond to your needs. (Note that the ALP tool can do the work: https://www.ncbi.nlm.nih.gov/CBBresearch/Spouge/html_ncbi/html/software/program.html?uid=6 )

search parameters

-p <"seedpattern">
 

where the seed pattern is one, or several seeds separated by comma, where each seed is a word on the "#@-" alphabet

(Note that the Iedera tool can do the design: https://github.com/laurentnoe/iedera or http://bioinfo.univ-lille.fr/yass/iedera.php )

another possibility is to use "Minimally overlapping words" such as the pattern "RYNNNNNnnnNNNN" to speed-up the search, but at a lower sensitivity.

-c <N> where N = [1..2] for single or double hit criterion

Example

A very small example:

yass                   file1.fa  file2.mfa >  yass-output.yop
yass2blast.pl  -blast  yass-output.yop     >  blastlike-output.blk
yass2dotplot.php       yass-output.yop  filename1=""  filename2="" ; open dp.png

A second example where the scoring system is modified, the E-value changed:

yass    file1.fa  file2.mfa    -C 2,-2,-3   -G -5,-2   -E 1e-3   -o yass-output.yop

References

how to cite this tool:

Noe L., Kucherov G., YASS: enhancing the sensitivity of DNA similarity search, 2005, Nucleic Acids Research, 33(2):W540-W543. <http://doi.org/10.1093/nar/gki478>

yass's People

Contributors

laurentnoe avatar poccopen avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

yass's Issues

yass2dotplot.php

Bonsoir Mr Noé,
Je viens d'installer et compiler Yass et réussi avec succès un premier test (yass file1.fa file2.fa > output.yop)
J'ai voulu ensuite générer le dotplot correspondant au résultat (./yass2dotplot.php output.yop filename1="file1" filename2="file2") mais une erreur est apparue: -bash: ./yass2dotplot.php: Permission non accordée
Je débute sous Linux, aussi je pense qu'il s'agit plutôt d'une erreur de mon environnement système... Si vous avez une idée, je suis preneur. Je vous joins le fichier yass2dotplot.php que j'utilise.
yass2dotplot.php.txt
Merci par avance de votre aide et temps précieux.
Cdlt,
Stanislas Morand

yass2dotplot

I want to make the dotplot after yass
I have this error:

command: ../yass/yass2dotplot.php Prueba
/usr/bin/env: ‘php’: No such file or directory

I need to install something more?
thanks

Enable multiple threads?

Currently, yass runs with only single threads, and is slow for large fasta files.
Is it possible to speed up yass by running parallelly on more threads?
Thanks.

Manage dotplot windowsize

Hi,
I am using yass to align sequences and create a dotplot. How can I modify windowsize ?
I also notices that the is:issue wdsize.mat files creates come out empty. Is that a problem ?

Thank you very much in advance,

Issue with yass2dotplot.php (ImageCreate)

Fatal error: Uncaught Error: Call to undefined function ImageCreate() in /home/dnanexus/yass/yass2dotplot.php:236
Stack trace:
#0 {main}
thrown in /home/dnanexus/yass/yass2dotplot.php on line 236

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.