I am very interested in using this tool for my analysis, I have 3 doubts.
(1) I tried to use the command "$ flanker --flank both -w 6000 --gene emr --fasta_file all_genomas.fasta --include_gene" in my analysis, but it did not return anything, just an error:
Error: Gene erm not found in 43770.1000.con.0010
Traceback (most recent call last):
File "/home/michel/miniconda3/envs/flanker/bin/flanker", line 10, in <module>
sys.exit(main())
File "/home/michel/miniconda3/envs/flanker/lib/python3.7/site-packages/flanker/flanker.py", line 383, in main
flanker_main()
File "/home/michel/miniconda3/envs/flanker/lib/python3.7/site-packages/flanker/flanker.py", line 345, in flanker_main
flank_fasta_file_lin(args.fasta_file, args.window,gene.strip())
File "/home/michel/miniconda3/envs/flanker/lib/python3.7/site-packages/flanker/flanker.py", line 266, in flank_fasta_file_lin
gene_sense=str(gene_sense['STRAND'].iloc[0])
File "/home/michel/miniconda3/envs/flanker/lib/python3.7/site-packages/pandas/core/indexing.py", line 895, in getitem
return self._getitem_axis(maybe_callable, axis=axis)
File "/home/michel/miniconda3/envs/flanker/lib/python3.7/site-packages/pandas/core/indexing.py", line 1501, in _getitem_axis
self._validate_integer(key, axis)
File "/home/michel/miniconda3/envs/flanker/lib/python3.7/site-packages/pandas/core/indexing.py", line 1444, in _validate_integer
raise IndexError("single positional indexer is out-of-bounds")
IndexError: single positional indexer is out-of-bounds
It is probably because I use all my genomes (fragmented or not) are in a concatenated file. It throws an error when it cannot find the gene in some genome.
Is there a way to extract the contigs that have a gene from the genomes of interest?
It seems that we must do a previous analysis of abricate to know which contig contains the gene of interest.
(2) The second question is: if flanker uses abricate databases, then I can add more databases to abricate, for example a custom database or a database such as ISfinder or Megares,
Is the latter true? Does Flanker work this way?
It would be interesting, that Flanker has an option to add a sequence (fasta), then Flanker looks for neighboring genes to the added sequence.
(3) since genomes generally do not always have a high quality of assembly, contigs can be short, long and medium. If there is a gene of interest in a short contig, then I specify the --window 6000 option, but there is no 6000 bp on either side of the sequence. What about this?
I appreciate the answers in advance.
Thanks a lot