list of sequence alignment software

This list of sequence alignment software is a compilation of software tools and web portals used in pairwise sequence alignment and multiple sequence alignment. See structural alignment software for structural alignment of proteins.

Database search only

Name ! Description ! Sequence type* ! Authors ! Year
class="wikitable sortable"
BLAST \| Local search with fast k-tuple heuristic (Basic Local Alignment Search Tool)	Both	Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ{{Cite journal\|author=Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ \|title=Basic local alignment search tool \|journal=Journal of Molecular Biology \|volume=215 \|issue=3 \|pages=403–10 \|date=October 1990 \|pmid=2231712 \|doi=10.1016/S0022-2836(05)80360-2\|last2=Gish \|last3=Miller \|last4=Myers \|last5=Lipman\|s2cid=14441902 }}	1990
HPC-BLAST \| NCBI compliant multinode and multicore BLAST wrapper. Distributed with the latest version of BLAST, this wrapper facilitates parallelization of the algorithm on modern hybrid architectures with many nodes and many cores within each node.HPC-BLAST code repository https://github.com/UTennessee-JICS/HPC-BLAST	Protein	Burdyshaw CE, Sawyer S, Horton MD, Brook RG, Rekapalli B	2017
CS-BLAST \| Sequence-context specific BLAST, more sensitive than BLAST, FASTA, and SSEARCH. Position-specific iterative version CSI-BLAST more sensitive than PSI-BLAST	Protein	Angermueller C, Biegert A, Soeding J{{Cite journal \|last1= Angermüller \|first1= C. \|last2= Biegert \|first2= A. \|last3= Söding \|first3= J. \|title= Discriminative modelling of context-specific amino acid substitution probabilities \|journal= Bioinformatics \|volume= 28 \|issue= 24 \|pages= 3240–7\|date=Dec 2012 \|doi= 10.1093/bioinformatics/bts622 \|pmid=23080114\|doi-access= free \|hdl= 11858/00-001M-0000-0015-8D22-F \|hdl-access= free }}	2013
CUDASW++ \| GPU accelerated Smith Waterman algorithm for multiple shared-host GPUs	Protein	Liu Y, Maskell DL and Schmidt B	2009/2010
DIAMOND \| BLASTX and BLASTP aligner based on double indexing	Protein	Buchfink B, Xie C, Huson DH, Reuter K, Drost HG {{Cite journal \|author=Buchfink, Xie and Huson \|title=Fast and sensitive protein alignment using DIAMOND \|journal=Nature Methods \|volume=12 \|issue=1 \|pages=59–60 \|date= 2015 \|doi=10.1038/nmeth.3176\|pmid=25402007 \|s2cid=5346781 }}{{Cite journal \|author=B Buchfink, K Reuter and HG Drost \|title=Sensitive protein alignments at tree-of-life scale using DIAMOND \|journal=Nature Methods \|volume=18 \|pages=366–368 \|date= 2021 \|issue=4 \|doi=10.1038/s41592-021-01101-x\|pmid=33828273 \|pmc=8026399 \|doi-access=free }}	2015/2021
FASTA \| Local search with fast k-tuple heuristic, slower but more sensitive than BLAST	Both
GGSEARCH, GLSEARCH \| Global:Global (GG), Global:Local (GL) alignment with statistics	Protein
Genome Magician \| Software for ultra fast local DNA sequence motif search and pairwise alignment for NGS data (FASTA, FASTQ).	DNA	Hepperle D (www.sequentix.de)	2020
Genoogle \| Genoogle uses indexing and parallel processing techniques for searching DNA and Proteins sequences. It is developed in Java and open source.	Both	Albrecht F	2015
HMMER \| Local and global search with profile Hidden Markov models, more sensitive than PSI-BLAST	Both	Durbin R, Eddy SR, Krogh A, Mitchison G{{Cite book\|editor1-first=Richard \|editor1-last=Durbin \|editor2-first=Sean R. \|editor2-last=Eddy \|editor3-first=Anders \|editor3-last=Krogh \|editor3-link=Anders Krogh \|editor4-first=Graeme \|editor4-last=Mitchison \|title=Biological sequence analysis: probabilistic models of proteins and nucleic acids \|publisher=Cambridge University Press \|location=Cambridge, UK \|year=1998 \|isbn=978-0-521-62971-3}}{{Page needed\|date=September 2010}}	1998
HH-suite \| Pairwise comparison of profile Hidden Markov models; very sensitive	Protein	Söding J{{Cite journal\|author=Söding J \|title=Protein homology detection by HMM-HMM comparison \|journal=Bioinformatics \|volume=21 \|issue=7 \|pages=951–60 \|date=April 2005 \|pmid=15531603 \|doi=10.1093/bioinformatics/bti125\|doi-access=free \|hdl=11858/00-001M-0000-0017-EC7A-F \|hdl-access=free }}{{Cite journal\|last1=Remmert\|first1=Michael\|last2=Biegert\|first2=Andreas\|last3=Hauser\|first3=Andreas\|last4=Söding\|first4=Johannes\|date=2011-12-25\|title=HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment\|journal=Nature Methods\|volume=9\|issue=2\|pages=173–175\|doi=10.1038/nmeth.1818\|issn=1548-7105\|pmid=22198341\|hdl=11858/00-001M-0000-0015-8D56-A\|s2cid=205420247\|hdl-access=free}}	2005/2012
IDF \| Inverse Document Frequency	Both
Infernal \| Profile SCFG search	RNA	Eddy S
KLAST \| High-performance general purpose sequence similarity search tool	Both		2009/2014
LAMBDA \| High performance local aligner compatible to BLAST, but much faster; supports SAM/BAM	Protein	Hannes Hauswedell, Jochen Singer, Knut Reinert{{Cite journal\|last1=Hauswedell\|first1=Hannes\|last2=Singer\|first2=Jochen\|last3=Reinert\|first3=Knut\|date=2014-09-01\|journal=Bioinformatics\|volume=30 \|issue=17 \|pages=349–355\| title=Lambda: the local aligner for massive biological data\|url= \|pmid=25161219\|pmc=4147892 \|name-list-style=vanc\|doi=10.1093/bioinformatics/btu439}}	2014
MMseqs2 \| Software suite to search and cluster huge sequence sets. Similar sensitivity to BLAST and PSI-BLAST but orders of magnitude faster	Protein	Steinegger M, Mirdita M, Galiez C, Söding J{{Cite journal\|last1=Steinegger\|first1=Martin\|last2=Soeding\|first2=Johannes\|date=2017-10-16\|journal=Nature Biotechnology\|volume=35\|issue=11\|pages=1026–1028\|title=MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets\|url=https://www.nature.com/nbt/journal/vaop/ncurrent/full/nbt.3988.html\|doi=10.1038/nbt.3988\|pmid=29035372\|hdl=11858/00-001M-0000-002E-1967-3\|s2cid=402352\|hdl-access=free}}	2017
USEARCH \| Ultra-fast sequence analysis tool	Both	{{cite journal \| doi = 10.1093/bioinformatics/btq461 \| volume=26 \| title=Search and clustering orders of magnitude faster than BLAST \| year=2010 \| journal=Bioinformatics \| pages=2460–2461 \| last1 = Edgar \| first1 = R. C. \| issue=19 \| pmid=20709691\| doi-access=free }} publication	2010
OSWALD \|OpenCL Smith-Waterman on Altera's FPGA for Large Protein Databases \|Protein \|Rucci E, García C, Botella G, De Giusti A, Naiouf M, Prieto-Matías M{{Cite journal\|last1=Rucci\|first1=Enzo\|last2=Garcia\|first2=Carlos\|last3=Botella\|first3=Guillermo\|last4=Giusti\|first4=Armando E. De\|last5=Naiouf\|first5=Marcelo\|last6=Prieto-Matias\|first6=Manuel\|date=2016-06-30\|title=OSWALD: OpenCL Smith–Waterman on Altera's FPGA for Large Protein Databases\|url=http://hpc.sagepub.com/content/early/2016/06/30/1094342016654215\|journal=International Journal of High Performance Computing Applications\|volume=32\|issue=3\|pages=337–350\|doi=10.1177/1094342016654215\|s2cid=212680914\|issn=1094-3420\|hdl=11336/48798\|hdl-access=free}} \|2016
parasail \| Fast Smith-Waterman search using SIMD parallelization	Both	Daily J	2015
PSI-BLAST \| Position-specific iterative BLAST, local search with position-specific scoring matrices, much more sensitive than BLAST	Protein	Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ{{Cite journal\|author=Altschul SF \|title=Gapped BLAST and PSI-BLAST: a new generation of protein database search programs \|journal=Nucleic Acids Research \|volume=25 \|issue=17 \|pages=3389–402 \|date=September 1997 \|pmid=9254694 \|pmc=146917 \|doi=10.1093/nar/25.17.3389\|name-list-style=vanc\|author2=Madden TL\|author3=Schäffer AA\|display-authors=3\|last4=Zhang\|first4=J\|last5=Zhang\|first5=Z\|last6=Miller\|first6=W\|last7=Lipman\|first7=DJ}}	1997
PSI-Search \| Combining the Smith-Waterman search algorithm with the PSI-BLAST profile construction strategy to find distantly related protein sequences, and preventing homologous over-extension errors.	Protein	Li W, McWilliam H, Goujon M, Cowley A, Lopez R, Pearson WR{{Cite journal\|author=Li W \|title=PSI-Search: iterative HOE-reduced profile SSEARCH searching \|journal=Bioinformatics \|volume=28 \|issue=12 \|pages=1650–1651 \|date=June 2012 \|pmid=22539666 \|pmc=3371869 \|doi=10.1093/bioinformatics/bts240 \|name-list-style=vanc\|author2= McWilliam H\|author3=Goujon M\|display-authors=3\|last4=Cowley\|first4=A\|last5=Lopez\|first5=R\|last6=Pearson\|first6=WR}}	2012
R&R \|Retrieve and Relate (R&R) is a high performance yet sensitive multi-database search engine, capable of searching in parallel through DNA,RNA and Protein sequences. \|Both \| \|2019
ScalaBLAST \| Highly parallel Scalable BLAST	Both	Oehmen et al.{{cite journal \|last1=Oehmen \|first1=C.\|last2= Nieplocha \|first2=J. \|title=ScalaBLAST: A scalable implementation of BLAST for high-performance data-intensive bioinformatics analysis\|journal=IEEE Transactions on Parallel and Distributed Systems \|volume=17 \|issue=8 \|pages=740–749 \|date=August 2006 \|doi=10.1109/TPDS.2006.112\|s2cid=11122366\|url=https://zenodo.org/record/1232261 }}	2011
Sequilab \| Linking and profiling sequence alignment data from NCBI-BLAST results with major sequence analysis servers/services	Nucleotide, peptide		2010
SAM \| Local and global search with profile Hidden Markov models, more sensitive than PSI-BLAST	Both	Karplus K, Krogh A{{cite report\|last1=Hughey\|first1= R.\|last2=Karplus\|first2= K.\|last3= Krogh\|first3= A. \|title= SAM: sequence alignment and modeling software system. Technical report UCSC-CRL-99-11 \|publisher= University of California, Santa Cruz, CA\|year=2003 \|url=http://compbio.soe.ucsc.edu/papers/sam_doc/sam_doc.html}}	1999
SSEARCH \| Smith-Waterman search, slower but more sensitive than FASTA	Both
SWAPHI \| First parallelized algorithm employing the emerging Intel Xeon Phis to accelerate Smith-Waterman protein database search	Protein	Liu Y and Schmidt B	2014
SWAPHI-LS \| First parallel Smith-Waterman algorithm exploiting Intel Xeon Phi clusters to accelerate the alignment of long DNA sequences	DNA	Liu Y, Tran TT, Lauenroth F, Schmidt B	2014
SWIMM \| Smith-Waterman implementation for Intel Multicore and Manycore architectures	Protein	Rucci E, García C, Botella G, De Giusti A, Naiouf M and Prieto-Matías M{{Cite journal\|last1=Rucci\|first1=Enzo\|last2=García\|first2=Carlos\|last3=Botella\|first3=Guillermo\|last4=De Giusti\|first4=Armando\|last5=Naiouf\|first5=Marcelo\|last6=Prieto-Matías\|first6=Manuel\|date=2015-12-25\|title=An energy-aware performance analysis of SWIMM: Smith–Waterman implementation on Intel's Multicore and Manycore architectures\|journal=Concurrency and Computation: Practice and Experience\|volume=27\|issue=18\|pages=5517–5537\|doi=10.1002/cpe.3598\|s2cid=42945406\|issn=1532-0634\|url=http://sedici.unlp.edu.ar/handle/10915/82869\|hdl=11336/53930\|hdl-access=free}}	2015
SWIMM2.0 \| Enhanced Smith-Waterman on Intel's Multicore and Manycore architectures based on AVX-512 vector extensions	Protein	Rucci E, García C, Botella G, De Giusti A, Naiouf M and Prieto-Matías M{{Cite journal\|last1=Rucci\|first1=Enzo\|last2=García\|first2=Carlos\|last3=Botella\|first3=Guillermo\|last4=De Giusti\|first4=Armando\|last5=Naiouf\|first5=Marcelo\|last6=Prieto-Matías\|first6=Manuel\|date=2015-12-25\|title=SWIMM 2.0: enhanced Smith-Waterman on Intel's Multicore and Manycore architectures based on AVX-512 vector extensions\|journal=International Journal of Parallel Programming\|volume=47\|issue=2\|pages=296–317\|doi=10.1007/s10766-018-0585-7\|s2cid=49670113\|issn=1573-7640\|url=http://sedici.unlp.edu.ar/handle/10915/82888}}	2018
SWIPE \| Fast Smith-Waterman search using SIMD parallelization	Both	Rognes T	2011

*Sequence type: protein or nucleotide

Pairwise alignment

Name ! Description \|\| Sequence type* \|\| Alignment type** \|\| Author \|\| Year
class="wikitable sortable"
ACANA \| Fast heuristic anchor based pairwise alignment	Both	Both	Huang, Umbach, Li	2005
AlignMe \| Alignments for membrane protein sequences	Protein	Both	M. Stamm, K. Khafizov, R. Staritzbichler, L.R. Forrest	2013
ALLALIGN \|For DNA, RNA and protein molecules up to 32MB, aligns all sequences of size K or greater. Similar alignments are grouped together for analysis. Automatic repetitive sequence filter. \|Both \|Local \|E. Wachtel \|2017
Bioconductor Biostrings::pairwiseAlignment \| Dynamic programming	Both	Both + Ends-free	P. Aboyoun	2008
BioPerl dpAlign \| Dynamic programming	Both	Both + Ends-free	Y. M. Chan	2003
BLASTZ, LASTZ \| Seeded pattern-matching	Nucleotide	Local	Schwartz et al.{{Cite journal\| author=Schwartz S, Kent WJ, Smit A, Zhang Z, Baertsch R, Hardison RC, Haussler D, Miller W\| title=Human-mouse alignments with BLASTZ\| journal=Genome Research \|volume=13 \|issue=1 \|date=2003 \|pages=103–107 \|pmid=12529312 \| pmc=430961\| doi=10.1101/gr.809403\| last2=Kent\| last3=Smit\| last4=Zhang\| last5=Baertsch\| last6=Hardison\| last7=Haussler\| last8=Miller}}{{Cite thesis\| author=Harris R S \| year=2007\| title=Improved pairwise alignment of genomic DNA}}	2004,2009
CUDAlign \| DNA sequence alignment of unrestricted size in single or multiple GPUs	Nucleotide	Local, SemiGlobal, Global	E. Sandes{{Cite journal\|author=Sandes, Edans F. de O. \|author2=de Melo, Alba Cristina M.A.\|title=Retrieving Smith-Waterman Alignments with Optimizations for Megabase Biological Sequences Using GPU \|journal=IEEE Transactions on Parallel and Distributed Systems\|volume=24 \|issue=5 \| pages=1009–1021 \|date=May 2013 \|doi=10.1109/TPDS.2012.194}}{{Cite conference\|author=Sandes, Edans F. de O. \|author2=Miranda, G. \|author3=De Melo, A.C.M.A. \|author4=Martorell, X. \|author5=Ayguade, E.\|title=CUDAlign 3.0: Parallel Biological Sequence Comparison in Large GPU Clusters \|conference=Cluster, Cloud and Grid Computing (CCGrid), 2014 14th IEEE/ACM International Symposium on \|page=160 \|date=May 2014 \|doi=10.1109/CCGrid.2014.18}}{{Cite conference\|author=Sandes, Edans F. de O. \|author2=Miranda, G. \|author3=De Melo, A.C.M.A. \|author4=Martorell, X. \|author5=Ayguade, E.\|title=Fine-grain Parallel Megabase Sequence Comparison with Multiple Heterogeneous GPUs \|conference=Proceedings of the 19th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming \|pages=383–384 \|date=August 2014 \|doi=10.1145/2555243.2555280}}	2011-2015
DNADot \| Web-based dot-plot tool	Nucleotide	Global	R. Bowen	1998
DOTLET \| Java-based dot-plot tool	Both	Global	M. Pagni and T. Junier	1998
FEAST \| Posterior based local extension with descriptive evolution model	Nucleotide	Local	A. K. Hudek and D. G. Brown	2010
Genome Compiler Genome Compiler \|Align chromatogram files (.ab1, .scf) against a template sequence, locate errors, and correct them instantly. \|Nucleotide \|Local \|Genome Compiler Corporation \|2014
G-PAS \| GPU-based dynamic programming with backtracking	Both	Local, SemiGlobal, Global	W. Frohmberg, M. Kierzynka et al.	2011
GapMis \| Does pairwise sequence alignment with one gap	Both	SemiGlobal	K. Frousios, T. Flouri, C. S. Iliopoulos, K. Park, S. P. Pissis, G. Tischler	2012
Genome Magician \| Software for ultra fast local DNA sequence motif search and pairwise alignment for NGS data (FASTA, FASTQ).	DNA	Local, SemiGlobal, Global	Hepperle D (www.sequentix.de)	2020
GGSEARCH, GLSEARCH \| Global:Global (GG), Global:Local (GL) alignment with statistics	Protein	Global in query	W. Pearson	2007
JAligner \| Java open-source implementation of Smith-Waterman	Both	Local	A. Moustafa	2005
K*Sync \| Protein sequence to structure alignment that includes secondary structure, structural conservation, structure-derived sequence profiles, and consensus alignment scores	Protein	Both	D. Chivian & D. Baker{{Cite journal \|last1=Chivian \|first1=D \|last2=Baker \|first2=D \|title=Homology modeling using parametric alignment ensemble generation with consensus and energy-based model selection \|journal=Nucleic Acids Research \|volume=34 \|issue=17 \|pages=e112 \|date=2006 \|pmid=16971460 \|pmc=1635247 \|doi=10.1093/nar/gkl480}}	2003
LALIGN \| Multiple, non-overlapping, local similarity (same algorithm as SIM)	Both	Local non-overlapping	W. Pearson	1991 (algorithm)
NW-align \| Standard Needleman-Wunsch dynamic programming algorithm	Protein	Global	Y Zhang	2012
matcher \| Waterman-Eggert local alignment (based on LALIGN)	Both	Local	I. Longden (modified from W. Pearson)	1999
MCALIGN2 \| explicit models of indel evolution	DNA	Global	J. Wang et al.	2006
MegAlign Pro (Lasergene Molecular Biology) \| Software to align DNA, RNA, protein, or DNA + protein sequences via pairwise and multiple sequence alignment algorithms including MUSCLE, Mauve, MAFFT, Clustal Omega, Jotun Hein, Wilbur-Lipman, Martinez Needleman-Wunsch, Lipman-Pearson and Dotplot analysis.	Both	Both	DNASTAR	1993-2016
MUMmer \| suffix tree based	Nucleotide	Global	S. Kurtz et al.	2004
needle \| Needleman-Wunsch dynamic programming	Both	SemiGlobal	A. Bleasby	1999
Ngila \| logarithmic and affine gap costs and explicit models of indel evolution	Both	Global	R. Cartwright	2007
NW \| Needleman-Wunsch dynamic programming	Both	Global	A.C.R. Martin	1990-2015
parasail \| C/C++/Python/Java SIMD dynamic programming library for SSE, AVX2	Both	Global, Ends-free, Local	J. Daily	2015
Path \| Smith-Waterman on protein back-translation graph (detects frameshifts at protein level)	Protein	Local	M. Gîrdea et al.{{Cite journal \|last1=Girdea \|first1=M \|last2=Noe \|first2=L \|last3=Kucherov \|first3=G \|title=Back-translation for discovering distant protein homologies in the presence of frameshift mutations \|journal=Algorithms for Molecular Biology \|volume=5 \|issue=6 \|page=6 \|date=January 2010 \|pmid=20047662 \|pmc=2821327 \|doi=10.1186/1748-7188-5-6 \|doi-access=free }}	2009
PatternHunter \| Seeded pattern-matching	Nucleotide	Local	B. Ma et al.{{Cite journal \|last1=Ma \|first1=B. \|last2=Tromp \|first2=J. \|last3=Li \|first3=M. \|title=PatternHunter: faster and more sensitive homology search \|journal=Bioinformatics \|volume=18 \|issue=3 \|date=2002 \|pages=440–445 \|pmid=11934743 \|doi=10.1093/bioinformatics/18.3.440\|doi-access=free }}{{Cite journal \|last1=Li \|first1=M. \|last2=Ma \|first2=B. \|last3=Kisman \|first3=D. \|last4=Tromp \|first4=J. \|title=Patternhunter II: highly sensitive and fast homology search \|journal=Journal of Bioinformatics and Computational Biology \|volume=2 \|issue=3 \|date=2004 \|pages=417–439\|pmid=15359419 \|doi=10.1142/S0219720004000661\|citeseerx=10.1.1.1.2393 }}	2002–2004
ProbA (also propA) \| Stochastic partition function sampling via dynamic programming	Both	Global	U. Mückstein	2002
PyMOL \| "align" command aligns sequence & applies it to structure	Protein	Global (by selection)	W. L. DeLano	2007
REPuter \| suffix tree based	Nucleotide	Local	S. Kurtz et al.	2001
SABERTOOTH \| Alignment using predicted Connectivity Profiles	Protein	Global	F. Teichert, J. Minning, U. Bastolla, and M. Porto	2009
Satsuma \| Parallel whole-genome synteny alignments	DNA	Local	M.G. Grabherr et al.	2010
SEQALN \| Various dynamic programming	Both	Local or global	M.S. Waterman and P. Hardy	1996
SIM, GAP, NAP, LAP \| Local similarity with varying gap treatments	Both	Local or global	X. Huang and W. Miller	1990-6
SIM \| Local similarity	Both	Local	X. Huang and W. Miller	1991
SPA: Super pairwise alignment \| Fast pairwise global alignment	Nucleotide	Global	Shen, Yang, Yao, Hwang	2002
SSEARCH \| Local (Smith-Waterman) alignment with statistics	Protein	Local	W. Pearson	1981 (Algorithm)
Sequences Studio \| \|Java applet demonstrating various algorithms from{{Cite book\|first=Dan \|last=Gusfield \|year=1997\|title= Algorithms on strings, trees and sequences\|publisher= Cambridge university press\| isbn= 978-0-521-58519-4}}	Generic sequence	Local and global	A.Meskauskas	1997 (reference book)
SWIFOLD \| Smith-Waterman Acceleration on Intel's FPGA with OpenCL for Long DNA Sequences	Nucleotide	Local	E. Rucci{{Cite journal\|author=Rucci, Enzo \|author2=Garcia, Carlos\|author3=Botella, Guillermo\|author4=Naiouf, Marcelo\|author5=De Giusti,Armando\|author6=Prieto-Matias, Manuel\|title=SWIFOLD: Smith-Waterman implementation on FPGA with OpenCL for long DNA sequences \|journal=BMC Systems Biology\|volume=12 \| doi=10.1186/s12918-018-0614-6\|doi-access=free \|year=2018\|issue=Suppl 5\|page=96\|pmid=30458766\|pmc=6245597}}{{Cite conference\|author=Rucci, Enzo \|author2=Garcia, Carlos\|author3=Botella, Guillermo\|author4=Naiouf, Marcelo\|author5=De Giusti,Armando\|author6=Prieto-Matias, Manuel\|title=Accelerating Smith-Waterman Alignment of Long DNA Sequences with OpenCL on FPGA \|conference=5th International Work-Conference on Bioinformatics and Biomedical Engineering \|pages=500–511 \|doi=10.1007/978-3-319-56154-7_45}}	2017-2018
SWIFT suit \| Fast Local Alignment Searching	DNA	Local	K. Rasmussen,{{Cite journal\|author=Rasmussen K, Stoye J, Myers EW\| title=Efficient q-Gram Filters for Finding All epsilon-Matches over a Given Length \|journal=Journal of Computational Biology \|volume=13 \|issue=2 \|pages=296–308 \|date=2006 \|pmid=16597241 \| doi=10.1089/cmb.2006.13.296\| last2=Stoye \| last3=Myers\| citeseerx=10.1.1.465.2084 }} W. Gerlach	2005,2008
stretcher \| Memory-optimized Needleman-Wunsch dynamic programming	Both	Global	I. Longden (modified from G. Myers and W. Miller)	1999
tranalign \| Aligns nucleic acid sequences given a protein alignment	Nucleotide	NA	G. Williams (modified from B. Pearson)	2002
UGENE \| Opensource Smith-Waterman for SSE/CUDA, Suffix array based repeats finder & dotplot	Both	Both	UniPro	2010
water \| Smith-Waterman dynamic programming	Both	Local	A. Bleasby	1999
wordmatch \| k-tuple pairwise match	Both	NA	I. Longden	1998
YASS \| Seeded pattern-matching	Nucleotide	Local	L. Noe and G. Kucherov{{Cite journal\|author=Noe L, Kucherov G\|title=YASS: enhancing the sensitivity of DNA similarity search \|journal=Nucleic Acids Research \|volume= 33\|issue=suppl_2 \|pages=W540–W543 \|date=2005 \|pmid=15980530 \| pmc=1160238 \|doi=10.1093/nar/gki478\|last2=Kucherov}}	2004

*Sequence type: protein or nucleotide **Alignment type: local or global

Multiple sequence alignment

Name ! Description \|\| Sequence type* \|\| Alignment type** \|\| Author \|\| Year \|\| License
class="wikitable sortable"
ABA \| A-Bruijn alignment	Protein	Global	B.Raphael et al.	2004	{{proprietary}}, freeware for education, research, nonprofit
ALE \| manual alignment ; some software assistance	Nucleotides	Local	J. Blandy and K. Fogel	1994 (latest version 2007)	{{free}}, GPL2
ALLALIGN \|For DNA, RNA and protein molecules up to 32MB, aligns all sequences of size K or greater, MSA or within a single molecule. Similar alignments are grouped together for analysis. Automatic repetitive sequence filter. \|Both \|Local \|E. Wachtel \|2017 \|Free
AMAP \| Sequence annealing	Both	Global	A. Schwartz and L. Pachter	2006
BAli-Phy \| Tree+multi-alignment; probabilistic-Bayesian; joint estimation	Both + Codons	Global	BD Redelings and MA Suchard	2005 (latest version 2018)	{{free}}, GPL
Base-By-Base \| Java-based multiple sequence alignment editor with integrated analysis tools	Both	Local or global	R. Brodie et al.	2004	{{proprietary}}, freeware, must register
CHAOS, DIALIGN \| Iterative alignment	Both	Local (preferred)	M. Brudno and B. Morgenstern	2003
Clustal \| Progressive alignment	Both	Local or global	Thompson et al.	1994	{{free}}, LGPL
CodonCode Aligner \| Multi-alignment; ClustalW & Phrap support	Nucleotides	Local or global	P. Richterich et al.	2003 (latest version 2009)
Compass \| COmparison of Multiple Protein sequence Alignments with assessment of Statistical Significance	Protein	Global	R.I. Sadreyev, et al.	2009
DECIPHER \| Progressive-iterative alignment	Both	Global	Erik S. Wright	2014	{{free}}, GPL
DIALIGN-TX and DIALIGN-T \| Segment-based method	Both	Local (preferred) or Global	A.R.Subramanian	2005 (latest version 2008)
DNA Alignment \| Segment-based method for intraspecific alignments	Both	Local (preferred) or Global	A.Roehl	2005 (latest version 2008)
DNA Baser Sequence Assembler \| Multi-alignment; Full automatic sequence alignment; Automatic ambiguity correction; Internal base caller; Command line seq alignment	Nucleotides	Local or global	Heracle BioSoft SRL	2006 (latest version 2018)	Commercial (some modules are freeware)
DNADynamo \| linked DNA to Protein multiple alignment with MUSCLE, Clustal and Smith-Waterman	Both	Local or global	DNADynamo	2004 (newest version 2017)
EDNA \| Energy Based Multiple Sequence Alignment for DNA Binding Sites	Nucleotides	Local or global	Salama, RA. et al.	2013
FAMSA \|Progressive alignment for extremely large protein families (hundreds of thousands of members) \|Protein \|Global \|Deorowicz et al. \|2016 \|{{free}}, GPL 3
FSA \| Sequence annealing	Both	Global	R. K. Bradley et al.	2008
Geneious \| Progressive-Iterative alignment; ClustalW plugin	Both	Local or global	A.J. Drummond et al.	2005 (latest version 2017)
GUIDANCE \| Quality control and filtering of multiple sequence alignments	Both	Local or global	O. Penn et al.	2010 (latest version 2015)
Kalign \| Progressive alignment	Both	Global	T. Lassmann	2005
MACSE \|Progressive-iterative alignment. Multiple alignment of coding sequences accounting for frameshifts and stop codons. \|Nucleotides \|Global \|V. Ranwez et al. \|2011 (latest version, v2.07 2023) \|
MAFFT \| Progressive-iterative alignment	Both	Local or global	K. Katoh et al.	2005	{{free}}, BSD
MARNA \| Multi-alignment of RNAs	RNA	Local	S. Siebert et al.	2005
MAVID \| Progressive alignment	Both	Global	N. Bray and L. Pachter	2004
MegAlign Pro (Lasergene Molecular Biology) \| Software to align DNA, RNA, protein, or DNA + protein sequences via pairwise and multiple sequence alignment algorithms including MUSCLE, Mauve, MAFFT, Clustal Omega, Jotun Hein, Wilbur-Lipman, Martinez Needleman-Wunsch, Lipman-Pearson and Dotplot analysis.	Both	Local or global	DNASTAR	1993-2023
MSA \| Dynamic programming	Both	Local or global	D.J. Lipman et al.	1989 (modified 1995)
MSAProbs \| Dynamic programming	Protein	Global	Y. Liu, B. Schmidt, D. Maskell	2010
MULTALIN \| Dynamic programming-clustering	Both	Local or global	F. Corpet	1988
Multi-LAGAN \| Progressive dynamic programming alignment	Both	Global	M. Brudno et al.	2003
MUSCLE \| Progressive-iterative alignment	Both	Local or global	R. Edgar	2004
Opal \| Progressive-iterative alignment	Both	Local or global	T. Wheeler and J. Kececioglu	2007 (latest stable 2013, latest beta 2016)
Pecan \| Probabilistic-consistency	DNA	Global	B. Paten et al.	2008
Phylo \| A human computing framework for comparative genomics to solve multiple alignment	Nucleotides	Local or global	McGill Bioinformatics	2010
PMFastR \| Progressive structure aware alignment \| RNA \| Global \| D. DeBlasio, J Braund, S Zhang \| 2009 \|
Praline \| Progressive-iterative-consistency-homology-extended alignment with preprofiling and secondary structure prediction	Protein	Global	J. Heringa	1999 (latest version 2009)
PicXAA \| Nonprogressive, maximum expected accuracy alignment	Both	Global	S.M.E. Sahraeian and B.J. Yoon	2010
POA \| Partial order/hidden Markov model	Protein	Local or global	C. Lee	2002
Probalign \| Probabilistic/consistency with partition function probabilities	Protein	Global	Roshan and Livesay	2006	{{free}}, public domain
ProbCons \| Probabilistic/consistency	Protein	Local or global	C. Do et al.	2005	{{free}}, public domain
PROMALS3D \| Progressive alignment/hidden Markov model/Secondary structure/3D structure	Protein	Global	J. Pei et al.	2008
PRRN/PRRP \| Iterative alignment (especially refinement)	Protein	Local or global	Y. Totoki (based on O. Gotoh)	1991 and later
PSAlign \| Alignment preserving non-heuristic	Both	Local or global	S.H. Sze, Y. Lu, Q. Yang.	2006
RevTrans \| Combines DNA and Protein alignment, by back translating the protein alignment to DNA.	DNA/Protein (special)	Local or global	Wernersson and Pedersen	2003 (newest version 2005)
SAGA \| Sequence alignment by genetic algorithm	Protein	Local or global	C. Notredame et al.	1996 (new version 1998)
SAM \| Hidden Markov model	Protein	Local or global	A. Krogh et al.	1994 (most recent version 2002)
Se-Al \| Manual alignment	Both	Local	A. Rambaut	2002
StatAlign \| Bayesian co-estimation of alignment and phylogeny (MCMC)	Both	Global	A. Novak et al.	2008
Stemloc \| Multiple alignment and secondary structure prediction	RNA	Local or global	I. Holmes	2005	{{free}}, GPL 3 (parte de DART)
T-Coffee \| More sensitive progressive alignment	Both	Local or global	C. Notredame et al.	2000 (newest version 2008)	{{free}}, GPL 2
UGENE \| Supports multiple alignment with MUSCLE, KAlign, Clustal and MAFFT plugins	Both	Local or global	UGENE team	2010 (newest version 2020)	{{free}}, GPL 2
VectorFriends \| VectorFriends Aligner, MUSCLE plugin, and ClustalW plugin	Both	Local or global	BioFriends team	2013	{{proprietary}}, freeware for academic use
GLProbs \| Adaptive pair-Hidden Markov Model based approach	Protein	Global	Y. Ye et al.	2013

*Sequence type: protein or nucleotide. **Alignment type: local or global

Genomics analysis

class="wikitable sortable"
Name ! Description ! Sequence type*
EAGLE {{cite journal\|last1=Pratas\|first1=Diogo\|last2=Silva\|first2=Jorge\|title=Persistent minimal sequences of SARS-CoV-2\|journal=Bioinformatics\|year=2020\|volume=36\|issue=21\|pages=5129–5132\|doi=10.1093/bioinformatics/btaa686\|pmid=32730589\|pmc=7559010\|doi-access=free}} \| An ultra-fast tool to find relative absent words in genomic data \| Nucleotide
ACT (Artemis Comparison Tool) \| Synteny and comparative genomics \| Nucleotide
AVID \| Pairwise global alignment with whole genomes	Nucleotide
BLAT \| Alignment of cDNA sequences to a genome.	Nucleotide
DECIPHER \| Alignment of rearranged genomes using 6 frame translation	Nucleotide
FLAK \| Fuzzy whole genome alignment and analysis	Nucleotide
GMAP \| Alignment of cDNA sequences to a genome. Identifies splice site junctions with high accuracy.	Nucleotide
Splign \| Alignment of cDNA sequences to a genome. Identifies splice site junctions with high accuracy. Able to recognize and separate gene duplications.	Nucleotide
Mauve \| Multiple alignment of rearranged genomes	Nucleotide
MGA \| Multiple Genome Aligner	Nucleotide
Mulan \| Local multiple alignments of genome-length sequences	Nucleotide
Multiz \| Multiple alignment of genomes	Nucleotide
PLAST-ncRNA \| Search for ncRNAs in genomes by partition function local alignment	Nucleotide
Sequerome \| Profiling sequence alignment data with major servers/services	Nucleotide, peptide
Sequilab \| Profiling sequence alignment data from NCBI-BLAST results with major servers-services	Nucleotide, peptide
Shuffle-LAGAN \| Pairwise global alignment of completed genome regions	Nucleotide
SIBsim4, Sim4 \| A program designed to align an expressed DNA sequence with a genomic sequence, allowing for introns	Nucleotide
SLAM \| Gene finding, alignment, annotation (human-mouse homology identification)	Nucleotide \|
SRPRISM \| An efficient aligner for assemblies with explicit guarantees, aligning reads without splices	Nucleotide

class="wikitable sortable"

Name

! Description

! Sequence type*

EAGLE {{cite journal|last1=Pratas|first1=Diogo|last2=Silva|first2=Jorge|title=Persistent minimal sequences of SARS-CoV-2|journal=Bioinformatics|year=2020|volume=36|issue=21|pages=5129–5132|doi=10.1093/bioinformatics/btaa686|pmid=32730589|pmc=7559010|doi-access=free}}

| An ultra-fast tool to find relative absent words in genomic data

| Nucleotide

ACT (Artemis Comparison Tool)

| Synteny and comparative genomics

| Nucleotide

AVID

| Pairwise global alignment with whole genomes

Nucleotide

BLAT

| Alignment of cDNA sequences to a genome.

Nucleotide

DECIPHER

| Alignment of rearranged genomes using 6 frame translation

Nucleotide

FLAK

| Fuzzy whole genome alignment and analysis

Nucleotide

GMAP

| Alignment of cDNA sequences to a genome. Identifies splice site junctions with high accuracy.

Nucleotide

Splign

| Alignment of cDNA sequences to a genome. Identifies splice site junctions with high accuracy. Able to recognize and separate gene duplications.

Nucleotide

Mauve

| Multiple alignment of rearranged genomes

Nucleotide

MGA

| Multiple Genome Aligner

Nucleotide

Mulan

| Local multiple alignments of genome-length sequences

Nucleotide

Multiz

| Multiple alignment of genomes

Nucleotide

PLAST-ncRNA

| Search for ncRNAs in genomes by partition function local alignment

Nucleotide

Sequerome

| Profiling sequence alignment data with major servers/services

Nucleotide, peptide

Sequilab

| Profiling sequence alignment data from NCBI-BLAST results with major servers-services

Nucleotide, peptide

Shuffle-LAGAN

| Pairwise global alignment of completed genome regions

Nucleotide

SIBsim4, Sim4

| A program designed to align an expressed DNA sequence with a genomic sequence, allowing for introns

Nucleotide

SLAM

| Gene finding, alignment, annotation (human-mouse homology identification)

Nucleotide

SRPRISM

| An efficient aligner for assemblies with explicit guarantees, aligning reads without splices

Nucleotide

*Sequence type: protein or nucleotide

Motif finding

Name ! Description \|\| Sequence type*
class="wikitable sortable"
PMS \| Motif search and discovery	Both
FMM \| Motif search and discovery (can get also positive & negative sequences as input for enriched motif search)	Nucleotide
BLOCKS \| Ungapped motif identification from BLOCKS database	Both
eMOTIF \| Extraction and identification of shorter motifs	Both
Gibbs motif sampler \| Stochastic motif extraction by statistical likelihood	Both
HMMTOP \| Prediction of transmembrane helices and topology of proteins	Protein
I-sites \| Local structure motif library	Protein
JCoils \| Prediction of Coiled coil and Leucine Zipper	Protein
MEME/MAST \| Motif discovery and search	Both
CUDA-MEME \| GPU accelerated MEME (v4.4.0) algorithm for GPU clusters	Both
MERCI \| Discriminative motif discovery and search	Both
PHI-Blast \| Motif search and alignment tool	Both
Phyloscan \| Motif search tool	Nucleotide
PRATT \| Pattern generation for use with ScanProsite	Protein
ScanProsite \| Motif database search tool	Protein
TEIRESIAS \| Motif extraction and database search	Both
BASALT \| Multiple motif and regular expression search	Both

*Sequence type: protein or nucleotide

Benchmarking

class="wikitable sortable"
Name ! Authors
PFAM 30.0 (2016) \|
SMART (2015) \| Letunic, Copley, Schmidt, Ciccarelli, Doerks, Schultz, Ponting, Bork
BAliBASE 3 (2015) \| Thompson, Plewniak, Poch
Oxbench (2011) \| Raghava, Searle, Audley, Barber, Barton
Benchmark collection (2009) \| Edgar
HOMSTRAD (2005) \| Mizuguchi
PREFAB 4.0 (2005) \| Edgar
SABmark (2004) \| Van Walle, Lasters, Wyns

class="wikitable sortable"

Name

! Authors

PFAM 30.0 (2016)

SMART (2015)

| Letunic, Copley, Schmidt, Ciccarelli, Doerks, Schultz, Ponting, Bork

BAliBASE 3 (2015)

| Thompson, Plewniak, Poch

Oxbench (2011)

| Raghava, Searle, Audley, Barber, Barton

Benchmark collection (2009)

| Edgar

HOMSTRAD (2005)

| Mizuguchi

PREFAB 4.0 (2005)

| Edgar

SABmark (2004)

| Van Walle, Lasters, Wyns

Alignment viewers, editors

Please see List of alignment visualization software.

Short-read sequence alignment

class="wikitable sortable"
Name ! Description ! paired-end option ! Use FASTQ quality ! Gapped ! Multi-threaded ! License ! Reference ! Year
Arioc \| Computes Smith-Waterman gapped alignments and mapping qualities on one or more GPUs. Supports BS-seq alignments. Processes 100,000 to 500,000 reads per second (varies with data, hardware, and configured sensitivity). \| {{yes}} \| {{no}} \| {{yes}} \| {{yes}} \| {{free}}, BSD \|{{cite journal\|last1=Wilton\|first1=Richard\|last2=Budavari\|first2=Tamas\|last3=Langmead\|first3=Ben\|last4=Wheelan\|first4=Sarah J.\|last5=Salzberg\|first5=Steven L.\|last6=Szalay\|first6=Alexander S.\|title=Arioc: high-throughput read alignment with GPU-accelerated exploration of the seed-and-extend search space\|journal=PeerJ\|volume=3\|pages=e808\|year=2015\|doi=10.7717/peerj.808\|pmid=25780763\|pmc=4358639 \|doi-access=free }} \| 2015
BarraCUDA \| A GPGPU accelerated Burrows–Wheeler transform (FM-index) short read alignment program based on BWA, supports alignment of indels with gap openings and extensions. \| {{yes}} \| {{no}} \| {{yes}} \| {{yes}}, POSIX Threads and CUDA \| {{free}}, GPL \| \|
BBMap \| Uses a short kmers to rapidly index genome; no size or scaffold count limit. Higher sensitivity and specificity than Burrows–Wheeler aligners, with similar or greater speed. Performs affine-transform-optimized global alignment, which is slower but more accurate than Smith-Waterman. Handles Illumina, 454, PacBio, Sanger, and Ion Torrent data. Splice-aware; capable of processing long indels and RNA-seq. Pure Java; runs on any platform. Used by the Joint Genome Institute. \| {{yes}} \| {{yes}} \| {{yes}} \| {{yes}} \| {{free}}, BSD \| \| 2010
BFAST \| Explicit time and accuracy tradeoff with a prior accuracy estimation, supported by indexing the reference sequences. Optimally compresses indexes. Can handle billions of short reads. Can handle insertions, deletions, SNPs, and color errors (can map ABI SOLiD color space reads). Performs a full Smith Waterman alignment. \| \| \| \| {{yes}}, POSIX Threads \| {{free}}, GPL \|{{cite journal\|last1=Homer\|first1=Nils\|last2=Merriman\|first2=Barry\|last3=Nelson\|first3=Stanley F.\|title=BFAST: An Alignment Tool for Large Scale Genome Resequencing\|journal=PLOS ONE\|volume=4\|issue=11\|year=2009\|pages=e7767\|pmid=19907642\|pmc=2770639\|doi=10.1371/journal.pone.0007767\|bibcode=2009PLoSO...4.7767H\|doi-access=free}} \| 2009
BigBWA \| Runs the Burrows–Wheeler Aligner-BWA on a Hadoop cluster. It supports the algorithms BWA-MEM, BWA-ALN, and BWA-SW, working with paired and single reads. It implies an important reduction in the computational time when running in a Hadoop cluster, adding scalability and fault-tolerance. \| {{yes}} \| {{partial\|Low quality bases trimming}} \| {{yes}} \| {{yes}} \| {{free}}, GPL 3 \|{{cite journal\|last1=Abuín\|first1=J.M.\|last2=Pichel\|first2=J.C.\|last3=Pena\|first3=T.F.\|last4=Amigo\|first4=J.\|title=BigBWA: approaching the Burrows–Wheeler aligner to Big Data technologies\|journal=Bioinformatics\|year=2015\|doi=10.1093/bioinformatics/btv506\|pmid=26323715\|volume=31\|issue=24\|pages=4003–5\|doi-access=free}} \| 2015
BLASTN \| BLAST's nucleotide alignment program, slow and not accurate for short reads, and uses a sequence database (EST, Sanger sequence) rather than a reference genome. \| \| \| \| \| \| \|
BLAT \| Made by Jim Kent. Can handle one mismatch in initial alignment step. \| \| \| \| {{yes}}, client-server \| {{proprietary}}, freeware for academic and noncommercial use \|{{cite journal\|last1=Kent\|first1=W. J.\|title=BLAT---The BLAST-Like Alignment Tool\|journal=Genome Research\|volume=12\|issue=4\|year=2002\|pages=656–664\|issn=1088-9051\|doi=10.1101/gr.229202\|pmid=11932250\|pmc=187518}} \|2002
Bowtie \| Uses a Burrows–Wheeler transform to create a permanent, reusable index of the genome; 1.3 GB memory footprint for human genome. Aligns more than 25 million Illumina reads in 1 CPU hour. Supports Maq-like and SOAP-like alignment policies \| {{yes}} \| {{yes}} \| {{no}} \| {{yes}}, POSIX Threads \| {{free}}, Artistic \|{{cite journal\|last1=Langmead\|first1=Ben\|last2=Trapnell\|first2=Cole\|last3=Pop\|first3=Mihai\|last4=Salzberg\|first4=Steven L\|title=Ultrafast and memory-efficient alignment of short DNA sequences to the human genome\|journal=Genome Biology\|volume=10\|issue=3\|year=2009\|pages=R25\|issn=1465-6906\|doi=10.1186/gb-2009-10-3-r25\|pmid=19261174\|pmc=2690996 \|doi-access=free }} \|2009
BWA \| Uses a Burrows–Wheeler transform to create an index of the genome. It's a bit slower than Bowtie but allows indels in alignment. \| {{yes}} \| {{partial\|Low quality bases trimming}} \| {{yes}} \| {{yes}} \| {{free}}, GPL \|{{cite journal\|last1=Li\|first1=H.\|last2=Durbin\|first2=R.\|title=Fast and accurate short read alignment with Burrows–Wheeler transform\|journal=Bioinformatics\|volume=25\|issue=14\|year=2009\|pages=1754–1760\|issn=1367-4803\|doi=10.1093/bioinformatics/btp324\|pmid=19451168\|pmc=2705234}} \| 2009
BWA-PSSM \| A probabilistic short read aligner based on the use of position specific scoring matrices (PSSM). The aligner is adaptable in the sense that it can take into account the quality scores of the reads and models of data specific biases, such as those observed in Ancient DNA, PAR-CLIP data or genomes with biased nucleotide compositions. \| {{yes}} \| {{yes}} \| {{yes}} \| {{yes}} \| {{free}}, GPL \|{{cite journal\|last1=Kerpedjiev\|first1=Peter\|last2=Frellsen\|first2=Jes\|last3=Lindgreen\|first3=Stinus\|last4=Krogh\|first4=Anders\|title=Adaptable probabilistic mapping of short reads using position specific scoring matrices\|journal=BMC Bioinformatics\|volume=15\|issue=1\|year=2014\|page=100\|issn=1471-2105\|doi=10.1186/1471-2105-15-100\|pmid=24717095\|pmc=4021105 \|doi-access=free }} \| 2014
CASHX \| Quantify and manage large quantities of short-read sequence data. CASHX pipeline contains a set of tools that can be used together, or separately as modules. This algorithm is very accurate for perfect hits to a reference genome. \| \| \| \| {{no}} \| {{proprietary}}, freeware for academic and noncommercial use \| \|
Cloudburst \| Short-read mapping using Hadoop MapReduce \| \| \| \| {{yes}}, Hadoop MapReduce \| {{free}}, Artistic \| \|
CUDA-EC \| Short-read alignment error correction using GPUs. \| \| \| \| {{yes}}, GPU enabled \| \| \|
CUSHAW \| A CUDA compatible short read aligner to large genomes based on Burrows–Wheeler transform \| {{yes}} \| {{yes}} \| {{no}} \| {{yes}} (GPU enabled) \| {{free}}, GPL \|{{cite journal\|last1=Liu\|first1=Y.\|last2=Schmidt\|first2=B.\|last3=Maskell\|first3=D. L.\|title=CUSHAW: a CUDA compatible short read aligner to large genomes based on the Burrows–Wheeler transform\|journal=Bioinformatics\|volume=28\|issue=14\|year=2012\|pages=1830–1837\|issn=1367-4803\|doi=10.1093/bioinformatics/bts276\|pmid=22576173\|doi-access=free}} \|2012
CUSHAW2 \| Gapped short-read and long-read alignment based on maximal exact match seeds. This aligner supports both base-space (e.g. from Illumina, 454, Ion Torrent and PacBio sequencers) and ABI SOLiD color-space read alignments. \| {{yes}} \| {{no}} \| {{yes}} \| {{yes}} \| {{free}}, GPL \| \| 2014
CUSHAW2-GPU \| GPU-accelerated CUSHAW2 short-read aligner. \| {{yes}} \| {{no}} \| {{yes}} \| {{yes}} \| {{free}}, GPL \| \|
CUSHAW3 \| Sensitive and accurate base-space and color-space short-read alignment with hybrid seeding \| {{yes}} \| {{no}} \| {{yes}} \| {{yes}} \| {{free}}, GPL \|{{cite journal\|last1=Liu\|first1=Y.\|last2=Schmidt\|first2=B.\|title=Long read alignment based on maximal exact match seeds\|journal=Bioinformatics\|volume=28\|issue=18\|year=2012\|pages=i318–i324\|issn=1367-4803\|doi=10.1093/bioinformatics/bts414\|pmid=22962447\|pmc=3436841}} \| 2012
drFAST \| Read mapping alignment software that implements cache obliviousness to minimize main/cache memory transfers like mrFAST and mrsFAST, however designed for the SOLiD sequencing platform (color space reads). It also returns all possible map locations for improved structural variation discovery. \| {{yes}} \| {{yes}}, for structural variation \| {{yes}} \| {{no}} \| {{free}}, BSD \| \|
ELAND \| Implemented by Illumina. Includes ungapped alignment with a finite read length. \| \| \| \| \| \| \|
ERNE \| Extended Randomized Numerical alignEr for accurate alignment of NGS reads. It can map bisulfite-treated reads. \| {{yes}} \| {{partial\|Low quality bases trimming}} \| {{yes}} \| Multithreading and MPI-enabled \| {{free}}, GPL 3 \| \|
GASSST \| Finds global alignments of short DNA sequences against large DNA banks \| \| \| \| Multithreading \| CeCILL version 2 License. \|{{cite journal\|last1=Rizk\|first1=Guillaume\|last2=Lavenier\|first2=Dominique\|title=GASSST: global alignment short sequence search tool\|journal=Bioinformatics\|volume=26\|issue=20\|year=2010\|pages=2534–2540\|pmid=20739310\|pmc=2951093\|doi=10.1093/bioinformatics/btq485}} \| 2011
[https://github.com/smarco/gem3-mapper GEM] \| High-quality alignment engine (exhaustive mapping with substitutions and indels). More accurate and several times faster than BWA or Bowtie 1/2. Many standalone biological applications (mapper, split mapper, mappability, and other) provided. \| {{yes}} \| {{yes}} \| {{yes}} \| {{yes}} \| {{free}}, GPL3 \|{{cite journal\|last1=Marco-Sola\|first1=Santiago\|last2=Sammeth\|first2=Michael\|last3=Guigó\|first3=Roderic\|last4=Ribeca\|first4=Paolo\|title=The GEM mapper: fast, accurate and versatile alignment by filtration\|journal=Nature Methods\|volume=9\|issue=12\|year=2012\|pages=1185–1188\|issn=1548-7091\|doi=10.1038/nmeth.2221\|pmid=23103880\|s2cid=2004416}} \|2012
Genalice MAP \| Ultra fast and comprehensive NGS read aligner with high precision and small storage footprint. \| {{yes}} \| {{partial\|Low quality bases trimming}} \| {{yes}} \| {{yes}} \| {{proprietary}}, commercial \| \|
Geneious Assembler \| Fast, accurate overlap assembler with the ability to handle any combination of sequencing technology, read length, any pairing orientations, with any spacer size for the pairing, with or without a reference genome. \| \| \| \| {{yes}} \| {{proprietary}}, commercial \| \|
GensearchNGS \| Complete framework with user-friendly GUI to analyse NGS data. It integrates a proprietary high quality alignment algorithm and plug-in ability to integrate various public aligner into a framework allowing to import short reads, align them, detect variants, and generate reports. It is made for resequencing projects, namely in a diagnostic setting. \| {{yes}} \| {{no}} \| {{yes}} \| {{yes}} \| {{proprietary}}, commercial \| \|
GMAP and GSNAP \| Robust, fast short-read alignment. GMAP: longer reads, with multiple indels and splices (see entry above under Genomics analysis); GSNAP: shorter reads, with one indel or up to two splices per read. Useful for digital gene expression, SNP and indel genotyping. Developed by Thomas Wu at Genentech. Used by the National Center for Genome Resources (NCGR) in Alpheus. \| {{yes}} \| {{yes}} \| {{yes}} \| {{yes}} \| {{proprietary}}, freeware for academic and noncommercial use \| \|
GNUMAP \| Accurately performs gapped alignment of sequence data obtained from next-generation sequencing machines (specifically of Solexa-Illumina) back to a genome of any size. Includes adaptor trimming, SNP calling and Bisulfite sequence analysis. \| \| {{yes}}, also supports Illumina _int.txt and _prb.txt files with all 4 quality scores for each base \| \| Multithreading and MPI-enabled \| \|{{cite journal\|last1=Clement\|first1=N. L.\|last2=Snell\|first2=Q.\|last3=Clement\|first3=M. J.\|last4=Hollenhorst\|first4=P. C.\|last5=Purwar\|first5=J.\|last6=Graves\|first6=B. J.\|last7=Cairns\|first7=B. R.\|last8=Johnson\|first8=W. E.\|title=The GNUMAP algorithm: unbiased probabilistic mapping of oligonucleotides from next-generation sequencing\|journal=Bioinformatics\|volume=26\|issue=1\|year=2009\|pages=38–45\|issn=1367-4803\|doi=10.1093/bioinformatics/btp614\|pmid=19861355\|pmc=6276904}} \|2009
HIVE-hexagon \| Uses a hash table and bloom matrix to create and filter potential positions on the genome. For higher efficiency uses cross-similarity between short reads and avoids realigning non unique redundant sequences. It is faster than Bowtie and BWA and allows indels and divergent sensitive alignments on viruses, bacteria, and more conservative eukaryotic alignments. \| {{yes}} \| {{yes}} \| {{yes}} \| {{yes}} \| {{proprietary}}, freeware for academic and noncommercial users registered to HIVE deployment instance \|{{cite journal\|last1=Santana-Quintero\|first1=Luis\|last2=Dingerdissen\|first2=Hayley\|last3=Thierry-Mieg\|first3=Jean\|last4=Mazumder\|first4=Raja\|last5=Simonyan\|first5=Vahan\|title=HIVE-Hexagon: High-Performance, Parallelized Sequence Alignment for Next-Generation Sequencing Data Analysis\|journal=PLOS ONE\|year=2014\|pages=1754–1760\|doi=10.1371/journal.pone.0099033\|pmid=24918764\|volume=9\|issue=6\|pmc=4053384\|bibcode=2014PLoSO...999033S\|doi-access=free}} \| 2014
IMOS \| Improved Meta-aligner and Minimap2 On Spark. A long read distributed aligner on Apache Spark platform with linear scalability w.r.t. single node execution. \| \| {{yes}} \| {{yes}} \| {{yes}} \| {{free}} \| \|
Isaac \| Fully uses all the computing power available on one server node; thus, it scales well over a broad range of hardware architectures, and alignment performance improves with hardware abilities \| {{yes}} \| {{yes}} \| {{yes}} \| {{yes}} \| {{free}}, GPL \| \|
LAST \| Uses adaptative seeds and copes more efficiently with repeat-rich sequences (e.g. genomes). For example: it can align reads to genomes without repeat-masking, without becoming overwhelmed by repetitive hits. \| {{yes}} \| {{yes}} \| {{yes}} \| {{yes}} \| {{free}}, GPL \|{{cite journal\|last1=Kielbasa\|first1=S.M.\|last2=Wan\|first2=R.\|last3=Sato\|first3=K. \|last4=Horton\|first4=P.\|last5=Frith\|first5=M.C.\|title=Adaptive seeds tame genomic sequence comparison\|journal=Genome Research\|volume=21\|issue=3\|year=2011\|pages=487–493\|pmid=21209072\|pmc=3044862\|doi=10.1101/gr.113985.110}} \| 2011
MAQ \| Ungapped alignment that takes into account quality scores for each base. \| \| \| \| \| {{free}}, GPL \| \|
mrFAST, mrsFAST \| Gapped (mrFAST) and ungapped (mrsFAST) alignment software that implements cache obliviousness to minimize main/cache memory transfers. They are designed for the Illumina sequencing platform and they can return all possible map locations for improved structural variation discovery. \| {{yes}} \| {{yes}}, for structural variation \| {{yes}} \| {{no}} \| {{free}}, BSD \| \|
MOM \| MOM or maximum oligonucleotide mapping is a query matching tool that captures a maximal length match within the short read. \| \| \| \| {{yes}} \| \| \|
MOSAIK \| Fast gapped aligner and reference-guided assembler. Aligns reads using a banded Smith-Waterman algorithm seeded by results from a k-mer hashing scheme. Supports reads ranging in size from very short to very long. \| \| \| \| {{yes}} \| \| \|
MPscan \| Fast aligner based on a filtration strategy (no indexing, use q-grams and Backward Nondeterministic DAWG Matching) \| \| \| \| \| \| {{cite book\|last1=Rivals\|first1=Eric\|last2=Salmela\|first2=Leena\|last3=Kiiskinen\|first3=Petteri\|last4=Kalsi\|first4=Petri\|last5=Tarhio\|first5=Jorma\|title=Algorithms in Bioinformatics \|chapter=Mpscan: Fast Localisation of Multiple Reads in Genomes \|volume=5724\|year=2009\|pages= 246–260\|doi=10.1007/978-3-642-04241-6_21\|series=Lecture Notes in Computer Science\|bibcode=2009LNCS.5724..246R\|isbn=978-3-642-04240-9\|citeseerx=10.1.1.156.928\|s2cid=17187140 }} \| 2009
Novoalign & NovoalignCS \| Gapped alignment of single end and paired end Illumina GA I & II, ABI Colour space & ION Torrent reads. High sensitivity and specificity, using base qualities at all steps in the alignment. Includes adapter trimming, base quality calibration, Bi-Seq alignment, and options for reporting multiple alignments per read. Use of ambiguous IUPAC codes in reference for common SNPs can improve SNP recall and remove allelic bias. \| {{yes}} \| {{yes}} \| {{yes}} \| Multi-threading and MPI versions available with paid license \| {{proprietary}}, freeware single threaded version for academic and noncommercial use \| \|
NextGENe \| Developed for use by biologists performing analysis of next generation sequencing data from Roche Genome Sequencer FLX, Illumina GA/HiSeq, Life Technologies Applied BioSystems’ SOLiD System, PacBio and Ion Torrent platforms. \| {{yes}} \| {{yes}} \| {{yes}} \| {{yes}} \| {{proprietary}}, commercial \| \|
NextGenMap \| Flexible and fast read mapping program (twice as fast as BWA), achieves a mapping sensitivity comparable to Stampy. Internally uses a memory efficient index structure (hash table) to store positions of all 13-mers present in the reference genome. Mapping regions where pairwise alignments are required are dynamically determined for each read. Uses fast SIMD instructions (SSE) to accelerate alignment calculations on CPU. If available, alignments are computed on GPU (using OpenCL/CUDA) further reducing runtime 20-50%. \| {{yes}} \| {{no}} \| {{yes}} \| {{yes}}, POSIX Threads, OpenCL/CUDA, SSE \| {{free}} \|{{cite journal\|last1=Sedlazeck\|first1=Fritz J.\|last2=Rescheneder\|first2=Philipp\|last3=von Haeseler\|first3=Arndt\|title=NextGenMap: fast and accurate read mapping in highly polymorphic genomes\|journal=Bioinformatics\|volume=29\|issue=21\|year=2013\|pages=2790–2791\|pmid=23975764\|doi=10.1093/bioinformatics/btt468\|doi-access=free}} \| 2013
Omixon Variant Toolkit \| Includes highly sensitive and highly accurate tools for detecting SNPs and indels. It offers a solution to map NGS short reads with a moderate distance (up to 30% sequence divergence) from reference genomes. It poses no restrictions on the size of the reference, which, combined with its high sensitivity, makes the Variant Toolkit well-suited for targeted sequencing projects and diagnostics. \| {{yes}} \| {{yes}} \| {{yes}} \| {{yes}} \| {{proprietary}}, commercial \| \|
PALMapper \| Efficiently computes both spliced and unspliced alignments at high accuracy. Relying on a machine learning strategy combined with a fast mapping based on a banded Smith-Waterman-like algorithm, it aligns around 7 million reads per hour on one CPU. It refines the originally proposed QPALMA approach. \| \| \| \| {{yes}} \| {{free}}, GPL \| \|
Partek Flow \| For use by biologists and bioinformaticians. It supports ungapped, gapped and splice-junction alignment from single and paired-end reads from Illumina, Life technologies Solid TM, Roche 454 and Ion Torrent raw data (with or without quality information). It integrates powerful quality control on FASTQ/Qual level and on aligned data. Additional functionality include trimming and filtering of raw reads, SNP and InDel detection, mRNA and microRNA quantification and fusion gene detection. \| {{yes}} \| {{yes}} \| {{yes}} \| Multiprocessor-core, client-server installation possible \| {{proprietary}}, commercial, free trial version \| \|
PASS \| Indexes the genome, then extends seeds using pre-computed alignments of words. Works with base space, color space (SOLID), and can align genomic and spliced RNA-seq reads. \| {{yes}} \| {{yes}} \| {{yes}} \| {{yes}} \| {{proprietary}}, freeware for academic and noncommercial use \| \|
PerM \| Indexes the genome with periodic seeds to quickly find alignments with full sensitivity up to four mismatches. It can map Illumina and SOLiD reads. Unlike most mapping programs, speed increases for longer read lengths. \| \| \| \| {{yes}} \| {{free}}, GPL \|{{cite journal\|last1=Chen\|first1=Yangho\|last2=Souaiaia\|first2=Tade\|last3=Chen\|first3=Ting\|title=PerM: efficient mapping of short sequencing reads with periodic full sensitive spaced seeds\|journal=Bioinformatics\|volume=25\|issue=19\|year=2009\|pages=2514–2521\|pmid=19675096\|pmc=2752623\|doi=10.1093/bioinformatics/btp486}} \|
PRIMEX \| Indexes the genome with a k-mer lookup table with full sensitivity up to an adjustable number of mismatches. It is best for mapping 15-60 bp sequences to a genome. \| {{no}} \| {{no}} \| {{yes}} \| {{no}}, multiple processes per search \| \| [https://www.researchgate.net/publication/8966263_PRIMEX_rapid_identification_of_oligonucleotide_matches_in_whole_genomes] \| 2003
QPalma \| Can use quality scores, intron lengths, and computation splice site predictions to perform and performs an unbiased alignment. Can be trained to the specifics of a RNA-seq experiment and genome. Useful for splice site/intron discovery and for gene model building. (See PALMapper for a faster version). \| \| \| \| {{yes}}, client-server \| {{free}}, GPL 2 \| \|
RazerS \| No read length limit. Hamming or edit distance mapping with configurable error rates. Configurable and predictable sensitivity (runtime/sensitivity tradeoff). Supports paired-end read mapping. \| \| \| \| \| {{free}}, LGPL \| \|
REAL, cREAL \| REAL is an efficient, accurate, and sensitive tool for aligning short reads obtained from next-generation sequencing. The programme can handle an enormous amount of single-end reads generated by the next-generation Illumina/Solexa Genome Analyzer. cREAL is a simple extension of REAL for aligning short reads obtained from next-generation sequencing to a genome with circular structure. \| \| {{yes}} \| \| {{yes}} \| {{free}}, GPL \| \|
RMAP \| Can map reads with or without error probability information (quality scores) and supports paired-end reads or bisulfite-treated read mapping. There are no limitations on read length or number of mismatches. \| {{yes}} \| {{yes}} \| {{yes}} \| \| {{free}}, GPL 3 \| \|
rNA \| A randomized Numerical Aligner for Accurate alignment of NGS reads \| {{yes}} \| {{partial\|Low quality bases trimming}} \| {{yes}} \| Multithreading and MPI-enabled \| {{free}}, GPL 3 \| \|
RTG Investigator \| Extremely fast, tolerant to high indel and substitution counts. Includes full read alignment. Product includes comprehensive pipelines for variant detection and metagenomic analysis with any combination of Illumina, Complete Genomics and Roche 454 data. \| {{yes}} \| {{yes}}, for variant calling \| {{yes}} \| {{yes}} \| {{proprietary}}, freeware for individual investigator use \| \|
Segemehl \| Can handle insertions, deletions, mismatches; uses enhanced suffix arrays \| {{yes}} \| {{no}} \| {{yes}} \| {{yes}} \| {{proprietary}}, freeware for noncommercial use \|{{cite journal\|last1=Searls\|first1=David B.\|last2=Hoffmann\|first2=Steve\|last3=Otto\|first3=Christian\|last4=Kurtz\|first4=Stefan\|last5=Sharma\|first5=Cynthia M.\|last6=Khaitovich\|first6=Philipp\|last7=Vogel\|first7=Jörg\|last8=Stadler\|first8=Peter F.\|last9=Hackermüller\|first9=Jörg\|title=Fast Mapping of Short Sequences with Mismatches, Insertions and Deletions Using Index Structures\|journal=PLOS Computational Biology\|volume=5\|issue=9\|year=2009\|pages=e1000502\|issn=1553-7358\|doi=10.1371/journal.pcbi.1000502\|pmid=19750212\|pmc=2730575\|bibcode=2009PLSCB...5E0502H \|doi-access=free }} \|2009
SeqMap \| Up to 5 mixed substitutions and insertions-deletions; various tuning options and input-output formats \| \| \| \| \| {{proprietary}}, freeware for academic and noncommercial use \| \|
Shrec \| Short read error correction with a suffix tree data structure \| \| \| \| {{yes}}, Java \| \| \|
SHRiMP \| Indexes the reference genome as of version 2. Uses masks to generate possible keys. Can map ABI SOLiD color space reads. \| {{yes}} \| {{yes}} \| {{yes}} \| {{yes}}, OpenMP \| {{free}}, [[BSD licenses \| {{free}}, BSD]] derivative \| {{cite journal\|last1=Rumble\|first1=Stephen M.\|last2=Lacroute\|first2=Phil\|last3=Dalca\|first3=Adrian V.\|last4=Fiume\|first4=Marc\|last5=Sidow\|first5=Arend\|last6=Brudno\|first6=Michael\|title=SHRiMP: Accurate Mapping of Short Color-space Reads\|journal=PLOS Computational Biology \|volume=5\|issue=5\|year=2009\|pages=e1000386\|pmid=19461883\|pmc=2678294\|doi=10.1371/journal.pcbi.1000386\|bibcode=2009PLSCB...5E0386R \|doi-access=free }} {{cite journal\|last1=David\|first1=Matei\|last2=Dzamba\|first2=Misko\|last3=Lister\|first3=Dan\|last4= Ilie\|first4=Lucian\|last5=Brudno\|first5=Michael\|title=SHRiMP2: Sensitive yet Practical Short Read Mapping\|journal=Bioinformatics\|volume=27\|issue=7\|year=2011\|pages=1011–1012\|pmid=21278192\|doi=10.1093/bioinformatics/btr046\|doi-access=free}} \| 2009-2011
SLIDER \| Slider is an application for the Illumina Sequence Analyzer output that uses the "probability" files instead of the sequence files as an input for alignment to a reference sequence or a set of reference sequences. \| {{yes}} \| {{yes}} \| {{No}} \| {{No}} \| \|{{cite journal\|last1=Malhis\|first1=Nawar\|last2=Butterfield\|first2=Yaron S. N.\|last3=Ester\|first3=Martin\|last4=Jones\|first4=Steven J. M.\|title=Slider – Maximum use of probability information for alignment of short sequence reads and SNP detection\|journal=Bioinformatics \|volume=25\|issue=1\|year=2009\|pages=6–13\|pmid=18974170\|pmc=2638935\|doi=10.1093/bioinformatics/btn565}}{{cite journal\|last1=Malhis\|first1=Nawar\|last2=Jones\|first2=Steven J. M.\|title=High Quality SNP Calling Using Illumina Data at Shallow Coverage\|journal=Bioinformatics \|volume=26\|issue=8\|year=2010\|pages=1029–1035\|pmid=20190250\|doi=10.1093/bioinformatics/btq092\|doi-access=}} \| 2009-2010
SOAP, SOAP2, SOAP3, SOAP3-dp \| SOAP: robust with a small (1-3) number of gaps and mismatches. Speed improvement over BLAT, uses a 12 letter hash table. SOAP2: using bidirectional BWT to build the index of reference, and it is much faster than the first version. SOAP3: GPU-accelerated version that could find all 4-mismatch alignments in tens of seconds per one million reads. SOAP3-dp, also GPU accelerated, supports arbitrary number of mismatches and gaps according to affine gap penalty scores. \| {{yes}} \| {{no}} \| {{yes}}, SOAP3-dp \| {{yes}}, POSIX Threads; SOAP3, SOAP3-dp need GPU with CUDA support \| {{free}}, GPL \|{{cite journal\|last1=Li\|first1=R.\|last2=Li\|first2=Y.\|last3=Kristiansen\|first3=K.\|last4=Wang\|first4=J.\|title=SOAP: short oligonucleotide alignment program\|journal=Bioinformatics\|volume=24\|issue=5\|year=2008\|pages=713–714\|issn=1367-4803\|doi=10.1093/bioinformatics/btn025\|pmid=18227114\|doi-access=free}}{{cite journal\|last1=Li\|first1=R.\|last2=Yu\|first2=C.\|last3=Li\|first3=Y.\|last4=Lam\|first4=T.-W.\|last5=Yiu\|first5=S.-M.\|last6=Kristiansen\|first6=K.\|last7=Wang\|first7=J.\|title=SOAP2: an improved ultrafast tool for short read alignment\|journal=Bioinformatics\|volume=25\|issue=15\|year=2009\|pages=1966–1967\|issn=1367-4803\|doi=10.1093/bioinformatics/btp336\|pmid=19497933\|doi-access=}} \|
SOCS \| For ABI SOLiD technologies. Significant increase in time to map reads with mismatches (or color errors). Uses an iterative version of the Rabin-Karp string search algorithm. \| \| \| \| {{yes}} \| {{free}}, GPL \| \|
SparkBWA \| Integrates the Burrows–Wheeler Aligner (BWA) on an Apache Spark framework running atop Hadoop. Version 0.2 of October 2016, supports the algorithms BWA-MEM, BWA-backtrack, and BWA-ALN. All of them work with single-reads and paired-end reads. \| {{yes}} \| {{partial\|Low quality bases trimming}} \| {{yes}} \| {{yes}} \| {{free}}, GPL 3 \|{{Cite journal\|last1=Abuín\|first1=José M.\|last2=Pichel\|first2=Juan C.\|last3=Pena\|first3=Tomás F.\|last4=Amigo\|first4=Jorge\|date=2016-05-16\|title=SparkBWA: Speeding Up the Alignment of High-Throughput DNA Sequencing Data\|journal=PLOS ONE\|volume=11\|issue=5\|pages=e0155461\|doi=10.1371/journal.pone.0155461\|issn=1932-6203\|pmid=27182962\|pmc=4868289\|bibcode=2016PLoSO..1155461A\|doi-access=free}} \|2016
SSAHA, SSAHA2 \| Fast for a small number of variants \| \| \| \| \| {{proprietary}}, freeware for academic and noncommercial use \| \|
Stampy \| For Illumina reads. High specificity, and sensitive for reads with indels, structural variants, or many SNPs. Slow, but speed increased dramatically by using BWA for first alignment pass. \| {{yes}} \| {{yes}} \| {{yes}} \| {{no}} \| {{proprietary}}, freeware for academic and noncommercial use \|{{cite journal\|last1=Lunter\|first1=G.\|last2=Goodson\|first2=M.\|title=Stampy: A statistical algorithm for sensitive and fast mapping of Illumina sequence reads\|journal=Genome Research\|volume=21\|issue=6\|year=2010\|pages=936–939\|issn=1088-9051\|doi=10.1101/gr.111120.110\|pmid=20980556\|pmc=3106326}} \| 2010
SToRM \| For Illumina or ABI SOLiD reads, with SAM native output. Highly sensitive for reads with many errors, indels (full from 0 to 15, extended support otherwise). Uses spaced seeds (single hit) and a very fast SSE-SSE2-AVX2-AVX-512 banded alignment filter. For fixed-length reads only, authors recommend SHRiMP2 otherwise. \| {{no}} \| {{yes}} \| {{yes}} \| {{yes}}, OpenMP \| {{free}} \|{{cite journal\|last1=Noe\|first1=L.\|last2=Girdea\|first2=M.\|last3=Kucherov\|first3=G.\|title=Designing efficient spaced seeds for SOLiD read mapping\|journal=Advances in Bioinformatics\|volume=2010\|year=2010\|page=708501\|pmid=20936175\|pmc=2945724\|doi=10.1155/2010/708501\|doi-access=free}} \|2010
Subread, Subjunc \| Superfast and accurate read aligners. Subread can be used to map both gDNA-seq and RNA-seq reads. Subjunc detects exon-exon junctions and maps RNA-seq reads. They employ a novel mapping paradigm named seed-and-vote. \| {{yes}} \| {{yes}} \| {{yes}} \| {{yes}} \| {{free}}, GPL 3 \| \|
Taipan \| De-novo assembler for Illumina reads \| \| \| \| \| {{proprietary}}, freeware for academic and noncommercial use \| \|
UGENE \| Visual interface both for Bowtie and BWA, and an embedded aligner \| {{yes}} \| {{yes}} \| {{yes}} \| {{yes}} \| {{free}}, GPL \| \|
VelociMapper \| FPGA-accelerated reference sequence alignment mapping tool from TimeLogic. Faster than Burrows–Wheeler transform-based algorithms like BWA and Bowtie. Supports up to 7 mismatches and/or indels with no performance penalty. Produces sensitive Smith–Waterman gapped alignments. \| {{yes}} \| {{yes}} \| {{yes}} \| {{yes}} \| {{proprietary}}, commercial \| \|
XpressAlign \| FPGA based sliding window short read aligner which exploits the embarrassingly parallel property of short read alignment. Performance scales linearly with number of transistors on a chip (i.e. performance guaranteed to double with each iteration of Moore's Law without modification to algorithm). Low power consumption is useful for datacentre equipment. Predictable runtime. Better price/performance than software sliding window aligners on current hardware, but not better than software BWT-based aligners currently. Can manage large numbers (>2) of mismatches. Will find all hit positions for all seeds. Single-FPGA experimental version, needs work to develop it into a multi-FPGA production version. \| \| \| \| \| {{proprietary}}, freeware for academic and noncommercial use \| \|
ZOOM \| 100% sensitivity for a reads between 15 and 240 bp with practical mismatches. Very fast. Support insertions and deletions. Works with Illumina & SOLiD instruments, not 454. \| \| \| \| {{partial\|Yes (GUI), no (CLI)}} \| {{proprietary}}, commercial \| {{cite journal\|last1=Lin\|first1=H.\|last2=Zhang\|first2=Z.\|last3=Zhang\|first3=M.Q.\|last4=Ma\|first4=B.\|last5=Li\|first5=M.\|title=ZOOM! Zillions of oligos mapped\|journal=Bioinformatics\|volume=24\|issue=21\|year=2008\|pages=2431–2437\|pmid=18684737\|pmc=2732274\|doi=10.1093/bioinformatics/btn416}} \|

class="wikitable sortable"

Name

! Description

! paired-end option

! Use FASTQ quality

! Gapped

! Multi-threaded

! License

! Reference

! Year

Arioc

| Computes Smith-Waterman gapped alignments and mapping qualities on one or more GPUs. Supports BS-seq alignments. Processes 100,000 to 500,000 reads per second (varies with data, hardware, and configured sensitivity).

| {{yes}}

| {{no}}

| {{yes}}

| {{free}}, BSD

|{{cite journal|last1=Wilton|first1=Richard|last2=Budavari|first2=Tamas|last3=Langmead|first3=Ben|last4=Wheelan|first4=Sarah J.|last5=Salzberg|first5=Steven L.|last6=Szalay|first6=Alexander S.|title=Arioc: high-throughput read alignment with GPU-accelerated exploration of the seed-and-extend search space|journal=PeerJ|volume=3|pages=e808|year=2015|doi=10.7717/peerj.808|pmid=25780763|pmc=4358639 |doi-access=free }}

| 2015

BarraCUDA

| A GPGPU accelerated Burrows–Wheeler transform (FM-index) short read alignment program based on BWA, supports alignment of indels with gap openings and extensions.

| {{yes}}

| {{no}}

| {{yes}}

| {{yes}}, POSIX Threads and CUDA

| {{free}}, GPL

BBMap

| Uses a short kmers to rapidly index genome; no size or scaffold count limit. Higher sensitivity and specificity than Burrows–Wheeler aligners, with similar or greater speed. Performs affine-transform-optimized global alignment, which is slower but more accurate than Smith-Waterman. Handles Illumina, 454, PacBio, Sanger, and Ion Torrent data. Splice-aware; capable of processing long indels and RNA-seq. Pure Java; runs on any platform. Used by the Joint Genome Institute.

| {{yes}}

| {{free}}, BSD

| 2010

BFAST

| Explicit time and accuracy tradeoff with a prior accuracy estimation, supported by indexing the reference sequences. Optimally compresses indexes. Can handle billions of short reads. Can handle insertions, deletions, SNPs, and color errors (can map ABI SOLiD color space reads). Performs a full Smith Waterman alignment.

| {{yes}}, POSIX Threads

| {{free}}, GPL

|{{cite journal|last1=Homer|first1=Nils|last2=Merriman|first2=Barry|last3=Nelson|first3=Stanley F.|title=BFAST: An Alignment Tool for Large Scale Genome Resequencing|journal=PLOS ONE|volume=4|issue=11|year=2009|pages=e7767|pmid=19907642|pmc=2770639|doi=10.1371/journal.pone.0007767|bibcode=2009PLoSO...4.7767H|doi-access=free}}

| 2009

BigBWA

| Runs the Burrows–Wheeler Aligner-BWA on a Hadoop cluster. It supports the algorithms BWA-MEM, BWA-ALN, and BWA-SW, working with paired and single reads. It implies an important reduction in the computational time when running in a Hadoop cluster, adding scalability and fault-tolerance.

| {{yes}}

| {{partial|Low quality bases trimming}}

| {{yes}}

| {{free}}, GPL 3

|{{cite journal|last1=Abuín|first1=J.M.|last2=Pichel|first2=J.C.|last3=Pena|first3=T.F.|last4=Amigo|first4=J.|title=BigBWA: approaching the Burrows–Wheeler aligner to Big Data technologies|journal=Bioinformatics|year=2015|doi=10.1093/bioinformatics/btv506|pmid=26323715|volume=31|issue=24|pages=4003–5|doi-access=free}}

| 2015

BLASTN

| BLAST's nucleotide alignment program, slow and not accurate for short reads, and uses a sequence database (EST, Sanger sequence) rather than a reference genome.

BLAT

| Made by Jim Kent. Can handle one mismatch in initial alignment step.

| {{yes}}, client-server

| {{proprietary}}, freeware for academic and noncommercial use

|{{cite journal|last1=Kent|first1=W. J.|title=BLAT---The BLAST-Like Alignment Tool|journal=Genome Research|volume=12|issue=4|year=2002|pages=656–664|issn=1088-9051|doi=10.1101/gr.229202|pmid=11932250|pmc=187518}}

|2002

Bowtie

| Uses a Burrows–Wheeler transform to create a permanent, reusable index of the genome; 1.3 GB memory footprint for human genome. Aligns more than 25 million Illumina reads in 1 CPU hour. Supports Maq-like and SOAP-like alignment policies

| {{yes}}

| {{no}}

| {{yes}}, POSIX Threads

| {{free}}, Artistic

|{{cite journal|last1=Langmead|first1=Ben|last2=Trapnell|first2=Cole|last3=Pop|first3=Mihai|last4=Salzberg|first4=Steven L|title=Ultrafast and memory-efficient alignment of short DNA sequences to the human genome|journal=Genome Biology|volume=10|issue=3|year=2009|pages=R25|issn=1465-6906|doi=10.1186/gb-2009-10-3-r25|pmid=19261174|pmc=2690996 |doi-access=free }}

|2009

BWA

| Uses a Burrows–Wheeler transform to create an index of the genome. It's a bit slower than Bowtie but allows indels in alignment.

| {{yes}}

| {{partial|Low quality bases trimming}}

| {{yes}}

| {{free}}, GPL

|{{cite journal|last1=Li|first1=H.|last2=Durbin|first2=R.|title=Fast and accurate short read alignment with Burrows–Wheeler transform|journal=Bioinformatics|volume=25|issue=14|year=2009|pages=1754–1760|issn=1367-4803|doi=10.1093/bioinformatics/btp324|pmid=19451168|pmc=2705234}}

| 2009

BWA-PSSM

| A probabilistic short read aligner based on the use of position specific scoring matrices (PSSM). The aligner is adaptable in the sense that it can take into account the quality scores of the reads and models of data specific biases, such as those observed in Ancient DNA, PAR-CLIP data or genomes with biased nucleotide compositions.

| {{yes}}

| {{free}}, GPL

|{{cite journal|last1=Kerpedjiev|first1=Peter|last2=Frellsen|first2=Jes|last3=Lindgreen|first3=Stinus|last4=Krogh|first4=Anders|title=Adaptable probabilistic mapping of short reads using position specific scoring matrices|journal=BMC Bioinformatics|volume=15|issue=1|year=2014|page=100|issn=1471-2105|doi=10.1186/1471-2105-15-100|pmid=24717095|pmc=4021105 |doi-access=free }}

| 2014

CASHX

| Quantify and manage large quantities of short-read sequence data. CASHX pipeline contains a set of tools that can be used together, or separately as modules. This algorithm is very accurate for perfect hits to a reference genome.

| {{no}}

| {{proprietary}}, freeware for academic and noncommercial use

Cloudburst

| Short-read mapping using Hadoop MapReduce

| {{yes}}, Hadoop MapReduce

| {{free}}, Artistic

CUDA-EC

| Short-read alignment error correction using GPUs.

| {{yes}}, GPU enabled

CUSHAW

| A CUDA compatible short read aligner to large genomes based on Burrows–Wheeler transform

| {{yes}}

| {{no}}

| {{yes}} (GPU enabled)

| {{free}}, GPL

|{{cite journal|last1=Liu|first1=Y.|last2=Schmidt|first2=B.|last3=Maskell|first3=D. L.|title=CUSHAW: a CUDA compatible short read aligner to large genomes based on the Burrows–Wheeler transform|journal=Bioinformatics|volume=28|issue=14|year=2012|pages=1830–1837|issn=1367-4803|doi=10.1093/bioinformatics/bts276|pmid=22576173|doi-access=free}}

|2012

CUSHAW2

| Gapped short-read and long-read alignment based on maximal exact match seeds. This aligner supports both base-space (e.g. from Illumina, 454, Ion Torrent and PacBio sequencers) and ABI SOLiD color-space read alignments.

| {{yes}}

| {{no}}

| {{yes}}

| {{free}}, GPL

| 2014

CUSHAW2-GPU

| GPU-accelerated CUSHAW2 short-read aligner.

| {{yes}}

| {{no}}

| {{yes}}

| {{free}}, GPL

CUSHAW3

| Sensitive and accurate base-space and color-space short-read alignment with hybrid seeding

| {{yes}}

| {{no}}

| {{yes}}

| {{free}}, GPL

|{{cite journal|last1=Liu|first1=Y.|last2=Schmidt|first2=B.|title=Long read alignment based on maximal exact match seeds|journal=Bioinformatics|volume=28|issue=18|year=2012|pages=i318–i324|issn=1367-4803|doi=10.1093/bioinformatics/bts414|pmid=22962447|pmc=3436841}}

| 2012

drFAST

| Read mapping alignment software that implements cache obliviousness to minimize main/cache memory transfers like mrFAST and mrsFAST, however designed for the SOLiD sequencing platform (color space reads). It also returns all possible map locations for improved structural variation discovery.

| {{yes}}

| {{yes}}, for structural variation

| {{yes}}

| {{no}}

| {{free}}, BSD

ELAND

| Implemented by Illumina. Includes ungapped alignment with a finite read length.

ERNE

| Extended Randomized Numerical alignEr for accurate alignment of NGS reads. It can map bisulfite-treated reads.

| {{yes}}

| {{partial|Low quality bases trimming}}

| {{yes}}

| Multithreading and MPI-enabled

| {{free}}, GPL 3

GASSST

| Finds global alignments of short DNA sequences against large DNA banks

| Multithreading

| CeCILL version 2 License.

|{{cite journal|last1=Rizk|first1=Guillaume|last2=Lavenier|first2=Dominique|title=GASSST: global alignment short sequence search tool|journal=Bioinformatics|volume=26|issue=20|year=2010|pages=2534–2540|pmid=20739310|pmc=2951093|doi=10.1093/bioinformatics/btq485}}

| 2011

[https://github.com/smarco/gem3-mapper GEM]

| High-quality alignment engine (exhaustive mapping with substitutions and indels). More accurate and several times faster than BWA or Bowtie 1/2. Many standalone biological applications (mapper, split mapper, mappability, and other) provided.

| {{yes}}

| {{free}}, GPL3

|{{cite journal|last1=Marco-Sola|first1=Santiago|last2=Sammeth|first2=Michael|last3=Guigó|first3=Roderic|last4=Ribeca|first4=Paolo|title=The GEM mapper: fast, accurate and versatile alignment by filtration|journal=Nature Methods|volume=9|issue=12|year=2012|pages=1185–1188|issn=1548-7091|doi=10.1038/nmeth.2221|pmid=23103880|s2cid=2004416}}

|2012

Genalice MAP

| Ultra fast and comprehensive NGS read aligner with high precision and small storage footprint.

| {{yes}}

| {{partial|Low quality bases trimming}}

| {{yes}}

| {{proprietary}}, commercial

Geneious Assembler

| Fast, accurate overlap assembler with the ability to handle any combination of sequencing technology, read length, any pairing orientations, with any spacer size for the pairing, with or without a reference genome.

| {{yes}}

| {{proprietary}}, commercial

GensearchNGS

| Complete framework with user-friendly GUI to analyse NGS data. It integrates a proprietary high quality alignment algorithm and plug-in ability to integrate various public aligner into a framework allowing to import short reads, align them, detect variants, and generate reports. It is made for resequencing projects, namely in a diagnostic setting.

| {{yes}}

| {{no}}

| {{yes}}

| {{proprietary}}, commercial

GMAP and GSNAP

| Robust, fast short-read alignment. GMAP: longer reads, with multiple indels and splices (see entry above under Genomics analysis); GSNAP: shorter reads, with one indel or up to two splices per read. Useful for digital gene expression, SNP and indel genotyping. Developed by Thomas Wu at Genentech. Used by the National Center for Genome Resources (NCGR) in Alpheus.

| {{yes}}

| {{proprietary}}, freeware for academic and noncommercial use

GNUMAP

| Accurately performs gapped alignment of sequence data obtained from next-generation sequencing machines (specifically of Solexa-Illumina) back to a genome of any size. Includes adaptor trimming, SNP calling and Bisulfite sequence analysis.

| {{yes}}, also supports Illumina *_int.txt and *_prb.txt files with all 4 quality scores for each base

| Multithreading and MPI-enabled

|{{cite journal|last1=Clement|first1=N. L.|last2=Snell|first2=Q.|last3=Clement|first3=M. J.|last4=Hollenhorst|first4=P. C.|last5=Purwar|first5=J.|last6=Graves|first6=B. J.|last7=Cairns|first7=B. R.|last8=Johnson|first8=W. E.|title=The GNUMAP algorithm: unbiased probabilistic mapping of oligonucleotides from next-generation sequencing|journal=Bioinformatics|volume=26|issue=1|year=2009|pages=38–45|issn=1367-4803|doi=10.1093/bioinformatics/btp614|pmid=19861355|pmc=6276904}}

|2009

HIVE-hexagon

| Uses a hash table and bloom matrix to create and filter potential positions on the genome. For higher efficiency uses cross-similarity between short reads and avoids realigning non unique redundant sequences. It is faster than Bowtie and BWA and allows indels and divergent sensitive alignments on viruses, bacteria, and more conservative eukaryotic alignments.

| {{yes}}

| {{proprietary}}, freeware for academic and noncommercial users registered to HIVE deployment instance

|{{cite journal|last1=Santana-Quintero|first1=Luis|last2=Dingerdissen|first2=Hayley|last3=Thierry-Mieg|first3=Jean|last4=Mazumder|first4=Raja|last5=Simonyan|first5=Vahan|title=HIVE-Hexagon: High-Performance, Parallelized Sequence Alignment for Next-Generation Sequencing Data Analysis|journal=PLOS ONE|year=2014|pages=1754–1760|doi=10.1371/journal.pone.0099033|pmid=24918764|volume=9|issue=6|pmc=4053384|bibcode=2014PLoSO...999033S|doi-access=free}}

| 2014

IMOS

| Improved Meta-aligner and Minimap2 On Spark. A long read distributed aligner on Apache Spark platform with linear scalability w.r.t. single node execution.

| {{yes}}

| {{free}}

Isaac

| Fully uses all the computing power available on one server node; thus, it scales well over a broad range of hardware architectures, and alignment performance improves with hardware abilities

| {{yes}}

| {{free}}, GPL

LAST

| Uses adaptative seeds and copes more efficiently with repeat-rich sequences (e.g. genomes). For example: it can align reads to genomes without repeat-masking, without becoming overwhelmed by repetitive hits.

| {{yes}}

| {{free}}, GPL

|{{cite journal|last1=Kielbasa|first1=S.M.|last2=Wan|first2=R.|last3=Sato|first3=K. |last4=Horton|first4=P.|last5=Frith|first5=M.C.|title=Adaptive seeds tame genomic sequence comparison|journal=Genome Research|volume=21|issue=3|year=2011|pages=487–493|pmid=21209072|pmc=3044862|doi=10.1101/gr.113985.110}}

| 2011

MAQ

| Ungapped alignment that takes into account quality scores for each base.

| {{free}}, GPL

mrFAST, mrsFAST

| Gapped (mrFAST) and ungapped (mrsFAST) alignment software that implements cache obliviousness to minimize main/cache memory transfers. They are designed for the Illumina sequencing platform and they can return all possible map locations for improved structural variation discovery.

| {{yes}}

| {{yes}}, for structural variation

| {{yes}}

| {{no}}

| {{free}}, BSD

MOM

| MOM or maximum oligonucleotide mapping is a query matching tool that captures a maximal length match within the short read.

| {{yes}}

MOSAIK

| Fast gapped aligner and reference-guided assembler. Aligns reads using a banded Smith-Waterman algorithm seeded by results from a k-mer hashing scheme. Supports reads ranging in size from very short to very long.

| {{yes}}

MPscan

| Fast aligner based on a filtration strategy (no indexing, use q-grams and Backward Nondeterministic DAWG Matching)

| {{cite book|last1=Rivals|first1=Eric|last2=Salmela|first2=Leena|last3=Kiiskinen|first3=Petteri|last4=Kalsi|first4=Petri|last5=Tarhio|first5=Jorma|title=Algorithms in Bioinformatics |chapter=Mpscan: Fast Localisation of Multiple Reads in Genomes |volume=5724|year=2009|pages= 246–260|doi=10.1007/978-3-642-04241-6_21|series=Lecture Notes in Computer Science|bibcode=2009LNCS.5724..246R|isbn=978-3-642-04240-9|citeseerx=10.1.1.156.928|s2cid=17187140 }}

| 2009

Novoalign & NovoalignCS

| Gapped alignment of single end and paired end Illumina GA I & II, ABI Colour space & ION Torrent reads. High sensitivity and specificity, using base qualities at all steps in the alignment. Includes adapter trimming, base quality calibration, Bi-Seq alignment, and options for reporting multiple alignments per read. Use of ambiguous IUPAC codes in reference for common SNPs can improve SNP recall and remove allelic bias.

| {{yes}}

| Multi-threading and MPI versions available with paid license

| {{proprietary}}, freeware single threaded version for academic and noncommercial use

NextGENe

| Developed for use by biologists performing analysis of next generation sequencing data from Roche Genome Sequencer FLX, Illumina GA/HiSeq, Life Technologies Applied BioSystems’ SOLiD System, PacBio and Ion Torrent platforms.

| {{yes}}

| {{proprietary}}, commercial

NextGenMap

| Flexible and fast read mapping program (twice as fast as BWA), achieves a mapping sensitivity comparable to Stampy. Internally uses a memory efficient index structure (hash table) to store positions of all 13-mers present in the reference genome. Mapping regions where pairwise alignments are required are dynamically determined for each read. Uses fast SIMD instructions (SSE) to accelerate alignment calculations on CPU. If available, alignments are computed on GPU (using OpenCL/CUDA) further reducing runtime 20-50%.

| {{yes}}

| {{no}}

| {{yes}}

| {{yes}}, POSIX Threads, OpenCL/CUDA, SSE

| {{free}}

|{{cite journal|last1=Sedlazeck|first1=Fritz J.|last2=Rescheneder|first2=Philipp|last3=von Haeseler|first3=Arndt|title=NextGenMap: fast and accurate read mapping in highly polymorphic genomes|journal=Bioinformatics|volume=29|issue=21|year=2013|pages=2790–2791|pmid=23975764|doi=10.1093/bioinformatics/btt468|doi-access=free}}

| 2013

Omixon Variant Toolkit

| Includes highly sensitive and highly accurate tools for detecting SNPs and indels. It offers a solution to map NGS short reads with a moderate distance (up to 30% sequence divergence) from reference genomes. It poses no restrictions on the size of the reference, which, combined with its high sensitivity, makes the Variant Toolkit well-suited for targeted sequencing projects and diagnostics.

| {{yes}}

| {{proprietary}}, commercial

PALMapper

| Efficiently computes both spliced and unspliced alignments at high accuracy. Relying on a machine learning strategy combined with a fast mapping based on a banded Smith-Waterman-like algorithm, it aligns around 7 million reads per hour on one CPU. It refines the originally proposed QPALMA approach.

| {{yes}}

| {{free}}, GPL

Partek Flow

| For use by biologists and bioinformaticians. It supports ungapped, gapped and splice-junction alignment from single and paired-end reads from Illumina, Life technologies Solid TM, Roche 454 and Ion Torrent raw data (with or without quality information). It integrates powerful quality control on FASTQ/Qual level and on aligned data. Additional functionality include trimming and filtering of raw reads, SNP and InDel detection, mRNA and microRNA quantification and fusion gene detection.

| {{yes}}

| Multiprocessor-core, client-server installation possible

| {{proprietary}}, commercial, free trial version

PASS

| Indexes the genome, then extends seeds using pre-computed alignments of words. Works with base space, color space (SOLID), and can align genomic and spliced RNA-seq reads.

| {{yes}}

| {{proprietary}}, freeware for academic and noncommercial use

PerM

| Indexes the genome with periodic seeds to quickly find alignments with full sensitivity up to four mismatches. It can map Illumina and SOLiD reads. Unlike most mapping programs, speed increases for longer read lengths.

| {{yes}}

| {{free}}, GPL

|{{cite journal|last1=Chen|first1=Yangho|last2=Souaiaia|first2=Tade|last3=Chen|first3=Ting|title=PerM: efficient mapping of short sequencing reads with periodic full sensitive spaced seeds|journal=Bioinformatics|volume=25|issue=19|year=2009|pages=2514–2521|pmid=19675096|pmc=2752623|doi=10.1093/bioinformatics/btp486}}

PRIMEX

| Indexes the genome with a k-mer lookup table with full sensitivity up to an adjustable number of mismatches. It is best for mapping 15-60 bp sequences to a genome.

| {{no}}

| {{yes}}

| {{no}}, multiple processes per search

| [https://www.researchgate.net/publication/8966263_PRIMEX_rapid_identification_of_oligonucleotide_matches_in_whole_genomes]

| 2003

QPalma

| Can use quality scores, intron lengths, and computation splice site predictions to perform and performs an unbiased alignment. Can be trained to the specifics of a RNA-seq experiment and genome. Useful for splice site/intron discovery and for gene model building. (See PALMapper for a faster version).

| {{yes}}, client-server

| {{free}}, GPL 2

RazerS

| No read length limit. Hamming or edit distance mapping with configurable error rates. Configurable and predictable sensitivity (runtime/sensitivity tradeoff). Supports paired-end read mapping.

| {{free}}, LGPL

REAL, cREAL

| REAL is an efficient, accurate, and sensitive tool for aligning short reads obtained from next-generation sequencing. The programme can handle an enormous amount of single-end reads generated by the next-generation Illumina/Solexa Genome Analyzer. cREAL is a simple extension of REAL for aligning short reads obtained from next-generation sequencing to a genome with circular structure.

| {{yes}}

| {{free}}, GPL

RMAP

| Can map reads with or without error probability information (quality scores) and supports paired-end reads or bisulfite-treated read mapping. There are no limitations on read length or number of mismatches.

| {{yes}}

| {{free}}, GPL 3

rNA

| A randomized Numerical Aligner for Accurate alignment of NGS reads

| {{yes}}

| {{partial|Low quality bases trimming}}

| {{yes}}

| Multithreading and MPI-enabled

| {{free}}, GPL 3

RTG Investigator

| Extremely fast, tolerant to high indel and substitution counts. Includes full read alignment. Product includes comprehensive pipelines for variant detection and metagenomic analysis with any combination of Illumina, Complete Genomics and Roche 454 data.

| {{yes}}

| {{yes}}, for variant calling

| {{yes}}

| {{proprietary}}, freeware for individual investigator use

Segemehl

| Can handle insertions, deletions, mismatches; uses enhanced suffix arrays

| {{yes}}

| {{no}}

| {{yes}}

| {{proprietary}}, freeware for noncommercial use

|{{cite journal|last1=Searls|first1=David B.|last2=Hoffmann|first2=Steve|last3=Otto|first3=Christian|last4=Kurtz|first4=Stefan|last5=Sharma|first5=Cynthia M.|last6=Khaitovich|first6=Philipp|last7=Vogel|first7=Jörg|last8=Stadler|first8=Peter F.|last9=Hackermüller|first9=Jörg|title=Fast Mapping of Short Sequences with Mismatches, Insertions and Deletions Using Index Structures|journal=PLOS Computational Biology|volume=5|issue=9|year=2009|pages=e1000502|issn=1553-7358|doi=10.1371/journal.pcbi.1000502|pmid=19750212|pmc=2730575|bibcode=2009PLSCB...5E0502H |doi-access=free }}

|2009

SeqMap

| Up to 5 mixed substitutions and insertions-deletions; various tuning options and input-output formats

| {{proprietary}}, freeware for academic and noncommercial use

Shrec

| Short read error correction with a suffix tree data structure

| {{yes}}, Java

SHRiMP

| Indexes the reference genome as of version 2. Uses masks to generate possible keys. Can map ABI SOLiD color space reads.

| {{yes}}

| {{yes}}, OpenMP

| {{free}}, [[BSD licenses

| {{free}}, BSD]] derivative

{{cite journal|last1=Rumble|first1=Stephen M.|last2=Lacroute|first2=Phil|last3=Dalca|first3=Adrian V.|last4=Fiume|first4=Marc|last5=Sidow|first5=Arend|last6=Brudno|first6=Michael|title=SHRiMP: Accurate Mapping of Short Color-space Reads|journal=PLOS Computational Biology

|volume=5|issue=5|year=2009|pages=e1000386|pmid=19461883|pmc=2678294|doi=10.1371/journal.pcbi.1000386|bibcode=2009PLSCB...5E0386R |doi-access=free }}

{{cite journal|last1=David|first1=Matei|last2=Dzamba|first2=Misko|last3=Lister|first3=Dan|last4= Ilie|first4=Lucian|last5=Brudno|first5=Michael|title=SHRiMP2: Sensitive yet Practical Short Read Mapping|journal=Bioinformatics|volume=27|issue=7|year=2011|pages=1011–1012|pmid=21278192|doi=10.1093/bioinformatics/btr046|doi-access=free}}

| 2009-2011

SLIDER

| Slider is an application for the Illumina Sequence Analyzer output that uses the "probability" files instead of the sequence files as an input for alignment to a reference sequence or a set of reference sequences.

| {{yes}}

| {{No}}

|{{cite journal|last1=Malhis|first1=Nawar|last2=Butterfield|first2=Yaron S. N.|last3=Ester|first3=Martin|last4=Jones|first4=Steven J. M.|title=Slider – Maximum use of probability information for alignment of short sequence reads and SNP detection|journal=Bioinformatics

|volume=25|issue=1|year=2009|pages=6–13|pmid=18974170|pmc=2638935|doi=10.1093/bioinformatics/btn565}}{{cite journal|last1=Malhis|first1=Nawar|last2=Jones|first2=Steven J. M.|title=High Quality SNP Calling Using Illumina Data at Shallow Coverage|journal=Bioinformatics

|volume=26|issue=8|year=2010|pages=1029–1035|pmid=20190250|doi=10.1093/bioinformatics/btq092|doi-access=}}

| 2009-2010

SOAP, SOAP2, SOAP3, SOAP3-dp

| SOAP: robust with a small (1-3) number of gaps and mismatches. Speed improvement over BLAT, uses a 12 letter hash table. SOAP2: using bidirectional BWT to build the index of reference, and it is much faster than the first version. SOAP3: GPU-accelerated version that could find all 4-mismatch alignments in tens of seconds per one million reads. SOAP3-dp, also GPU accelerated, supports arbitrary number of mismatches and gaps according to affine gap penalty scores.

| {{yes}}

| {{no}}

| {{yes}}, SOAP3-dp

| {{yes}}, POSIX Threads; SOAP3, SOAP3-dp need GPU with CUDA support

| {{free}}, GPL

|{{cite journal|last1=Li|first1=R.|last2=Li|first2=Y.|last3=Kristiansen|first3=K.|last4=Wang|first4=J.|title=SOAP: short oligonucleotide alignment program|journal=Bioinformatics|volume=24|issue=5|year=2008|pages=713–714|issn=1367-4803|doi=10.1093/bioinformatics/btn025|pmid=18227114|doi-access=free}}{{cite journal|last1=Li|first1=R.|last2=Yu|first2=C.|last3=Li|first3=Y.|last4=Lam|first4=T.-W.|last5=Yiu|first5=S.-M.|last6=Kristiansen|first6=K.|last7=Wang|first7=J.|title=SOAP2: an improved ultrafast tool for short read alignment|journal=Bioinformatics|volume=25|issue=15|year=2009|pages=1966–1967|issn=1367-4803|doi=10.1093/bioinformatics/btp336|pmid=19497933|doi-access=}}

SOCS

| For ABI SOLiD technologies. Significant increase in time to map reads with mismatches (or color errors). Uses an iterative version of the Rabin-Karp string search algorithm.

| {{yes}}

| {{free}}, GPL

SparkBWA

| Integrates the Burrows–Wheeler Aligner (BWA) on an Apache Spark framework running atop Hadoop. Version 0.2 of October 2016, supports the algorithms BWA-MEM, BWA-backtrack, and BWA-ALN. All of them work with single-reads and paired-end reads.

| {{yes}}

| {{partial|Low quality bases trimming}}

| {{yes}}

| {{free}}, GPL 3

|{{Cite journal|last1=Abuín|first1=José M.|last2=Pichel|first2=Juan C.|last3=Pena|first3=Tomás F.|last4=Amigo|first4=Jorge|date=2016-05-16|title=SparkBWA: Speeding Up the Alignment of High-Throughput DNA Sequencing Data|journal=PLOS ONE|volume=11|issue=5|pages=e0155461|doi=10.1371/journal.pone.0155461|issn=1932-6203|pmid=27182962|pmc=4868289|bibcode=2016PLoSO..1155461A|doi-access=free}}

|2016

SSAHA, SSAHA2

| Fast for a small number of variants

| {{proprietary}}, freeware for academic and noncommercial use

Stampy

| For Illumina reads. High specificity, and sensitive for reads with indels, structural variants, or many SNPs. Slow, but speed increased dramatically by using BWA for first alignment pass.

| {{yes}}

| {{no}}

| {{proprietary}}, freeware for academic and noncommercial use

|{{cite journal|last1=Lunter|first1=G.|last2=Goodson|first2=M.|title=Stampy: A statistical algorithm for sensitive and fast mapping of Illumina sequence reads|journal=Genome Research|volume=21|issue=6|year=2010|pages=936–939|issn=1088-9051|doi=10.1101/gr.111120.110|pmid=20980556|pmc=3106326}}

| 2010

SToRM

| For Illumina or ABI SOLiD reads, with SAM native output. Highly sensitive for reads with many errors, indels (full from 0 to 15, extended support otherwise). Uses spaced seeds (single hit) and a very fast SSE-SSE2-AVX2-AVX-512 banded alignment filter. For fixed-length reads only, authors recommend SHRiMP2 otherwise.

| {{no}}

| {{yes}}

| {{yes}}, OpenMP

| {{free}}

|{{cite journal|last1=Noe|first1=L.|last2=Girdea|first2=M.|last3=Kucherov|first3=G.|title=Designing efficient spaced seeds for SOLiD read mapping|journal=Advances in Bioinformatics|volume=2010|year=2010|page=708501|pmid=20936175|pmc=2945724|doi=10.1155/2010/708501|doi-access=free}}

|2010

Subread, Subjunc

| Superfast and accurate read aligners. Subread can be used to map both gDNA-seq and RNA-seq reads. Subjunc detects exon-exon junctions and maps RNA-seq reads. They employ a novel mapping paradigm named seed-and-vote.

| {{yes}}

| {{free}}, GPL 3

Taipan

| De-novo assembler for Illumina reads

| {{proprietary}}, freeware for academic and noncommercial use

UGENE

| Visual interface both for Bowtie and BWA, and an embedded aligner

| {{yes}}

| {{free}}, GPL

VelociMapper

| FPGA-accelerated reference sequence alignment mapping tool from TimeLogic. Faster than Burrows–Wheeler transform-based algorithms like BWA and Bowtie. Supports up to 7 mismatches and/or indels with no performance penalty. Produces sensitive Smith–Waterman gapped alignments.

| {{yes}}

| {{proprietary}}, commercial

XpressAlign

| FPGA based sliding window short read aligner which exploits the embarrassingly parallel property of short read alignment. Performance scales linearly with number of transistors on a chip (i.e. performance guaranteed to double with each iteration of Moore's Law without modification to algorithm). Low power consumption is useful for datacentre equipment. Predictable runtime. Better price/performance than software sliding window aligners on current hardware, but not better than software BWT-based aligners currently. Can manage large numbers (>2) of mismatches. Will find all hit positions for all seeds. Single-FPGA experimental version, needs work to develop it into a multi-FPGA production version.

| {{proprietary}}, freeware for academic and noncommercial use

ZOOM

| 100% sensitivity for a reads between 15 and 240 bp with practical mismatches. Very fast. Support insertions and deletions. Works with Illumina & SOLiD instruments, not 454.

| {{partial|Yes (GUI), no (CLI)}}

| {{proprietary}}, commercial

| {{cite journal|last1=Lin|first1=H.|last2=Zhang|first2=Z.|last3=Zhang|first3=M.Q.|last4=Ma|first4=B.|last5=Li|first5=M.|title=ZOOM! Zillions of oligos mapped|journal=Bioinformatics|volume=24|issue=21|year=2008|pages=2431–2437|pmid=18684737|pmc=2732274|doi=10.1093/bioinformatics/btn416}}

References

Seq

Sequence

Sequence alignment software

list of sequence alignment software

Database search only

Pairwise alignment

Multiple sequence alignment

Genomics analysis

Motif finding

Benchmarking

Alignment viewers, editors

Short-read sequence alignment

See also

References