SeqTools

Overview

The SeqTools package contains three tools for visualising sequence alignments: Blixem, Dotter and Belvu.

Blixem is an interactive browser of sequence alignments that have been stacked up in a "master-slave" multiple alignment; it is not a 'true' multiple alignment but a 'one-to-many' alignment. Dotter is a graphical dot-matrix program for detailed comparison of two sequences. Belvu is a multiple sequence alignment viewer and phylogenetic tool with an extensive set of user-configurable modes to color residues. 

Supported platforms

Blixem features

  • Overview section showing the positions of genes and alignments around the alignment window
  • Detail section showing the actual alignment of protein or nucleotide sequences to the genomic DNA sequence.
  • View alignments against both strands of the reference sequence.
  • View sequences in nucleotide or protein mode; in protein mode, Blixem will display the three-frame translation of the reference sequence.
  • Residues are highlighted in different colours depending on whether they are an exact match, conserved substitution or mismatch.
  • Gapped alignments are supported, with insertions and deletions being highlighted in the match sequence.
  • Matches can be sorted and filtered.
  • SNPs and other variations can be highlighted in the reference sequence.
  • Poly(A) tails can be displayed and poly(A) signals highlighted in the reference sequence.

Dotter features

  • Every residue in one sequence is compared to every residue in the other, and a matrix of scores is calculated.
  • One sequence is plotted on the x-axis and the other on the y-axis.
  • Noise is filtered out so that alignments appear as diagonal lines.
  • Pairwise scores are averaged over a sliding window to make the score matrix more intelligible.
  • The averaged score matrix forms a three-dimensional landscape, with the two sequences in two dimensions and the height of the peaks in the third. This landscape is projected onto two dimensions using a grey-scale image - the darker grey of a peak, the higher the score is.
  • The contrast and threshold of the grey-scale image can be adjusted interactively, without having to recalculate the score matrix.
  • An Alignment Tool is provided to examine the sequence alignment that the grey-scale image represents.
  • Known high-scoring pairs can be loaded from a GFF file and overlaid onto the plot.
  • Gene models can be loaded from GFF and displayed alongside the relevant axis.
  • Compare a sequence against itself to find internal repeats.
  • Find overlaps between multiple sequences by making a dot-plot of all sequences versus themselves.
  • Run Dotter in batch mode to create large, time-consuming dot-plots as a background process.

Belvu features

  • View multiple sequence alignments.
  • Residues can be coloured by conservation, with user-configurable cutoffs and colours.
  • Residues can be coloured by residue type (user-configurable).
  • Colour schemes can be imported or exported.
  • Swissprot (or PIR) entries can be fetched by double clicking.
  • The position in the alignment can be easily tracked.
  • Manual deletion of rows and columns.
  • Automatic editing of rows and columns based on customisable criteria:
    • removal of all-gap columns;
    • removal of all gaps;
    • removal of redundant sequences;
    • removal of a column by a user-specified percentage of gaps;
    • filtering of sequences by percent identity;
    • removal of sequences by a user-specified percentage of gaps;
    • removal of partial sequences (those starting or ending with gaps); and
    • removal of columns by conservation (with user-specified upper/lower cutoffs). 
  • The alignment can be saved in Stockholm, Selex, MSF or FASTA format.
  • Distance matrices between sequences can be generated using a variety of distance metrics.
  • Distance matrices can be imported or exported.
  • Phylogenetic trees can be constructed based on various distance-based tree reconstruction algorithms.
  • Trees can be saved in New Hampshire format.
  • Belvu can perform bootstrap phylogenetic reconstruction.

Software pipelines

As well as being used independently, Blixem, Dotter and Belvu can also be called from other tools as part of a software pipeline. A common workflow is to call Blixem from the ZMap genome browser to analyse a set of alignments in more detail, and to call Dotter from within Blixem to give a graphical representation of a particular alignment. Belvu has an extensive set of command-line arguments for specifying processing and output parameters, making it possible to perform complete processes in a single command-line call. See our team page for more information.

Background

Blixem, Dotter and Belvu were originally written as part of the AceDB genome database system. Version 4 of the programs involved an extensive re-write to take advantage of modern GUI toolkits and to separate them from AceDB to form this independent SeqTools package. They can be used independently or with any other tool that outputs data in a suitable format - the current preferred file formats are FASTA and GFF v3 for Blixem and Dotter; a variety of file formats are supported by Belvu.

Links

Download and Installation

Production release

This is the recommended release for most users. It is well-tested, stable and supported code.

Please download the latest version from the FTP site.

Development build

Reasonably stable development code, which contains most of the latest features.

Please download the latest version from the FTP site.

Daily build

Experimental code; not guaranteed to be stable (or even to compile). Should only be used if you require the very latest changes.

Please download the latest version from the FTP site.

Linux

  • Install the following pre-requisites. This is most easily done using your system package manager (e.g. synaptic on Ubuntu, available from the System/Administration menu). Depending on your system, packages may be named differently or additional packages may be required. In this case the ./configure step below will fail and you will need to install the required package before running it again. If you cannot find a package it can be useful to search for just the main part of the name excluding numbers/dashes, e.g. "gtk" or "libgtk" rather than "libgtk2.0-dev". 
libcurl4-gnutls-dev (optional)
libgtk2.0-dev
libglib2.0-dev
libreadline6-dev
libsqlite3-dev      (optional)
  • In the terminal, unpack the downloaded seqtools tar file using the following command, where XXX is the version you downloaded:
tar -xf seqtools-XXX.tar.gz
  • To install in the default location (usually /usr/bin):
cd seqtools-XXX
./configure
make
make install

To install in a different location, or for help with dependencies, see the tips section. For more detailed instructions, see the INSTALL  file in the source code.

Mac OS X

  • Install the following pre-requisites. We strongly recommend that you use MacPorts  to install these packages as this will also install any dependencies for you.
curl    (optional)
gtk2
glib2
sqlite3 (optional)
  • In the terminal, unpack the downloaded seqtools tar file using the following command, where XXX is the version you downloaded:
tar -xf seqtools-XXX.tar.gz
  • To install in the default location (usually /opt/local/bin):
cd seqtools-XXX
./configure
make
make install

To install in a different location, or for help with dependencies, see the tips section. For more detailed instructions, see the INSTALL  file in the source code.

Windows

SeqTools cannot currently run natively on Windows. However, it can be installed and run using Cygwin  (which provides a Linux-like environment on Windows).

  • From the Cygwin website , download setup-x86.exe (or setup-x86_64.exe if you have a 64-bit machine).
  • Run the setup program, accepting all the default options. You will need to choose a mirror on the Choose Download Site page (e.g. cygwin.mirror.constant.com).
  • On the Select Packages page, select these packages:
Devel/gcc-core
Devel/gcc-g++
Devel/libsqlite3-devel  (optional)
Devel/make
Devel/pkg-config
Editors/nano            (optional)
Libs/libcurl-devel      (optional)
Libs/libglib2.0-devel   **also tick SOURCE checkbox**
Libs/libgtk2.0-devel    **also tick SOURCE checkbox**
Libs/libreadline-devel  (or Libs/readline)
Web/wget                (optional)
X11/xinit
  • When installation is complete, open the Cygwin terminal and type the following commands, replacing XXX with the version of SeqTools you want to download:
wget
ftp://ftp.sanger.ac.uk/pub/resources/software/seqtools/PRODUCTION/seqtools-XXX.tar.gz
tar -xf seqtools-XXX.tar.gz
cd seqtools-XXX
./configure
make
make install
  • You need to start an "xterm" terminal to run the SeqTools programs from. This can be done
    by typing the following in the Cygwin terminal:
export DISPLAY=:0; startxwin -- -emulate3buttons &
  • Alternatively, to start the xterm automatically each time you open the Cygwin terminal, 
    you can add the above command to your bash profile. To do this, type the following
    in the Cygwin terminal (you only need to do this once):
echo "DISPLAY=:0; startxwin -- -emulate3buttons &" >> ~/.bash_profile

For further help, see the tips section or the INSTALL file in the source code.

FreeBSD

Please see the ports  page.

Tips

  • You may need to run make install using sudo if you do not have root privileges, i.e.:
sudo make install
  • Alternatively, to install to a different location (e.g. one not requiring root privileges), use the --prefix argument when you run ./configure. For example, the following command would set the install location to foo/bar in your home directory:

./configure --prefix=~/foo/bar
  • If GTK+ is not in the default location then you may need to pass its location to the configure script. GTK+ is usually installed in /usr/lib, /usr/local/lib, /opt/lib or /opt/local/lib. If GTK+ is in e.g. /opt/local/lib then you would call configure with the following arguments:
./configure PKG_CONFIG_PATH=/opt/local/lib/pkgconfig LDFLAGS="-Xlinker -rpath -Xlinker /opt/local/lib"

Learn and Support

Getting started

Run the programs without arguments to see their usage information, or try out the examples given in the examples directory of the source-code download.

For more details, see the README file in the source code.

Help pages

Help pages, including a quick-start guide and user manual, are installed along with the programs. They can be accessed from within the programs using either the Help menu, the lifebuoy icon on the toolbar, or the Ctrl-H keyboard shortcut. They are included in the doc/User_doc directory in the source code..

User manuals

User manuals are installed along with the programs. The manuals for the current production versions can also be downloaded here:

Other documentation

Other documentation, such as design notes, is included in the doc directory in the source-code. It can also be viewed here.

License and Citation

SeqTools is free software and is distributed under the terms of the GNU General Public License.

SeqTools should be credited to "Genome Research Ltd". 

Contact

SeqTools is maintained by the Annosoft team at the Sanger Institute.

If you have a bug or feature request, please raise a ticket by emailing seqtools.

For any other enquires, please email annosoft.

Authors

Sanger Contributors
External Contributors

Publications

  • SeqTools: visual tools for manual analysis of sequence alignments.

    Barson G and Griffiths E

    BMC research notes 2016;9;1;39

  • Scoredist: a simple and robust protein sequence distance estimator.

    Sonnhammer EL and Hollich V

    BMC bioinformatics 2005;6;108

  • A dot-matrix program with dynamic threshold control suited for genomic DNA and protein sequence analysis.

    Sonnhammer EL and Durbin R

    Gene 1995;167;1-2;GC1-10

  • A workbench for large-scale sequence homology analysis.

    Sonnhammer EL and Durbin R

    Computer applications in the biosciences : CABIOS 1994;10;3;301-7

Tool type
Screenshots
  • Blixem in nucleotide mode

  • Blixem in protein mode

  • Dotter in nucleotide mode

  • Dotter in protein mode

  • Belvu in colour-by-conservation mode

  • Belvu in colour-by-residue mode

  • Belvu - Phylogenetic Tree

  • Belvu - Conservation Plot

Related Tools