By Brandon W. King
------------------
-Last updated: March 23rd, 2006
+Last updated: May 18th, 2006
-Updated to Mussagl build: 141
+Updated to Mussagl build: 141 (Update to 200 in progress)
.. contents::
:align: center
+Main Window
+-----------
+
+Overview
+~~~~~~~~
+.. Screenshot with numbers showing features.
+
+.. image:: images/window_overview.png
+ :alt: Mussa Window
+ :align: center
+
+Legend:
+
+ 1. `DNA Sequence (Black bars)`_
+
+ 2. Annotation_
+
+ 3. Motif_
+
+ 4. `Conservation tracks`_
+
+ 5. `Motif Toggle`_
+
+ 6. `Zoom Factor`_ (Base pairs per pixel)
+
+ 7. `Dynamic Threshold`_
+
+ 8. `Sequence Information Bar`_
+
+ 9. `Sequence Scroll Bar`_
+
+
+DNA Sequence (black bars)
+~~~~~~~~~~~~~~~~~~~~~~~~~
+
+.. image:: images/sequence_bar.png
+ :alt: Sequence Bar
+ :align: center
+
+Each of the black bars represents one of the loaded sequences, in this
+case the sequence around the gene 'MCK' in human, mouse, and rabit.
+
+FIXME: Should I mention the repeats here?
+
+
+Annotation
+~~~~~~~~~~
+
+.. figure:: images/annotation.png
+ :alt: Annotation
+ :align: center
+
+ Annotation shown in green on sequence bar.
+
+
+Annotations can be included on any of the sequences using the `Load a
+mussa parameter file`_ method of loading your sequences. You can
+define annotations by location or using an exact subsequence and you
+may also choose any color for display of the annoation; see the
+`Annotation File Format`_ section for details.
+
+Note: Currently there is no way to add annotations using the GUI (only
+via the .mupa file). We plan to add this feature in the future, but it
+likely will not make it into the first release.
+
+
+Motif
+~~~~~
+
+.. figure:: images/motif.png
+ :alt: Motif
+ :align: center
+
+ Motif shown in light blue on sequence bar.
+
+The only real difference between an annotation and motif in mussagl is
+that you can define motifs from within the GUI. See the `Motifs`_
+section for more information.
+
+
+Conservation tracks
+~~~~~~~~~~~~~~~~~~~
+
+.. figure:: images/conservation_tracks.png
+ :alt: Conservation Tracks
+ :align: center
+
+ Conservations tracks shown as red lines between sequence bars.
+
+The red lines between the sequence bars represent conservation between
+the sequences. The amount of sequence conservation shown will depend
+on the relatedness of your sequences and the `dynamic threshold` you
+are using. Sequences with lots of repeats will cause major slow downs
+in calculating the matches.
+
+
+Motif Toggle
+~~~~~~~~~~~~
+
+.. image:: images/motif_toggle.png
+ :alt: Motif Toggle
+ :align: center
+
+Toggles motifs on and off. This will not turn on and off annotations.
+
+Note: As of the current build (#200), this feature hasn't been
+implemented.
+
+
+Zoom Factor
+~~~~~~~~~~~
+
+.. image:: images/zoom_factor.png
+ :alt: Zoom Factor
+ :align: center
+
+The zoom factor represents the number of base pairs represented per
+pixel. When you zoom in far enough the sequence will switch from
+seeing a black bar, representing the sequence, to the actual sequence
+(well, ASCII representation of sequence).
+
+
+Dynamic Threshold
+~~~~~~~~~~~~~~~~~
+
+.. image:: images/dynamic_threshold.png
+ :alt: Dynamic Threshold
+ :align: center
+
+You can dynamically change the threshold for how strong of match you
+consider the conservation to be with one of two options:
+
+ 1. Number of base pair matchs out of window size.
+
+ 2. Percent base pair conservation.
+
+See the Threshold_ section for more infromation.
+
+
+Sequence Information Bar
+~~~~~~~~~~~~~~~~~~~~~~~~
+
+.. image:: images/seq_info_bar.png
+ :alt: Sequence Information Bar
+ :align: center
+
+The sequence infomation bars can be found to the left and right sides
+of mussagl. Next to each sequence you will find the following
+information:
+
+ 1. Species (If it has been defined)
+ 2. Total Size of Sequence
+ 3. Current base pair position
+
+
+Sequence Scroll Bar
+~~~~~~~~~~~~~~~~~~~
+
+.. image:: images/scroll_bar.png
+ :alt: Sequence Scroll Bar
+ :align: center
+
+The scroll bar allows you to scroll through the sequence which is
+useful when you have zoomed in using the `zoom factor`_.
+
+
+Annotations / Motifs
+--------------------
+
+Annotations
+~~~~~~~~~~~
+
+Motifs
+~~~~~~
+
+Load Motifs from File
+*********************
+
+Motif Dialog
+************
+
+
Detailed Info
-------------
Threshold
~~~~~~~~~
-The threshold of an analysis is in minimum number of base pair
-matches must be meet to in order to be kept as a match. Note that you
-can vary the threshold from within Mussagl. For example, if you
-choose a `window size`_ of **30** and a **threshold** of **20** the mussa
-nway transitive algorithm will store all matches that are 20 out of 30
-bp matches or better and pass it on to Mussagl. Mussagl will
-then allow you to dynamically choose a threshold from 10 to 30 base
-pairs. A threshold of 30 bps would only show 30 out of 30 bp
-matches. A threshold of 20 bps would show all matches of 20 out of 30
-bps or better. Choosing a threshold below 20 in this case won't have
-an effect [*]_ because the mussa algorithm didn't report and matches below
-this threshold.
-
-.. [*] In the future, Mussagl will automatically detect the minimum
- threshold which was used when defining an analysis and not allow
- you to select a threshold below the minimum. See `ticket #52
- <http://woldlab.caltech.edu/cgi-bin/mussa/ticket/52>`_ for more
- info.
+The threshold of an analysis is in minimum number of base pair matches
+must be meet to in order to be kept as a match. Note that you can vary
+the threshold from within Mussagl. For example, if you choose a
+`window size`_ of **30** and a **threshold** of **20** the mussa nway
+transitive algorithm will store all matches that are 20 out of 30 bp
+matches or better and pass it on to Mussagl. Mussagl will then allow
+you to dynamically choose a threshold from 20 to 30 base pairs. A
+threshold of 30 bps would only show 30 out of 30 bp matches. A
+threshold of 20 bps would show all matches of 20 out of 30 bps or
+better. If you would like to see results for matches lower than 20 out
+of 30, you will need to rerun the analysis with a lower threshold.
Window Size
~~~~~~~~~~~
-The typical sizes people tend to choose are between 20 and 30. Feel
-free to analysis with this setting depending on your needs.
+The typical sizes people tend to choose are between 20 and 30. You
+will likely need to experiment with this setting depending on your
+needs and input sequence.
Sequences
bp length near 280Kb. Once mussa has run once, you can reload
previously run analyses.
+FIXME: We have learned more about how much sequence and how many to
+put in mussagl, this information should be documented here.
+
Mussa File Formats
------------------
1751 2000 Glorp Glorptype
-.. _motif:
+.. _motif_file_format:
Motif File Format
~~~~~~~~~~~~~~~~~
Format:
<motif> <red> <green> <blue>
+
+Example:
+
GGCC 0.0 1 1
.. _wiki: http://mussa.caltech.edu
.. _build: http://woldlab.caltech.edu/cgi-bin/mussa/wiki/MussaglBuild
.. _fasta: http://en.wikipedia.org/wiki/FASTA_format
+.. _wpDnaMotif: http://en.wikipedia.org/wiki/DNA_motif