1 .. include:: <s5defs.txt>
9 .. The contents of this directory contain the source
10 for a presentation for the Caltech Bioinformatics Journal club.
12 .. footer:: Caltech Bioinformatics Journal Club
19 Mussa is tool to search for conserved regions between several
20 sequences. Hopefully regions detected as conserved will
21 highlight potentially important DNA sequence features such as
22 cis-regulatory modules, microRNA genes, and exons.
24 Mussa extends previous 2-way sequence comparison to N sequences.
31 Family Relations and Mussa started using the same sequence
32 comparison algorithm but developed in different directions.
34 .. image:: familytree.png
35 :alt: Gratutious software family tree
37 `Family Relations`_ focused on providing a robust usable piece
40 Mussa focused on the N-way algorithm.
42 .. _`Family Relations`: http://cartwheel.caltech.edu/
49 To compute a result Mussa conceptually uses these modules
60 The original seqcomp comparion uses a refinement of a fairly simple
61 algorithm to compare two sequences.
63 Given window of size W and sequences S[0] and S[1]::
65 for x in range(len(S[0])-W):
66 for y in range(len(S[1])-W):
69 if S[0][x+i] == S[1][y+i]:
74 The actual algorithm only needs to compare the base that
75 "slid in" into window, and account for the base that "slid out"
82 Assume that in this case we need 3 matches out of 4
84 .. image:: 4bp_window_no_match.png
86 In this case there are none.
93 Assume that in this case we need 3 matches out of 4
95 .. image:: 4bp_window_match.png
97 However, now that we slid over one position there are now 4
98 and so we would record 0, 5
106 Once one pass is complete one of the sequences is reversed complimented
107 and the process is repeated.
109 .. container:: incremental
111 When extending to more than two sequences, mussa needs to compare
113 (N * (N-1)) / 2 sequences
124 describe the difference between a long distance comparison
125 and multiple closer comparisons. (should use some pictures for that)
134 Should this include pulling things from the tutorial?
135 cover sucking things out of UCSC?