Diane's World

I take the 5th


Tue, 06 Dec 2005

Brians Repeats

We're trying to determine if there are repeats near genomic probems.

The first pass file had a large number of repeats in it.

high_genomic_signal_fewer_repeats.txt had those stripped out.

I created a fasta file with the following python code

data = [ x.strip().split('\t') for x in open('high_genomic_signal_fewer_repeats.txt','r')]
open('high_genomic_signal_fewer_repeats.fa','w').writelines([ ">%s|%s\n%s\n" % (x[2],x[0],x[3]) for x in data])

ok yeah I shouldn't do that in so few lines.

Hopefully the following

blastall -p blastn -e 0.000001 -d mouse_34.0.fa -i high_genomic_signal_fewer_repeats.fa -m 7 | tee high_genomic_signal_fewer_repeats.blast.xml

command will generate a new, smaller xml results file.

[20:00] | [] | # | G


menu

About Diane Trout
Blog

calendar

< December 2005 >
SuMoTuWeThFrSa
     1 2 3
4 5 6 7 8 910
11121314151617
18192021222324
25262728293031

categories

/ (123)
  geek/ (37)
    debian/ (2)
    programming/ (1)
    t41/ (6)
    voip/ (1)
  government/ (7)
    tac/ (3)
  life/ (8)
    transhuman/ (1)
    vegan/ (2)
      recipes/ (2)
  study/ (49)
    arma/ (1)
    bi188/ (8)
    cs191a/ (6)
    ese166/ (25)
  work/ (21)
    compclust/ (6)
    sigmoid/ (1)
    talks/ (1)