htsworkflow.git
12 years ago Make antibody & condition visible. [add-condition]
Diane Trout [Wed, 25 May 2011 00:00:02 +0000 (17:00 -0700)]
Make antibody & condition visible.
Also change replicate number to allow null and not autodefault to 1.
Lastly reorganize the fields on the library page to be a bit less confusing.

13 years agoOnly use one name for dates on the page.
Diane Trout [Thu, 17 Feb 2011 23:38:19 +0000 (15:38 -0800)]
Only use one name for dates on the page.
Also, the Django database uses a date field for the creation date
and won't let you format it as an ISO-formatted timestamp,
so I hacked the time portion into the RDFa field.

13 years agoMinor RDFa cleanups
Diane Trout [Thu, 17 Feb 2011 19:51:06 +0000 (11:51 -0800)]
Minor RDFa cleanups
I changed the XMLSchema prefix to "xsd", the one used in the W3C examples,
instead of the "xmls" prefix from some other site's example.

More importantly, the attribute is "content" and not "contents", so
the places where I was trying to replace the human-readable value with
a standardized number or date were being ignored.

Lastly, I figured out how to get Django 1.1 to spit out an
XMLSchema#dateTime value (which is not the same as XMLSchema#datetime).

13 years agoChange default font size to 14pt
Diane Trout [Thu, 17 Feb 2011 19:00:00 +0000 (11:00 -0800)]
Change default font size to 14pt
So many old

13 years agoMerge branch 'master' of mus.cacr.caltech.edu:htsworkflow
Diane Trout [Thu, 17 Feb 2011 08:39:54 +0000 (00:39 -0800)]
Merge branch 'master' of mus.cacr.caltech.edu:htsworkflow

13 years agoAdd RDFa data to our library pages.
Diane Trout [Thu, 17 Feb 2011 08:35:44 +0000 (00:35 -0800)]
Add RDFa data to our library pages.
While doing that I discovered that the library_index page was spitting up
seriously bad HTML, with two DOCTYPE & html tags.
It looks like, between my simplifications and the leftovers of
Brandon's "app" templates, I ended up putting in two
base.html templates.

I stripped out the app templates and tried to simplify the template
inheritance.

As a side effect, the library detail page now has a header and is using
a CSS-based font.

There was also a minor issue where I discovered the paginator was showing
"X libraries libraries" because the view was adding "libraries" to
the changelist result count. (And the change list also adds in the name.)

I needed to replace the default admin pagination template, as
RDFa uses XHTML, and XHTML doesn't support the &nbsp; entity.
(The solution is to use a numeric character reference for nbsp, such as
&#160;, the Unicode code point for non-breaking space.)

13 years agoAdd a small comment defining "results_dir"
Diane Trout [Thu, 17 Feb 2011 08:14:51 +0000 (00:14 -0800)]
Add a small comment defining "results_dir"
because I confused myself

13 years agoAdd in patterns for Encode Elements submission
Diane Trout [Wed, 16 Feb 2011 19:06:34 +0000 (11:06 -0800)]
Add in patterns for Encode Elements submission

13 years agoStarting component to try and figure out what we've submitted.
Diane Trout [Tue, 15 Feb 2011 07:28:58 +0000 (23:28 -0800)]
Starting component to try and figure out what we've submitted.

13 years agoMerge branch 'master' of mus.cacr.caltech.edu:htsworkflow
Diane Trout [Tue, 15 Feb 2011 00:52:35 +0000 (16:52 -0800)]
Merge branch 'master' of mus.cacr.caltech.edu:htsworkflow

13 years agoMake plus/minus bigwig filename extension a bit more lax.
Diane Trout [Tue, 15 Feb 2011 00:49:46 +0000 (16:49 -0800)]
Make plus/minus bigwig filename extension a bit more lax.
*plus.bigwig vs *.plus.bigwig

13 years agoAdd in support for generating md5sums for files
Diane Trout [Mon, 14 Feb 2011 18:42:51 +0000 (10:42 -0800)]
Add in support for generating md5sums for files
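
A minimal sketch of how a per-file md5sum can be computed, not the code from this commit. The function name md5sum and the chunk size are assumptions made for illustration; reading in chunks keeps memory use flat for large sequence files.

```python
import hashlib


def md5sum(path, chunk_size=1024 * 1024):
    """Return the hex MD5 digest of the file at path.

    The name and the 1 MiB chunk size are illustrative assumptions.
    """
    digest = hashlib.md5()
    with open(path, "rb") as stream:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: stream.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

The result has the same hex form that the md5sum command line tool prints in its first column.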

13 years agoAdd in a little bit of qseq2fastq testing code. More is still needed.
Diane Trout [Mon, 7 Feb 2011 23:52:21 +0000 (15:52 -0800)]
Add in a little bit of qseq2fastq testing code. More is still needed.
This one just tests the slice parsing code

13 years agoAdd documentation about which Phred format we're generating
Diane Trout [Mon, 7 Feb 2011 23:51:05 +0000 (15:51 -0800)]
Add documentation about which Phred format we're generating

13 years agoMerge branch 'master' of mus.cacr.caltech.edu:htsworkflow
Diane Trout [Thu, 3 Feb 2011 19:50:12 +0000 (11:50 -0800)]
Merge branch 'master' of mus.cacr.caltech.edu:htsworkflow

13 years agoFirst cycle can be 2 digit.
Diane Trout [Thu, 3 Feb 2011 02:05:29 +0000 (18:05 -0800)]
First cycle can be 2 digit.

13 years agoFix typo in a function name.
Diane Trout [Tue, 25 Jan 2011 01:16:09 +0000 (17:16 -0800)]
Fix typo in a function name.
(Silly character bleed through from a different application)

13 years agoAdd in 12x / 18x PCR protocol end points from Lorian
Diane Trout [Tue, 25 Jan 2011 00:52:39 +0000 (16:52 -0800)]
Add in 12x / 18x PCR protocol end points from Lorian

13 years agoMerge branch 'master' of mus.cacr.caltech.edu:htsworkflow
Diane Trout [Tue, 25 Jan 2011 00:49:22 +0000 (16:49 -0800)]
Merge branch 'master' of mus.cacr.caltech.edu:htsworkflow

13 years agoMerge branch 'master' of mus.cacr.caltech.edu:htsworkflow
root [Tue, 25 Jan 2011 00:48:46 +0000 (16:48 -0800)]
Merge branch 'master' of mus.cacr.caltech.edu:htsworkflow

13 years agoMerge branch 'master' of mus.cacr.caltech.edu:htsworkflow
Diane Trout [Tue, 25 Jan 2011 00:48:10 +0000 (16:48 -0800)]
Merge branch 'master' of mus.cacr.caltech.edu:htsworkflow

13 years agoUpdate view names as per Cricket for the Jan 21st encode submission.
Diane Trout [Fri, 21 Jan 2011 23:23:29 +0000 (15:23 -0800)]
Update view names as per Cricket for the Jan 21st encode submission.
Also we're avoiding submitting the GENCODE v4 models for this
submission so I commented them out

13 years agoMerge branch 'master' of gitosis@mus.cacr.caltech.edu:htsworkflow
Diane Trout [Fri, 21 Jan 2011 22:10:36 +0000 (14:10 -0800)]
Merge branch 'master' of gitosis@mus.cacr.caltech.edu:htsworkflow

13 years agoAdd function to ucsc submission tool to make symlink tree.
Diane Trout [Fri, 21 Jan 2011 22:07:49 +0000 (14:07 -0800)]
Add function to ucsc submission tool to make symlink tree.
I needed a way to read data files from one tree and build archives in
another.

This also changed the condor scripts to tell tar to read the contents
of the file being pointed at by the symlink.

In addition there were slight additions and loosenings of the
file name to ucsc view matching patterns. (As Georgi slightly
changed his names.)
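
The commit does this through tar in the condor scripts; the sketch below shows the same idea with Python's tarfile module instead, which is a swapped-in technique and not what the commit changed. The function name is hypothetical. Setting dereference=True makes the archive store the contents of the file a symlink points at, much like tar's -h/--dereference option.

```python
import os
import tarfile


def archive_following_symlinks(tree_root, archive_path):
    """Write a gzipped tar of tree_root, storing symlink targets' contents.

    Hypothetical helper; the commit itself adjusts tar's arguments
    in the generated condor scripts.
    """
    top_name = os.path.basename(os.path.normpath(tree_root))
    # dereference=True: add the pointed-at file rather than the link itself.
    with tarfile.open(archive_path, "w:gz", dereference=True) as archive:
        archive.add(tree_root, arcname=top_name)
```

This lets the symlink tree live in one place while the data files stay in another.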

13 years ago Make replicate field visible on the public page.
root [Fri, 14 Jan 2011 23:14:15 +0000 (15:14 -0800)]
Make replicate field visible on the public page.
We might want to change the default value for replicate to "None"

13 years agoInclude version.py in source distribution for setup.py
Diane Trout [Wed, 12 Jan 2011 22:40:25 +0000 (14:40 -0800)]
Include version.py in source distribution for setup.py

13 years ago Add validation script [0.5.0]
Diane Trout [Sat, 8 Jan 2011 01:22:36 +0000 (17:22 -0800)]
Add validation script
Current version just validates Illumina phred64 fastq files.
But I can add more later.
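
A rough sketch of what a phred64 fastq check might look like; it is not the validation script added here. It assumes four-line records and that phred64 quality characters are encoded as ASCII 64 plus the score, so anything below "@" (ASCII 64) is out of range. The function name and the problem messages are made up for illustration.

```python
def validate_phred64_fastq(lines):
    """Return a list of (record_number, problem) tuples; empty means valid.

    Assumes unwrapped four-line FASTQ records (an assumption, not
    something the commit message states).
    """
    lines = list(lines)
    problems = []
    for number, start in enumerate(range(0, len(lines), 4), start=1):
        record = [line.rstrip("\n") for line in lines[start:start + 4]]
        if len(record) != 4:
            problems.append((number, "truncated record"))
            continue
        header, sequence, separator, quality = record
        if not header.startswith("@"):
            problems.append((number, "header does not start with @"))
        if not separator.startswith("+"):
            problems.append((number, "separator does not start with +"))
        if len(sequence) != len(quality):
            problems.append((number, "sequence and quality lengths differ"))
        # phred64: ASCII 64 ('@') is score 0; 126 ('~') is the last printable.
        if any(ord(char) < 64 or ord(char) > 126 for char in quality):
            problems.append((number, "quality outside phred64 range"))
    return problems
```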

13 years agoRename all the scripts to start with htsw-
Diane Trout [Tue, 4 Jan 2011 21:29:27 +0000 (13:29 -0800)]
Rename all the scripts to start with htsw-

In preparation for making a debian package I thought
I should try to limit my pollution of the command namespace.

So now everything that's actually an installed script starts
with htsw-. One leftover script, which isn't currently useful but
might be in the future, was left with its old name.

13 years agoAdd utility to fix the off-by-3 error in the amplified from field.
Diane Trout [Tue, 21 Dec 2010 23:53:59 +0000 (15:53 -0800)]
Add utility to fix the off-by-3 error in the amplified from field.

The bug appeared because the library name and library primary key
drifted apart because of some accidental deletions.

I eventually moved us back to using raw primary keys, as that
made constructing Django URLs simpler. However,
I'd apparently forgotten to adjust the amplified from field.

13 years agoMake the inventory login page look a bit nicer.
Diane Trout [Thu, 2 Dec 2010 01:19:07 +0000 (17:19 -0800)]
Make the inventory login page look a bit nicer.
This involved adjusting some of the base django templates.
Whenever given a choice I try to make our pages look more like the
admin site.

13 years agoAdjust some whitespace
Diane Trout [Thu, 2 Dec 2010 01:18:05 +0000 (17:18 -0800)]
Adjust some whitespace

13 years agoThis mildly adjusts the css to be a bit prettier
Diane Trout [Wed, 1 Dec 2010 00:26:51 +0000 (16:26 -0800)]
This mildly adjusts the css to be a bit prettier

13 years agoAdd a new ini-file option to point to the flowcell repository.
Diane Trout [Wed, 1 Dec 2010 00:15:43 +0000 (16:15 -0800)]
Add a new ini-file option to point to the flowcell repository.
It's in the [frontend] section called "results_dir".

13 years agoAdd config option notification_bcc for always attaching to notify emails.
Diane Trout [Tue, 30 Nov 2010 19:45:47 +0000 (11:45 -0800)]
Add config option notification_bcc for always attaching to notify emails.
It's called notification_bcc in the htsworkflow.ini file and
NOTIFICATION_BCC in the django settings.py file.

It ignores the "send bcc" option on the email_started form.

13 years agoCache the attributes for each type of fastq file.
Diane Trout [Wed, 17 Nov 2010 05:39:25 +0000 (21:39 -0800)]
Cache the attributes for each type of fastq file.
Previously it was just using whatever was last as the
set of attributes, which led to everything being flagged as
FastqRd2.

13 years agoMap junctions.bed to the Junctions view.
Diane Trout [Tue, 16 Nov 2010 01:41:04 +0000 (17:41 -0800)]
Map junctions.bed to the Junctions view.
Hopefully it'll pass validation and not be treated as a bed file,
even though it ends in the .bed extension.

(There were lots of problems with a previous submission because the
cufflinks bed file was almost but not quite a match to ucsc's bed
file validator. Or maybe they caved on the definition of a score.)

13 years agoAutodetect location of *2fastq scripts
Diane Trout [Tue, 16 Nov 2010 01:40:11 +0000 (17:40 -0800)]
Autodetect location of *2fastq scripts
Uses the .__file__ variable of the python modules.
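
A small sketch of the __file__ approach, assuming the scripts sit in the same directory as an importable module. Both function names are hypothetical and are not taken from the htsworkflow source.

```python
import os


def module_directory(module):
    """Return the directory holding the file a module was loaded from."""
    return os.path.dirname(os.path.abspath(module.__file__))


def find_sibling_script(module, script_name):
    """Build the path of a script assumed to live beside the given module."""
    return os.path.join(module_directory(module), script_name)
```

The caller still has to check that the returned path exists, since nothing here verifies that the script was actually installed next to the module.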

13 years agoDon't count failed flowcells when guessing if a library is paired or single ended
Diane Trout [Mon, 15 Nov 2010 22:59:57 +0000 (14:59 -0800)]
Don't count failed flowcells when guessing if a library is paired or single ended
In addition the NameToView map was extended to cache the is_paired flag
so I don't have to keep searching through the dictionary of library information.
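
A sketch of the guess described above, under stated assumptions: each lane is a dict with hypothetical "paired_end" and "status" keys, and a status of "failed" marks a lane from a failed flowcell. The real library JSON structure and the NameToView cache are not shown in this log, so the names below are illustrative only.

```python
_is_paired_cache = {}  # hypothetical stand-in for the cached is_paired flag


def guess_is_paired(lanes):
    """Guess paired-end if paired lanes outnumber single-read lanes."""
    paired = 0
    single = 0
    for lane in lanes:
        if lane.get("status") == "failed":
            # Failed flowcells should not influence the guess.
            continue
        if lane["paired_end"]:
            paired += 1
        else:
            single += 1
    return paired > single


def is_paired(library_id, lanes):
    """Return the cached guess for a library, computing it on first use."""
    if library_id not in _is_paired_cache:
        _is_paired_cache[library_id] = guess_is_paired(lanes)
    return _is_paired_cache[library_id]
```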

13 years agoMerge branch 'master' of mus.cacr.caltech.edu:htsworkflow
Diane Trout [Sun, 7 Nov 2010 23:58:07 +0000 (15:58 -0800)]
Merge branch 'master' of mus.cacr.caltech.edu:htsworkflow

13 years agoVery basic non-ExtJS version of the inventory page
Diane Trout [Sun, 7 Nov 2010 23:54:22 +0000 (15:54 -0800)]
Very basic non-ExtJS version of the inventory page
It needs to group by object type. Possibly with an index of types
followed by an index of objects before getting to the detail page.

The component that forces you to log in before viewing the
inventory page is broken. It shows a super-plain form, but
fails on submit with some kind of cross-site request forgery
prevention cookie failure.

13 years agoRemove another ExtJS panel reference.
Diane Trout [Sun, 7 Nov 2010 23:46:49 +0000 (15:46 -0800)]
Remove another ExtJS panel reference.
This version has a working two-column, independently scrollable
library browse with no ExtJS dependency.

13 years agoUpdate to the new UCSC DAF file.
Diane Trout [Fri, 5 Nov 2010 23:27:02 +0000 (16:27 -0700)]
Update to the new UCSC DAF file.
We are submitting 3 different types of expression data,
one for Gencode v3c, one for Gencode v4, and one for the
de novo cufflinks assembly.

As a result I needed to update the file name to ucsc view map
to describe these new, more specific datasets.

Also I modified the name of the archival and upload condor scripts
to include a username.

13 years agoCreate a class to handle mapping extension to ucsc view attributes.
Diane Trout [Wed, 3 Nov 2010 23:59:07 +0000 (16:59 -0700)]
Create a class to handle mapping extension to ucsc view attributes.

This will attempt to guess if a library is paired end by seeing
if there are more paired end lanes than single read lanes.

The file extension matching was changed to use fnmatch instead of
endswith.

I still haven't added the ability to define extensions to ucsc DAF
view maps in a config file, as I don't know how to handle the case
of the .bam file that goes to a different view depending on if it's
a paired end vs single end.

Also the ucsc_gather script is too long and parts of it need
to migrate into the rest of the htsworkflow tree.
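
A sketch of file name to view matching with fnmatch, not the class from this commit. "Junctions" is a view name that appears elsewhere in this log; the other view names and all of the patterns are hypothetical. The point is that a glob such as *plus.bigwig accepts both sample.plus.bigwig and sample_plus.bigwig, which a single endswith(".plus.bigwig") test would not.

```python
import fnmatch

# Illustrative (pattern, view) pairs; only "Junctions" comes from the log.
PATTERN_TO_VIEW = [
    ("*junctions.bed", "Junctions"),
    ("*plus.bigwig", "PlusSignal"),
    ("*minus.bigwig", "MinusSignal"),
]


def find_view(filename, patterns=PATTERN_TO_VIEW):
    """Return the view for the first pattern matching filename, else None."""
    for pattern, view in patterns:
        # fnmatchcase compares case-sensitively on every platform.
        if fnmatch.fnmatchcase(filename, pattern):
            return view
    return None
```

Patterns are tried in order, so more specific patterns should be listed before more general ones.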

13 years agoTwo lines between functions (trivial change)
Diane Trout [Tue, 2 Nov 2010 00:16:03 +0000 (17:16 -0700)]
Two lines between functions (trivial change)

13 years ago Only complain about missing genomes if we _wanted_ to align the lane.
Diane Trout [Mon, 1 Nov 2010 21:55:42 +0000 (14:55 -0700)]
Only complain about missing genomes if we _wanted_ to align the lane.
If it was flagged for sequencing, don't complain.

13 years agoCheck for s_${lane}_02_matrix.txt as well as s_${lane}_1_matrix.txt
Diane Trout [Mon, 1 Nov 2010 21:52:45 +0000 (14:52 -0700)]
Check for s_${lane}_02_matrix.txt as well as s_${lane}_1_matrix.txt
It still could use more test coverage

13 years agoRemove dependency on ExtJS for library & lanes_for pages.
Diane Trout [Thu, 28 Oct 2010 00:25:02 +0000 (17:25 -0700)]
Remove dependency on ExtJS for library & lanes_for pages.
The inventory page still needs extjs.

In the process I cleaned up some of the CSS as well.

The downside is you'll need to link
htsworkflow/static/js/jquery.min.js
to a valid jquery sourcefile.

13 years agoThe WITH_SEQUENCE flag needs to be lower-case "true".
Diane Trout [Wed, 27 Oct 2010 22:51:20 +0000 (15:51 -0700)]
The WITH_SEQUENCE flag needs to be lower-case "true".

13 years agoReport version number derived from git tag.
Diane Trout [Fri, 22 Oct 2010 23:40:13 +0000 (16:40 -0700)]
Report version number derived from git tag.
This patch includes the necessary infrastructure to support that
feature and it's been added to qseq2fastq and srf2fastq.

Additionally to improve testability of qseq2fastq and srf2fastq, the
original standalone module was moved into htsworkflow.pipelines and a
small stub module was placed in scripts.

13 years ago Set WITH_SEQUENCE as both a per-lane AND global parameter [0.4.5]
Diane Trout [Tue, 19 Oct 2010 19:35:57 +0000 (19:35 +0000)]
Set WITH_SEQUENCE as both a per-lane AND global parameter
The per-lane version set $(lane)_WITH_SEQUENCE := TRUE in the makefiles;
the GERALD WITH_SEQUENCE TRUE changed the setting in the config.txt file.

So let's just set both versions and hopefully we'll get sequence files
out the other end.

Let me show you my hammer...

13 years agoWITH_SEQUENCE is actually a global GERALD option, remove the lane specifier.
Diane Trout [Fri, 8 Oct 2010 17:32:46 +0000 (17:32 +0000)]
WITH_SEQUENCE is actually a global GERALD option, remove the lane specifier.

13 years agoUpdate gerald config file generation.
Diane Trout [Thu, 7 Oct 2010 22:32:45 +0000 (22:32 +0000)]
Update gerald config file generation.
Include ELAND_SET_SIZE, WITH_SEQUENCE options
Stop commenting SEQUENCE_FORMAT out

13 years agoMinor changes to display
Lorian Schaeffer [Tue, 5 Oct 2010 18:12:42 +0000 (18:12 +0000)]
Minor changes to display

13 years agoRemoved some models from the admin index view
Lorian Schaeffer [Fri, 1 Oct 2010 19:01:58 +0000 (19:01 +0000)]
Removed some models from the admin index view

13 years agoChanged Frequently Used links to be properly relative
Lorian Schaeffer [Fri, 1 Oct 2010 18:57:01 +0000 (18:57 +0000)]
Changed Frequently Used links to be properly relative

13 years agoRemoved from admin index page
Lorian Schaeffer [Fri, 1 Oct 2010 01:39:37 +0000 (01:39 +0000)]
Removed from admin index page

13 years agoRemoved unused DataRun model from admin index page
Lorian Schaeffer [Fri, 1 Oct 2010 01:25:11 +0000 (01:25 +0000)]
Removed unused DataRun model from admin index page

13 years agoRemoved all analysis models from main admin index
Lorian Schaeffer [Fri, 1 Oct 2010 01:21:39 +0000 (01:21 +0000)]
Removed all analysis models from main admin index

13 years agoLimited which models show up on admin index page
Lorian Schaeffer [Fri, 1 Oct 2010 01:19:44 +0000 (01:19 +0000)]
Limited which models show up on admin index page
Removed some fields from view in the Library Change/Add page
Rearranged the remaining Library fields

13 years agoIgnore some files generated by python
Diane Trout [Thu, 30 Sep 2010 21:58:28 +0000 (21:58 +0000)]
Ignore some files generated by python
.coverage
*.egg-info

13 years ago Adjust spoolwatcher's use of inotify to correspond to pyinotify 0.8.9
Diane Trout [Fri, 24 Sep 2010 22:34:17 +0000 (22:34 +0000)]
Adjust spoolwatcher's use of inotify to correspond to pyinotify 0.8.9

13 years agoHandle the case when a sequencing lane lacks any yield information.
Diane Trout [Fri, 24 Sep 2010 22:34:16 +0000 (22:34 +0000)]
Handle the case when a sequencing lane lacks any yield information.
(For instance if we only run one lane of analysis on a flowcell)

13 years agoRemind myself that dry-run doesn't work with the extract results code
Diane Trout [Fri, 24 Sep 2010 22:34:15 +0000 (22:34 +0000)]
Remind myself that dry-run doesn't work with the extract results code

13 years agoAdd support for CASAVA 1.7
Diane Trout [Fri, 10 Sep 2010 22:35:20 +0000 (22:35 +0000)]
Add support for CASAVA 1.7

They stopped generating eland_extended files so I needed to parse
the export files instead.

Also more carefully test how I'm computing the U0-2 and R0-2
scores.

13 years agoBetter detect which program ids generated CNF4 srf files.
Diane Trout [Tue, 24 Aug 2010 21:32:06 +0000 (21:32 +0000)]
Better detect which program ids generated CNF4 srf files.
Apparently illumina2srf v1.11.5Illumina.1.3 also generated CNF4s.
It's only illumina2srf v1.11.6... that generated CNF1s.

13 years ago Split parts of build_fastqs into a separate function.
Diane Trout [Tue, 24 Aug 2010 00:31:55 +0000 (00:31 +0000)]
Split parts of build_fastqs into a separate function.
Use the database to flag if a lane is bad instead of hard coding it.
Support comments and spaces in the library_id to target directory file.

13 years ago Add an "All Lanes" option to the control-lane field for the cases
Diane Trout [Mon, 23 Aug 2010 22:54:20 +0000 (22:54 +0000)]
Add an "All Lanes" option to the control-lane field for the cases
where we didn't set a control lane, but instead let the pipeline
estimate from the entire flowcell.

13 years agoShrink flowcell notes field.
Diane Trout [Sat, 14 Aug 2010 00:23:06 +0000 (00:23 +0000)]
Shrink flowcell notes field.
See [ticket:167]

13 years agoDon't offer to track flowcell config files in svn
Diane Trout [Sat, 14 Aug 2010 00:07:16 +0000 (00:07 +0000)]
Don't offer to track flowcell config files in svn

13 years agoRemove debug print statement
Diane Trout [Fri, 13 Aug 2010 23:57:06 +0000 (23:57 +0000)]
Remove debug print statement

13 years agoReduce the size of the bioanalyzer summary text box
Diane Trout [Fri, 13 Aug 2010 23:51:32 +0000 (23:51 +0000)]
Reduce the size of the bioanalyzer summary text box
to 3 lines.

13 years agoTell git to ignore *.py[co]~ files.
Diane Trout [Fri, 13 Aug 2010 22:49:10 +0000 (22:49 +0000)]
Tell git to ignore *.py[co]~ files.
(Useful if you're using git-svn)

13 years agoThis patch adds in fields to support storing bioanalyzer results. [ticket:166]
Diane Trout [Fri, 13 Aug 2010 22:49:09 +0000 (22:49 +0000)]
This patch adds in fields to support storing bioanalyzer results. [ticket:166]

13 years agoGeorgi used slightly different extensions for the second batch
Diane Trout [Wed, 14 Jul 2010 22:32:16 +0000 (22:32 +0000)]
Georgi used slightly different extensions for the second batch
of data

13 years agoAdd search field to the lanes admin page
Diane Trout [Wed, 14 Jul 2010 22:31:57 +0000 (22:31 +0000)]
Add search field to the lanes admin page

13 years agoUse the htsworkflow API to determine if a flowcell is paired end or not.
Diane Trout [Mon, 12 Jul 2010 19:02:22 +0000 (19:02 +0000)]
Use the htsworkflow API to determine if a flowcell is paired end or not.

13 years agoUnder some conditions urlerror doesn't have a code,
Diane Trout [Mon, 12 Jul 2010 19:01:31 +0000 (19:01 +0000)]
Under some conditions urlerror doesn't have a code,
so just report the raw error message in that case
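
A sketch of the fallback described above, written against Python 3's urllib.error (the original code most likely used the Python 2 urllib2 module, so this is an adaptation). The function name and message wording are assumptions; the technique is simply to look for a code attribute and fall back to the raw error text when it is missing.

```python
import urllib.error


def describe_url_error(error):
    """Describe a URLError, using its HTTP status code when it has one."""
    # HTTPError carries .code; a plain URLError (e.g. a failed connection)
    # does not, so fall back to the error's own string form.
    code = getattr(error, "code", None)
    if code is not None:
        return "HTTP error %s" % (code,)
    return "URL error: %s" % (error,)
```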

13 years agoReport if a lane was on a paired end flowcell from the library
Diane Trout [Mon, 12 Jul 2010 18:23:39 +0000 (18:23 +0000)]
Report if a lane was on a paired end flowcell from the library
json structure.

13 years agoSince srf2named_fastq detects what type of srf file I can remove
Diane Trout [Fri, 9 Jul 2010 00:08:12 +0000 (00:08 +0000)]
Since srf2named_fastq detects what type of srf file I can remove
the -c option that was forcing it to read CNF1 formatted srf files

13 years agoUpdate srf2named_fastq to try to detect if the srf file is CNF1 or CNF4
Diane Trout [Thu, 8 Jul 2010 22:15:31 +0000 (22:15 +0000)]
Update srf2named_fastq to try to detect if the srf file is CNF1 or CNF4
and figure out the correct option to pass to srf2fastq.

13 years agoAdd option to force overwriting old fastqs.
Diane Trout [Thu, 8 Jul 2010 18:54:15 +0000 (18:54 +0000)]
Add option to force overwriting old fastqs.

It will ignore the current existence of a fastq file when generating
the condor submit script.

In addition it will tell srf2named_fastq to --force as well.

13 years agoIf a quality score started with an @ sign it was treated as a header
Diane Trout [Wed, 7 Jul 2010 00:19:37 +0000 (00:19 +0000)]
If a quality score started with an @ sign it was treated as a header
which created an invalid fastq file.

This patch fixes that, and introduces some test cases for srf2named_fastq.py
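
One way to avoid this class of bug, sketched here under the assumption of unwrapped four-line FASTQ records: read the file by position, four lines at a time, rather than treating any line that starts with "@" as a header. This is an illustration, not the fix applied to srf2named_fastq.py, and the function name is hypothetical.

```python
from itertools import islice


def read_fastq_records(stream):
    """Yield (header, sequence, separator, quality) tuples from a stream.

    Position within the record, not the leading character, decides
    which line is the header, so a quality line may begin with '@'.
    """
    stream = iter(stream)
    while True:
        record = [line.rstrip("\n") for line in islice(stream, 4)]
        if not record:
            return
        if len(record) != 4 or not record[0].startswith("@"):
            raise ValueError("malformed FASTQ record: %r" % (record,))
        yield tuple(record)
```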

13 years agoThis still isn't ideal as the RESULTS_HOME_DIR is still out of my
Diane Trout [Wed, 7 Jul 2010 00:19:36 +0000 (00:19 +0000)]
This still isn't ideal as the RESULTS_HOME_DIR is still out of my
home directory, but at least the use of expanduser means it'll work on
both linux and os x.
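
A small sketch of the expanduser idea. The actual value of RESULTS_HOME_DIR is not shown in this log, so the "results" directory name below is a placeholder and the helper name is hypothetical.

```python
import os


def results_home_dir(subdirectory="results"):
    """Return a per-user results directory under the home directory.

    "results" is a placeholder; the real path is not given in the log.
    """
    # "~" expands to the current user's home directory on Linux and OS X.
    return os.path.expanduser(os.path.join("~", subdirectory))


RESULTS_HOME_DIR = results_home_dir()
```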

13 years agoSave changes needed to submit to UCSC in Jun 2010.
Diane Trout [Tue, 22 Jun 2010 19:07:49 +0000 (19:07 +0000)]
Save changes needed to submit to UCSC in Jun 2010.

Some of those changes include:
  * modifying the list of variables to include in the ddf
  * making it easier to set the MapAlgorithm.
  * Return information about the condor scripts so I can make a
    condor dagman script to run all the compression jobs

Perhaps some parts of this should be moved into the main
htsworkflow. I can see wanting the code to batch convert
srf/qseqs to fastq being more generally useful.

13 years agoFix srf2named_fastq to output the proper /2 in paired end reads
Diane Trout [Tue, 22 Jun 2010 19:07:48 +0000 (19:07 +0000)]
Fix srf2named_fastq to output the proper /2 in paired end reads

13 years agoUpdate the 'flowcell started' email message to include the sequencer
Diane Trout [Tue, 15 Jun 2010 18:47:09 +0000 (18:47 +0000)]
Update the 'flowcell started' email message to include the sequencer
in the url.

13 years agoMake it easier to run srf2named_fastq.py
Diane Trout [Tue, 15 Jun 2010 18:47:08 +0000 (18:47 +0000)]
Make it easier to run srf2named_fastq.py

13 years agoWrapper script that helps convert srf files to fastq files.
Diane Trout [Tue, 15 Jun 2010 00:18:33 +0000 (00:18 +0000)]
Wrapper script that helps convert srf files to fastq files.
It can do the following:
  split the fastq into two files (for paired end reads)
  add in a flowcell id to the header (for either type of read).

13 years agoRename avg_lib_size to gel_cut_size, and add insert_size
Diane Trout [Mon, 14 Jun 2010 21:11:32 +0000 (21:11 +0000)]
Rename avg_lib_size to gel_cut_size, and add insert_size
to clear up the confusion about what was supposed to be
being recorded in avg_lib_size.

In addition, this patch adds in a per lane status field.

13 years agoUpdate internal copy of the django admin/templates/admin/index.html
Diane Trout [Mon, 14 Jun 2010 21:11:31 +0000 (21:11 +0000)]
Update internal copy of the django admin/templates/admin/index.html
They changed how the css was being imported in the django css
so when running with django 1.1 the admin index page was messed up.

This gets all the borders to show up correctly.

13 years agoCollect fastqs by read and add them to the configuration ini file as a
Diane Trout [Fri, 11 Jun 2010 00:16:19 +0000 (00:16 +0000)]
Collect fastqs by read and add them to the configuration ini file as a
single line. (As desired by UCSC).

Also the library to result map file supports a basic comment character.
If # is the _first_ character, it will skip that line.

Next I should fix the avg library size / insert length variables.
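
A sketch of the comment handling described above. It assumes each data line holds a library id and a result directory separated by whitespace, which the log does not confirm; the function name is hypothetical. Only lines whose very first character is # are treated as comments.

```python
def read_library_result_map(lines):
    """Return (library_id, result_dir) pairs, skipping '#' comment lines.

    The two-column, whitespace-separated layout is an assumption.
    """
    results = []
    for line in lines:
        if line.startswith("#"):
            # Comment: '#' must be the first character on the line.
            continue
        fields = line.split()
        if len(fields) != 2:
            # Blank or unexpected lines are ignored in this sketch.
            continue
        results.append((fields[0], fields[1]))
    return results
```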

13 years agoPut partial support back in for srf files.
Diane Trout [Fri, 11 Jun 2010 00:16:18 +0000 (00:16 +0000)]
Put partial support back in for srf files.

Since I don't know if the srf file is supposed to be single or
paired end, this version assumes paired end unless you provide the
--single option.

Currently it'll give up if you try to convert a paired srf file
to a fastq file.

Also I made the code formatting in the make_parser function
look cleaner, and changed it to allow setting the logging verbosity
via command line options --verbose/--debug.

13 years agoAdd script to try and build submissions to the UCSC encode project.
Diane Trout [Thu, 10 Jun 2010 00:55:11 +0000 (00:55 +0000)]
Add script to try and build submissions to the UCSC encode project.

This version supports generating qseq2fastq entries using the
htsworkflow api and scanning the flowcell repository directory.

There was code to generate the ddf files (from ini files).

I need to update the ini generation code to scan the submission directory
for fastq files and group them by read.

13 years agoExtend htsworkflow.pipelines.sequences to also try to figure out the cycle count.
Diane Trout [Thu, 10 Jun 2010 00:55:10 +0000 (00:55 +0000)]
Extend htsworkflow.pipelines.sequences to also try to figure out the cycle count.
In addition there is experimental code to shove the found sequences into a
sql database.

I also needed to bug fix the sequence patterns to catch the fake flowcell
ilmn200901 which wasn't matching my regexp for detecting flowcell ids.

13 years agoMove the code to scan the sequence file archive to its own module so
Diane Trout [Thu, 10 Jun 2010 00:55:09 +0000 (00:55 +0000)]
Move the code to scan the sequence file archive to its own module so
I can use it in scripts other than make-library-tree

13 years agonew api module actually needed logging.
Diane Trout [Thu, 10 Jun 2010 00:55:08 +0000 (00:55 +0000)]
new api module actually needed logging.

13 years agoMove the knowledge of the urls for the REST API to one new file
Diane Trout [Thu, 10 Jun 2010 00:55:06 +0000 (00:55 +0000)]
Move the knowledge of the urls for the REST API to one new file
htsworkflow.util.api and then update some of the scripts that were
using the api to import from the new module.

Yes this increases the dependencies, but it does mean it'll be
easier to update the urls if we need to change them

13 years agoapparently commas are important
Diane Trout [Tue, 1 Jun 2010 19:36:31 +0000 (19:36 +0000)]
apparently commas are important

13 years agomake-library-tree should be an installed script too
Diane Trout [Tue, 1 Jun 2010 19:35:59 +0000 (19:35 +0000)]
make-library-tree should be an installed script too

13 years ago Add in extra fields Lorian asked for to library detail page
Diane Trout [Sat, 29 May 2010 01:06:19 +0000 (01:06 +0000)]
Add in extra fields Lorian asked for to library detail page