htsworkflow.git
14 years agoTrack which flowcells are archived on which long term storage devices
Brandon King [Wed, 10 Jun 2009 17:44:47 +0000 (17:44 +0000)]
Track which flowcells are archived on which long term storage devices
 * Added a command_line script called mark_archived_data
   * Storage Device must be in DB as well as flowcell before using
   * If you supply a device (e.g. /dev/sda), it will extract the serial number of the drive for you.
   * Uses HTTP REST request to link Flowcell, Libraries and Storage Devices.
   * Supports archives across storage devices.
   * device query requires py_sg python package (easy_install py_sg)

14 years agoScan more than one rsync repository for directory trees to copy.
Diane Trout [Wed, 10 Jun 2009 00:51:49 +0000 (00:51 +0000)]
Scan more than one rsync repository for directory trees to copy.
Also don't configure the rsync class until the bot.run function
to avoid problems with logging while daemonized.

Also add some code to make sure that we have a trailing /

14 years agoAdd code to construct a copy url using a mapping in the config file and
Diane Trout [Tue, 9 Jun 2009 00:44:18 +0000 (00:44 +0000)]
Add code to construct a copy url using a mapping in the config file and
the top level directory that had writes in it.
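
A minimal sketch of how such a mapping could work, assuming the config
supplies a dictionary from watch directory to rsync URL root. The function
name, the dictionary, and the example host and paths are hypothetical and
not taken from the repository.

```python
def make_copy_url(watchdir, runfolder, watchdir_to_url):
    """Build the URL the copier should read from.

    watchdir        -- local directory tree where the write was seen
    runfolder       -- top level directory under watchdir that had writes
    watchdir_to_url -- hypothetical mapping loaded from the config file
    """
    base = watchdir_to_url.get(watchdir)
    if base is None:
        # No mapping configured for this watch directory.
        return None
    if not base.endswith("/"):
        base += "/"
    # Keep a trailing / so rsync copies the directory contents.
    return base + runfolder + "/"


url_map = {"/data/runs": "rsync://sequencer/runs"}
print(make_copy_url("/data/runs", "090609_FC1", url_map))
```

Keeping the mapping in one dictionary means the watcher only has to report
the watch directory and the runfolder name.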

14 years agoMinor updates to inventory schema based on input from Lorian.
Brandon King [Fri, 5 Jun 2009 18:00:50 +0000 (18:00 +0000)]
Minor updates to inventory schema based on input from Lorian.

14 years agoPyinotify behaves oddly when the stdio file descriptors are closed.
Diane Trout [Fri, 5 Jun 2009 00:32:00 +0000 (00:32 +0000)]
Pyinotify behaves oddly when the stdio file descriptors are closed.
So don't initialize it until after the daemonize code has been
called, i.e. after BenderJab.start has been called.
(I changed from SpoolWatcher.start to SpoolWatcher.run
and moved the watch manager configuration into the
start of run.)

14 years agoA simple command to request the hard drive serial number from a device.
Brandon King [Thu, 4 Jun 2009 22:55:37 +0000 (22:55 +0000)]
A simple command to request the hard drive serial number from a device.
 * Used py_sg (easy_install py_sg)

14 years agoConsistently use 'watchdir' instead of 'watch_dir' now.
Diane Trout [Mon, 1 Jun 2009 17:57:12 +0000 (17:57 +0000)]
Consistently use 'watchdir' instead of 'watch_dir' now.

14 years agoWatch more than one directory tree for modification.
Diane Trout [Sat, 30 May 2009 01:07:08 +0000 (01:07 +0000)]
Watch more than one directory tree for modification.
I also added code to record which watch directory, and which entry in that
directory, was being created (i.e. whatever element of the runfolder is being
touched, I record the root of the runfolder directory).

To support this I changed the config option in the benderjab file from
watchdir to watchdirs to make it a bit clearer that things have changed.

I still need to work out the communication protocol to copier so it
can figure out what to start copying. Also I'm recording the
watchdirectory, but copier needs access to a different url. So there's
some question about where the right place to map watchdir to copy url source
might be.

14 years agoDon't print the discovered mount point.
Diane Trout [Sat, 30 May 2009 01:01:49 +0000 (01:01 +0000)]
Don't print the discovered mount point.

14 years agoFirst attempt at an inventory tracking database
Brandon King [Fri, 29 May 2009 23:00:04 +0000 (23:00 +0000)]
First attempt at an inventory tracking database
 * Includes a LongTermStorage object which links flowcells/libraries to storage items.

14 years agoAdd a minimal test case to test a small bit of the django app.
Diane Trout [Wed, 27 May 2009 23:21:42 +0000 (23:21 +0000)]
Add a minimal test case to test a small bit of the django app.
(I needed to figure out how to do testing for django).

There's a tool, nosedjango, from http://hg.assembla.com/nosedjango
that adds auto-detection of the django settings file to
nose.

I needed to modify it for our app though.

14 years agoMore consistently split the flowcell id and status messages
Diane Trout [Mon, 11 May 2009 20:06:31 +0000 (20:06 +0000)]
More consistently split the flowcell id and status messages
Some flowcells weren't being found because their names were
"flowcell (status)" which meant the code that was trying to find
the run.xml file in the results/flowcells directory was looking
for a directory named "results/flowcell_id (status)" which
wasn't so effective.

We really need a status field
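
A minimal sketch of the kind of split described above, assuming names of the
form "flowcell_id (status)". The function name and the example flowcell id
are hypothetical.

```python
def split_flowcell_name(name):
    """Split 'FLOWCELL (status)' into (flowcell_id, status).

    status is None when the name carries no status suffix.
    """
    flowcell_id, _sep, rest = name.partition(" ")
    status = rest.strip()
    # Drop the surrounding parentheses if they are present.
    if status.startswith("(") and status.endswith(")"):
        status = status[1:-1]
    return flowcell_id, (status or None)


print(split_flowcell_name("30DY0AAXX (failed)"))
print(split_flowcell_name("30DY0AAXX"))
```

With the id separated out, the lookup under results/ can use the bare
flowcell id as the directory name.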

14 years agoFix parameter list for base class.
Diane Trout [Mon, 11 May 2009 20:03:16 +0000 (20:03 +0000)]
Fix parameter list for base class.
I was passing in the ElementTree xml into the base class constructor instead of
an integer representing which 'end' the sample is from.
When working with single-ended ElandLanes, this left the ElementTree node
in place of None (which showed up in the UI).

14 years agoUpdate htsworkflow.util.fctracker module to work with the newer django db
Diane Trout [Wed, 6 May 2009 22:56:29 +0000 (22:56 +0000)]
Update htsworkflow.util.fctracker module to work with the newer django db
this was needed to support the make-library-tree script

14 years agoUpdate the django app to report "raw reads" for a sequence lane.
Diane Trout [Tue, 5 May 2009 19:21:30 +0000 (19:21 +0000)]
Update the django app to report "raw reads" for a sequence lane.
I still need to figure out how to allow downloading of the
"result" files.

14 years agoHandle lanes that were only sequenced.
Diane Trout [Tue, 5 May 2009 18:02:20 +0000 (18:02 +0000)]
Handle lanes that were only sequenced.
So the report needs to be a bit smaller, and I need to archive a different file.
Also this version only counts the number of records in the "sequence" file,
which is raw sequence that passed the QC filter. I don't have a uniform way
of determining how many total sequences there were.

14 years agoReturn NaN, NaN if a mean +/- deviation value was undefined in Summary.htm
Diane Trout [Tue, 5 May 2009 17:59:03 +0000 (17:59 +0000)]
Return NaN, NaN if a mean +/- deviation value was undefined in Summary.htm
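
A minimal sketch of the behaviour described above, assuming the table cells
look like "1234 +/- 56". The function name is hypothetical, and the exact
text Summary.htm uses for an undefined value is not known here, so anything
that fails to parse is treated as undefined.

```python
def parse_mean_range(text):
    """Parse 'mean +/- deviation' into two floats.

    Returns (NaN, NaN) when the cell does not hold two parseable numbers.
    """
    nan = float("nan")
    parts = text.split("+/-")
    if len(parts) != 2:
        return nan, nan
    try:
        # float() ignores the whitespace around each number.
        return float(parts[0]), float(parts[1])
    except ValueError:
        return nan, nan


print(parse_mean_range("1234 +/- 56"))
```

Returning a pair in both cases lets callers unpack the result without
checking for a special value first.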

14 years agoAllow overriding the queued commands environment.
Diane Trout [Tue, 5 May 2009 17:56:11 +0000 (17:56 +0000)]
Allow overriding the queued commands environment.

15 years agoAdd wait_for_ipar option to the spool watcher config which will wait until
Diane Trout [Fri, 1 May 2009 00:22:17 +0000 (00:22 +0000)]
Add wait_for_ipar option to the spool watcher config which will wait until
"IPAR_Netcopy_Complete.txt" is written before sending the "sequencing finished"
message.

15 years agoAllow mapping library IDs to the library primary key
Diane Trout [Thu, 30 Apr 2009 22:44:34 +0000 (22:44 +0000)]
Allow mapping library IDs to the library primary key

15 years agoadd --clean option to runfolder
Diane Trout [Fri, 24 Apr 2009 23:37:35 +0000 (23:37 +0000)]
add --clean option to runfolder
this deletes the various log files as well as the images and calibration
directories.
Also it runs make clean_intermediate in the firecrest/ipar directory
(if the makefile exists)
Perhaps it should delete the bustard & gerald directories?
Or should I delete them after I've archived the useful parts.

15 years agoAllow picking runfolder variant we should be trying to extract srf files from.
Diane Trout [Fri, 24 Apr 2009 23:34:13 +0000 (23:34 +0000)]
Allow picking runfolder variant we should be trying to extract srf files from.

adds an option -r to choose which version of the illumina2srf or solexa2srf
programs
0 is solexa2srf
1 is illumina2srf -V1
2 is illumina2srf (current pipeline 1.3.2 version)
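
A minimal sketch of how the -r values listed above could select a program.
It uses argparse rather than whatever option parser the script actually
uses; the function name, the dest name, and the default of 2 are assumptions.

```python
import argparse

# Variant numbers and programs as listed in the commit message.
SRF_COMMANDS = {
    0: ["solexa2srf"],
    1: ["illumina2srf", "-V1"],
    2: ["illumina2srf"],
}


def pick_srf_command(argv):
    """Parse -r from argv and return the base command for that variant."""
    parser = argparse.ArgumentParser()
    # The default of 2 is an assumption made for this sketch.
    parser.add_argument("-r", dest="variant", type=int,
                        choices=sorted(SRF_COMMANDS), default=2)
    args = parser.parse_args(argv)
    # Return a copy so callers can append arguments safely.
    return list(SRF_COMMANDS[args.variant])


print(pick_srf_command(["-r", "1"]))
```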

15 years agoStop assuming that library_id == pk.
Diane Trout [Wed, 8 Apr 2009 00:47:34 +0000 (00:47 +0000)]
Stop assuming that library_id == pk.
I needed to fix the get_absolute_url to lookup by library_id,
we needed a redirect function to get to the right admin page.

And for sequence runs I don't have any eland results so the get result files
failed because it was assuming that the eland_result from the run xml would
always have all 8 lanes defined in its dictionary. Now I check to see if it's
defined and if it's not I don't display anything.

15 years agoDon't crash if we can't find the flowcell run xml file
Diane Trout [Sat, 4 Apr 2009 00:41:40 +0000 (00:41 +0000)]
Don't crash if we can't find the flowcell run xml file

15 years agoDisplay the affiliations on the public library detail page.
Diane Trout [Sat, 4 Apr 2009 00:36:43 +0000 (00:36 +0000)]
Display the affiliations on the public library detail page.
This involved cleaning up some of the view code to push all of the html
generation into the template and none in the view function, by modifying
the view function to feed everything to the template in the form of
variables.

15 years agoShow the "Amplified from sample" as the parent library id and allow
Diane Trout [Wed, 1 Apr 2009 00:49:19 +0000 (00:49 +0000)]
Show the "Amplified from sample" as the parent library id and allow
searching on that on the public library index.

Additionally detect if we have an eland_multi or eland extended file
and make the url to download those different files. (and change the
link name slightly to reflect you're getting a different file.)

finally don't bother showing the bed downloader for newer file types
as the bed convertor doesn't work on them.

15 years agoUse the image directory name to detect what type of image dir we have,
Diane Trout [Tue, 31 Mar 2009 01:33:27 +0000 (01:33 +0000)]
Use the image directory name to detect what type of image dir we have,
firecrest or ipar.

Also remove the pesky return that was preventing --extract-results from
actually generating the compressed eland files.

And finally change some of the logging to logging.info instead of debug
as there wasn't currently a way to generate logging.debug messages.

15 years agoDon't attempt to save xml for a matrix in firecrest if it wasn't created.
Diane Trout [Tue, 31 Mar 2009 01:30:40 +0000 (01:30 +0000)]
Don't attempt to save xml for a matrix in firecrest if it wasn't created.
since we're now creating it in bustard for greater than pipeline 1.1

15 years agoremove some useless debugging print statements
Diane Trout [Tue, 31 Mar 2009 01:29:12 +0000 (01:29 +0000)]
remove some useless debugging print statements

15 years agoprovide an error message if no run is found with the --use-run option
Diane Trout [Tue, 31 Mar 2009 01:28:14 +0000 (01:28 +0000)]
provide an error message if no run is found with the --use-run option

15 years agoAdd unit test to check that the crosstalk matrix was being saved and reloaded
Diane Trout [Mon, 30 Mar 2009 23:14:26 +0000 (23:14 +0000)]
Add unit test to check that the crosstalk matrix was being saved and reloaded
Apparently if your unit test fails to check something there's no guarantee
that it happens.
Also actually add an implementation for saving the matrix.

15 years agoIPAR detection is more reliable than firecrest so do it first, and then
Diane Trout [Mon, 30 Mar 2009 22:57:41 +0000 (22:57 +0000)]
IPAR detection is more reliable than firecrest so do it first, and then
check for firecrest second

15 years agoAllow specifying a run instead of just scanning the runfolder for it.
Diane Trout [Mon, 30 Mar 2009 18:53:49 +0000 (18:53 +0000)]
Allow specifying a run instead of just scanning the runfolder for it.
Also rework some of the command line options to group the commands
together, and to print helpful error messages when runfolder is being used
incorrectly.

15 years agoAdd some logging code to report what the program is doing
Diane Trout [Mon, 30 Mar 2009 18:52:26 +0000 (18:52 +0000)]
Add some logging code to report what the program is doing

15 years agoFor pipeline 1.1rc1 or 1.3.2, look for the matrix files in the bustard dir
Diane Trout [Sat, 28 Mar 2009 02:17:22 +0000 (02:17 +0000)]
For pipeline 1.1rc1 or 1.3.2, look for the matrix files in the bustard dir
also if the bustard config.xml file is present, check to see if the
matrix file was forced in there.

15 years agoMove sample data from simulate_runfolder into testdata
Diane Trout [Sat, 28 Mar 2009 00:59:59 +0000 (00:59 +0000)]
Move sample data from simulate_runfolder into testdata
Also I changed make_matrix to take a filename instead of a matrix directory
as in pipeline > 1.1rc1 they started writing the matrix into the bustard
directory instead of as a subdirectory of the firecrest directory.

15 years agoMake the visible names match how the "group" and "contact" names were being used.
Diane Trout [Wed, 25 Mar 2009 19:12:43 +0000 (19:12 +0000)]
Make the visible names match how the "group" and "contact" names were being used.
So "Name" is the person submitting the sample
and "contact" is now called "lab name" and mostly contains the PI or Lab manager name.

15 years agoConsume output from the subprocesses
Diane Trout [Fri, 20 Mar 2009 00:21:27 +0000 (00:21 +0000)]
Consume output from the subprocesses

one of the times I was building the srf files the illumina2srf programs
stopped while the files were still incomplete, but with no CPU time.

It appears that the output from the child processes reached a point
where linux decided to block the process. This patch reads the output
from illumina2srf and if you run it with debug mode on it'll log it
otherwise it just ignores it.
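
A minimal sketch of the technique, assuming the child is started with its
output attached to a pipe. If nobody reads that pipe, the child blocks in
write() once the pipe buffer fills, which matches the stall described above.
The function name is hypothetical.

```python
import logging
import subprocess


def run_and_drain(cmd):
    """Run cmd and read its output line by line so the pipe never fills.

    Lines go to logging.debug, so they are only visible in debug mode
    and are otherwise discarded. Returns the child's exit status.
    """
    proc = subprocess.Popen(cmd,
                            stdout=subprocess.PIPE,
                            stderr=subprocess.STDOUT,
                            universal_newlines=True)
    for line in proc.stdout:
        logging.debug("%s", line.rstrip())
    proc.stdout.close()
    return proc.wait()
```

Merging stderr into stdout keeps a single pipe to drain, which avoids
blocking on one stream while the other fills up.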

15 years agoActually we want fastq files, not scarf files
Diane Trout [Wed, 18 Mar 2009 19:24:41 +0000 (19:24 +0000)]
Actually we want fastq files, not scarf files

15 years agoGenerate a gerald config file, even if the genome is unknown
Diane Trout [Wed, 18 Mar 2009 19:20:49 +0000 (19:20 +0000)]
Generate a gerald config file, even if the genome is unknown
as it makes it easier to edit the wrong config file and manually launch
an analysis.

15 years agoprovide more options for picking how much logging info to spew
Diane Trout [Wed, 18 Mar 2009 18:35:49 +0000 (18:35 +0000)]
provide more options for picking how much logging info to spew

15 years agoonly report cwd when starting the queue monitor, not on every start job loop
Diane Trout [Wed, 18 Mar 2009 18:35:15 +0000 (18:35 +0000)]
only report cwd when starting the queue monitor, not on every start job loop

15 years agoignore *.py[co] files
Diane Trout [Wed, 18 Mar 2009 18:34:09 +0000 (18:34 +0000)]
ignore *.py[co] files

15 years agoParse runfolders generated with IPAR 1.3 and pipeline 1.3.2
Diane Trout [Mon, 16 Mar 2009 22:49:11 +0000 (22:49 +0000)]
Parse runfolders generated with IPAR 1.3 and pipeline 1.3.2
I'm still parsing the Summary.htm file, though it appears they added
an xml file with the same information in it.

Also the s_matrix.txt file seems to have gone away.

This also adds a full Summary.htm into pipelines/test/testdata

15 years agoUse django's FilteredSelectMultiple javascript widget for picking affiliations
Diane Trout [Thu, 12 Mar 2009 23:56:57 +0000 (23:56 +0000)]
Use django's FilteredSelectMultiple javascript widget for picking affiliations

15 years agoallow searching by cluster station or sequencer name on the flowcell admin page
Diane Trout [Thu, 12 Mar 2009 00:54:07 +0000 (00:54 +0000)]
allow searching by cluster station or sequencer name on the flowcell admin page

15 years agoWe can't search linked tables. This is a problem that needs fixing.
Diane Trout [Tue, 10 Mar 2009 19:20:11 +0000 (19:20 +0000)]
We can't search linked tables. This is a problem that needs fixing.

15 years agoThe public library page needed to pass eland.genome_map to summarize_mapped_reads
Diane Trout [Tue, 10 Mar 2009 01:17:13 +0000 (01:17 +0000)]
The public library page needed to pass eland.genome_map to summarize_mapped_reads

15 years agoImprove support for eland searching a single fasta containing multiple records.
Diane Trout [Tue, 10 Mar 2009 01:11:22 +0000 (01:11 +0000)]
Improve support for eland searching a single fasta containing multiple records.
the problem was that I was assuming / was a path separator between genome
directory name and chromosome, but eland was also reporting it as
fasta file name / fasta record.

By happy accident in genome map, the fasta file with multiple records would
be stored in the GenomeMap dictionary as having the same name, value pair
while things that had the genome encoded would be fasta filename mapping to
genome/fasta filename.

as a result it appears that splitting a mapped item on the path separator /
and then looking the "base path" up in the genome map will allow me to
determine if an element is a genome directory "path" or a multi record
fasta file by its absence (for genome dirs) or presence (for multi
fasta records)
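
A minimal sketch of the lookup described above, assuming the genome map is a
plain dictionary. The function name, the return labels, and the example file
names are hypothetical; only the presence/absence rule comes from the commit
message.

```python
def classify_mapped_item(item, genome_map):
    """Decide what the part before the first / refers to.

    Returns (kind, base, rest) where kind is 'multi_fasta' when base is a
    key in genome_map, 'genome_dir' when it is absent, and 'plain' when
    the item contains no / at all.
    """
    base, sep, rest = item.partition("/")
    if not sep:
        return ("plain", item, None)
    if base in genome_map:
        # Multi record fasta files are stored under their own name.
        return ("multi_fasta", base, rest)
    # Genome directory names are never keys in the map.
    return ("genome_dir", base, rest)


genome_map = {
    "chr1.fa": "mm9/chr1.fa",     # fasta filename -> genome/fasta filename
    "contigs.fa": "contigs.fa",   # multi record fasta maps to itself
}
print(classify_mapped_item("contigs.fa/contig_12", genome_map))
print(classify_mapped_item("mm9/chr1.fa", genome_map))
```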

15 years agoUse gigio's full name instead of nickname
Diane Trout [Tue, 10 Mar 2009 00:29:04 +0000 (00:29 +0000)]
Use gigio's full name instead of nickname

15 years agoLet cluster_station and sequencer default to the first entry in linked table
Diane Trout [Tue, 10 Mar 2009 00:08:33 +0000 (00:08 +0000)]
Let cluster_station and sequencer default to the first entry in linked table

15 years agoreplace % with %% in the library names, so retrieve config's % expansion works.
Diane Trout [Mon, 9 Mar 2009 23:36:55 +0000 (23:36 +0000)]
replace % with %% in the library names, so retrieve config's % expansion works.
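
A minimal sketch of why the doubling is needed, using the Python 3
configparser module (the 2009 code would have used the Python 2
ConfigParser). A lone % in a value is read as the start of an interpolation
reference, while %% comes back as a literal %. The section, option, and
library name are hypothetical.

```python
from configparser import ConfigParser


def escape_percent(value):
    """Double every % so config interpolation treats it as a literal."""
    return value.replace("%", "%%")


parser = ConfigParser()
parser.add_section("lane1")
# Setting the unescaped name would raise ValueError here.
parser.set("lane1", "library_name", escape_percent("5% input control"))
print(parser.get("lane1", "library_name"))  # prints: 5% input control
```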

15 years agoChange flowcell admin index page to include a formatted cluster estimate
Diane Trout [Mon, 9 Mar 2009 22:05:54 +0000 (22:05 +0000)]
Change flowcell admin index page to include a formatted cluster estimate

15 years agoChange default picomolarity to 5, and allow overriding the default
Diane Trout [Mon, 9 Mar 2009 19:28:10 +0000 (19:28 +0000)]
Change default picomolarity to 5, and allow overriding the default
via the htsworkflow ini file.
closes [ticket:120] and [ticket:83]
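
A minimal sketch of an overridable default, assuming the ini file carries a
default_pm option. The section name "frontend", the function name, and the
choice of float are assumptions for illustration.

```python
from configparser import ConfigParser

DEFAULT_PM = 5.0  # built-in default when the ini file says nothing


def load_default_pm(ini_text, section="frontend"):
    """Return default_pm from the ini text, or DEFAULT_PM if it is unset."""
    parser = ConfigParser()
    parser.read_string(ini_text)
    # fallback covers both a missing section and a missing option.
    return parser.getfloat(section, "default_pm", fallback=DEFAULT_PM)


print(load_default_pm(""))
print(load_default_pm("[frontend]\ndefault_pm = 7\n"))
```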

15 years agoAdd commented htsworkflow.ini.example as documentation for new settings
Diane Trout [Mon, 9 Mar 2009 19:25:21 +0000 (19:25 +0000)]
Add commented htsworkflow.ini.example as documentation for new settings
like default_pm.

15 years agoSort by library id, not creation date closes [ticket:116]
Diane Trout [Mon, 9 Mar 2009 18:39:02 +0000 (18:39 +0000)]
Sort by library id, not creation date closes [ticket:116]
I left the sort by creation date in there commented out, in case that's how
hudson/alpha wants to do it.

15 years agoAdd flowcell notes field back to the admin page
Diane Trout [Mon, 9 Mar 2009 17:19:40 +0000 (17:19 +0000)]
Add flowcell notes field back to the admin page

15 years agoflowcell model moved from samples to experiments, eland_config needed to know
Diane Trout [Sat, 7 Mar 2009 06:47:18 +0000 (06:47 +0000)]
flowcell model moved from samples to experiments, eland_config needed to know
(and I forgot to tell it)

15 years agoThe default django css pages wanted a few gifs, which this patch adds
Diane Trout [Sat, 7 Mar 2009 01:47:38 +0000 (01:47 +0000)]
The default django css pages wanted a few gifs, which this patch adds

15 years agoGrab more of the admin css pages that were referring to each other
Diane Trout [Sat, 7 Mar 2009 01:43:04 +0000 (01:43 +0000)]
Grab more of the admin css pages that were referring to each other
and rename the location for the images they were looking for from
img/admin to img/ (since I expect we'll modify these templates some
from django's defaults)

15 years agoDisplay the pM on the public library detail page.
Diane Trout [Sat, 7 Mar 2009 00:58:48 +0000 (00:58 +0000)]
Display the pM on the public library detail page.
Also I was getting tired of our ugly pages so I ripped large chunks of the
django css and more of their templates to make our library index page look
nicer.

I started working on the public library detail page but that started
involving too many alterations to the code for friday.

15 years agoAllow filtering by the new hidden field on the library admin page
Diane Trout [Sat, 7 Mar 2009 00:55:28 +0000 (00:55 +0000)]
Allow filtering by the new hidden field on the library admin page

15 years agoLorian asked that the library size default to 225 when porting from our old database
Diane Trout [Sat, 7 Mar 2009 00:54:50 +0000 (00:54 +0000)]
Lorian asked that the library size default to 225 when porting from our old database

15 years agoMake the public library browsing page support several features from
Diane Trout [Fri, 6 Mar 2009 02:09:26 +0000 (02:09 +0000)]
Make the public library browsing page support several features from
django admin pages.

  * Search bar
  * Pagination
  * Filters

This took grabbing the "ChangeSet" class from django.contrib.admin and
slightly modifying, in addition to the css files.

To provide the css files I had to add the ability to serve static
files from the app level. I followed the django pattern that the
internal static pages would only be served by django when debug is true.

And because it's hard to split it out, I also added a new field
'hidden' to the library table. This allows hiding libraries from
the public library page (which is useful since a number of libraries
have a gel isolate pair, which many end users find rather confusing).

15 years agoAdd another username to manage for our made_for to affiliation script
Diane Trout [Fri, 6 Mar 2009 02:04:08 +0000 (02:04 +0000)]
Add another username to manage for our made_for to affiliation script

15 years agoAdd get_absolute_url to Library model and use it for linking from both the
Diane Trout [Wed, 4 Mar 2009 20:10:35 +0000 (20:10 +0000)]
Add get_absolute_url to Library model and use it for linking from both the
index and detail view to the public detail page.

15 years agoImplement experiment type as a separate table.
Diane Trout [Wed, 4 Mar 2009 02:26:05 +0000 (02:26 +0000)]
Implement experiment type as a separate table.
closes [ticket:107]
also initialize the experiment type table with the conversion script

15 years agoChange cluster_mac_id and seq_mac_id from text fields holding
Diane Trout [Wed, 4 Mar 2009 01:17:20 +0000 (01:17 +0000)]
Change cluster_mac_id and seq_mac_id from text fields holding
the contents of a drop down box to be separate tables whose table names
are a bit closer to the official illumina names.
clust_mac_id -> cluster_station
seq_mac_id -> sequencer

15 years agoFor one of our more recent libraries, the successful_pM had more digits
Diane Trout [Wed, 4 Mar 2009 01:13:55 +0000 (01:13 +0000)]
For one of our more recent libraries, the successful_pM had more digits
than was allowed, so I bumped up the setting.
Also I changed the u in ul to \00b5l (so in most browsers it will
render as a micro)

15 years agoAdd a link to the public library summary page off from the library admin list
Diane Trout [Sat, 28 Feb 2009 01:05:28 +0000 (01:05 +0000)]
Add a link to the public library summary page off from the library admin list

15 years agoAdd my script to convert Caltech's made_for field to htsworkflow 0.2's affiliation...
Diane Trout [Fri, 27 Feb 2009 22:15:09 +0000 (22:15 +0000)]
Add my script to convert Caltech's made_for field to htsworkflow 0.2's affiliation field.
This could be used as an example for other sites.

15 years agoUpdate some of Rami's GERALD config file generator to Django 1.0 API
Diane Trout [Fri, 27 Feb 2009 22:04:32 +0000 (22:04 +0000)]
Update some of Rami's GERALD config file generator to Django 1.0 API
and get the config file to show up in a browser by setting the mime/type

15 years agoOnly configure logging if we're being run as a script
Diane Trout [Fri, 27 Feb 2009 01:01:02 +0000 (01:01 +0000)]
Only configure logging if we're being run as a script

15 years agobetter handle the case when Affilations.contact is None
Diane Trout [Thu, 26 Feb 2009 05:21:52 +0000 (05:21 +0000)]
better handle the case when Affilations.contact is None

15 years agoRemove a few more fields that we don't use, and change the Lanes display
Diane Trout [Thu, 26 Feb 2009 03:04:14 +0000 (03:04 +0000)]
Remove a few more fields that we don't use, and change the Lanes display
on the flowcell index page.

I switched from the <div><span> that Rami was using to a <ol><li>
hierarchy, I also made the items link to the corresponding library
page.

15 years agosimplify library admin index view
Diane Trout [Tue, 24 Feb 2009 18:42:17 +0000 (18:42 +0000)]
simplify library admin index view

15 years agoAdd script to convert current caltech v0.1.x database to the trunk schema
Diane Trout [Tue, 17 Feb 2009 23:45:59 +0000 (23:45 +0000)]
Add script to convert current caltech v0.1.x database to the trunk schema

15 years agoMerge flowcell 'paired_end' flag from v0.1 branch
Diane Trout [Tue, 17 Feb 2009 23:45:24 +0000 (23:45 +0000)]
Merge flowcell 'paired_end' flag from v0.1 branch
Then use this flag in the eland_config module to specify ANALYSIS eland_pair
Also change the default analysis to eland_extended

use the following to add the new field:
alter table fctracker_flowcell add column paired_end bool not null default false;

15 years agoremove cfg_defaults. It was part of an idea I was starting to implement
Diane Trout [Tue, 17 Feb 2009 22:39:59 +0000 (22:39 +0000)]
remove cfg_defaults. It was part of an idea I was starting to implement
but I didn't actually do it, so now it's just causing problems

15 years agoReverse a mistake. DataRuns is from Rami and should use Rami's field names
Diane Trout [Fri, 13 Feb 2009 23:48:18 +0000 (23:48 +0000)]
Reverse a mistake. DataRuns is from Rami and should use Rami's field names

15 years agoenable sort of library view by descending library_id
Diane Trout [Fri, 13 Feb 2009 23:35:08 +0000 (23:35 +0000)]
enable sort of library view by descending library_id

15 years agosave longer comment describing change
Diane Trout [Fri, 13 Feb 2009 21:41:43 +0000 (21:41 +0000)]
save longer comment describing change

15 years agoApparently some more recent version of multi eland stopped
Diane Trout [Fri, 13 Feb 2009 01:59:42 +0000 (01:59 +0000)]
Apparently some more recent version of multi eland stopped
writing the - and just uses fewer fields when there were too
many hits.

15 years agoMerge in the library list, detail, and results downloading feature from
Diane Trout [Fri, 13 Feb 2009 01:51:58 +0000 (01:51 +0000)]
Merge in the library list, detail, and results downloading feature from
the Caltech live site.

There's several components in the frontend tree to render the pages,
in addition this adds in some helper functions in pipelines.eland
to simplify computing summary statistics for an eland lane.

I also needed to merge in a generator based makebed code for
returning the files to the django database.

To use this, the settings file in this branch will need a variable
RESULT_HOME_DIR to be set.

15 years agouse the compression handling auto-opener for our eland files
Diane Trout [Fri, 13 Feb 2009 01:42:06 +0000 (01:42 +0000)]
use the compression handling auto-opener for our eland files
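
A minimal sketch of a compression-aware opener, assuming the choice is made
from the file extension. The function name is hypothetical and the real
helper may detect compression differently.

```python
import bz2
import gzip


def open_maybe_compressed(filename, mode="rt"):
    """Open filename, decompressing .gz and .bz2 files transparently."""
    if filename.endswith(".gz"):
        return gzip.open(filename, mode)
    if filename.endswith(".bz2"):
        return bz2.open(filename, mode)
    return open(filename, mode)
```

Callers can then iterate over an eland file the same way whether or not it
was compressed when it was archived.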

15 years agomake our API docstrings more epydoc friendly
Diane Trout [Thu, 12 Feb 2009 22:38:09 +0000 (22:38 +0000)]
make our API docstrings more epydoc friendly

15 years agoAdd load_pipeline_run_xml, a little function that feeds the xml file into
Diane Trout [Thu, 12 Feb 2009 22:37:08 +0000 (22:37 +0000)]
Add load_pipeline_run_xml, a little function that feeds the xml file into
ElementTree and grabs the useful root
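
A minimal sketch of such a helper using the standard library ElementTree.
The function name below is hypothetical; the real load_pipeline_run_xml may
do more than this.

```python
from xml.etree import ElementTree


def load_xml_root(source):
    """Parse an XML file (path or file object) and return its root element."""
    tree = ElementTree.parse(source)
    return tree.getroot()
```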

15 years agoDrop 'using %s as cwd' down to just debug level.
Diane Trout [Thu, 5 Feb 2009 00:06:39 +0000 (00:06 +0000)]
Drop 'using %s as cwd' down to just debug level.
It was getting too annoying watching it scroll by constantly

15 years agoextended command line configuration parsing and add config file parsing
Diane Trout [Fri, 30 Jan 2009 20:47:35 +0000 (20:47 +0000)]
extended command line configuration parsing and add config file parsing
for finding the location of our database and sequence archive directories.

15 years agoTry to make runfolder results extraction more robust
Diane Trout [Fri, 30 Jan 2009 02:15:57 +0000 (02:15 +0000)]
Try to make runfolder results extraction more robust
If an IPAR or firecrest directory is missing some of the important
matrix files that implies there isn't actually a valid run present,
this patch will then (hopefully) issue a warning and skip that analysis
run.

I also added an option to scripts/runfolder to allow a user to specify
where the extracted results should go.

One questionable thing is that for one analysis some of the lanes
were run as sequence and not an eland analysis so where I expected
all the lanes to have an eland genome, it doesn't for these.
I hope that the code doesn't lose the index after serializing and
deserializing that chunk example.

15 years agoUpdate to not hard code the config file name and the error message
Diane Trout [Fri, 30 Jan 2009 01:51:50 +0000 (01:51 +0000)]
Update to not hard code the config file name and the error message
for when we don't find it

15 years agoinsert code to do ~ home directory expansion
Diane Trout [Sat, 24 Jan 2009 00:24:18 +0000 (00:24 +0000)]
insert code to do ~ home directory expansion
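
A minimal sketch of the expansion, using os.path.expanduser from the
standard library. The function name and the example path are hypothetical.

```python
import os


def expand_home(path):
    """Replace a leading ~ or ~user with the matching home directory."""
    return os.path.expanduser(path)


print(expand_home("~/sequence_archive"))
```

Paths read from a config file are plain strings, so the shell never gets a
chance to expand ~ in them.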

15 years agoAdd in Rami's report template, and adjust the paths to use "reports" instead
Diane Trout [Fri, 23 Jan 2009 02:23:21 +0000 (02:23 +0000)]
Add in Rami's report template, and adjust the paths to use "reports" instead
of "htsw_reports"

15 years agoAdd id as an AutoNumber(primary_key=True) field and remove the pk from
Diane Trout [Fri, 23 Jan 2009 02:21:09 +0000 (02:21 +0000)]
Add id as an AutoNumber(primary_key=True) field and remove the pk from
library_id.

Stanford decided to use library_id as a text field so they could use
library IDs like "SL100". Caltech just used the raw sql id, so the
foreign key reference in experiments_flowcells was expecting a numeric
id, but since the model had the text field as the primary key things
didn't work.

15 years agoMerge in Rami's changes from last friday.
Diane Trout [Wed, 21 Jan 2009 02:50:22 +0000 (02:50 +0000)]
Merge in Rami's changes from last friday.

Most of the admin pages work. Though there's a wsgi error with the reports.
I'll try to figure out tomorrow.

the biggest difference between caltech trunk and stanford schemas right now
is caltech is using made_for as a foreign key, and stanford is using it
as a text field.

15 years agoadd some testing code for the runner daemon
Diane Trout [Wed, 14 Jan 2009 01:18:42 +0000 (01:18 +0000)]
add some testing code for the runner daemon

15 years agoadd empty admin.py for eland_config app
Diane Trout [Wed, 14 Jan 2009 01:17:16 +0000 (01:17 +0000)]
add empty admin.py for eland_config app

15 years agoMerged much of the stanford htsworkflow frontend into trunk.
Diane Trout [Wed, 14 Jan 2009 01:12:47 +0000 (01:12 +0000)]
Merged much of the stanford htsworkflow frontend into trunk.
Updated to be compatible with Django 1.0

A big change for the 1.0 compatibility is the Admin class that was
attached to models was moved into a separate file admin.py

I probably munged some of the fieldset formatting in the conversion process.

15 years agoThis is a partial merge of the stanford branch with the caltech branch of
Diane Trout [Thu, 8 Jan 2009 20:12:03 +0000 (20:12 +0000)]
This is a partial merge of the stanford branch with the caltech branch of
the web application, it doesn't work correctly yet, the libraries admin page
is broken, and lacks the ability to browse the 'made_for' column.

This is based on a merge that started a few months ago, but I hadn't finished
the merge, I'll need to check for more updates from their branch soon.

During the process I decided it would be a good idea to update to django 1.0
which is going to make things even more unstable, so I thought I should
check this work in progress in before continuing.

15 years agoLook in Temp directories for some of the files we have historically
Diane Trout [Tue, 6 Jan 2009 02:05:10 +0000 (02:05 +0000)]
Look in Temp directories for some of the files we have historically
used for our summary reports.

Version 1.1rc1 of the gapipeline started moving some of the files
into /Temp subdirectories of bustard and gerald.