htsworkflow.git
Diane Trout [Thu, 17 Oct 2013 21:55:14 +0000 (14:55 -0700)]
Dynamically respond to the number of lanes.

Illumina now has single lane flowcells so the assumption that there
will be 8 lanes is wrong.
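
A minimal sketch of the idea, not the actual htsworkflow code: iterate over whichever lanes the run reports instead of a hard-coded range of eight. The `lane_results` mapping and the `summarize_lanes` name are hypothetical.

```python
def summarize_lanes(lane_results):
    """Return (lane, cluster_count) pairs for every lane actually present.

    lane_results is a hypothetical mapping of lane number -> cluster count.
    """
    # Walk the keys that exist rather than assuming range(1, 9).
    return [(lane, lane_results[lane]) for lane in sorted(lane_results)]


single_lane = summarize_lanes({1: 1200})
eight_lane = summarize_lanes({n: 1000 + n for n in range(1, 9)})
```

The same function handles a single lane flowcell and an eight lane flowcell without a special case.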

diane [Wed, 16 Jan 2013 01:29:06 +0000 (17:29 -0800)]
Add more entries to genome version detection map.

Renamed hg19 to hg18.
Added saccer1, Anopheles_gambiae detection

Diane Trout [Wed, 12 Dec 2012 23:39:00 +0000 (15:39 -0800)]
Simplify linking fastq files to their library id.

Unlike my previous effort which required the fastq generation
script to generate dc:source entries to match fastqs to libraries,
this version just parses the generated fastq filename.

This does mean that a manually generated file might not work.

I accomplished this by writing a class to generate the
fastq (for submission) filenames and to parse them, so at least
all that code is in one place.

Also, after attaching the fastq metadata to the file node,
I discovered that the website's use of language tags on strings
made my query fail. So I changed toTypedNode to take an optional
language tag (defaults to "en").

Diane Trout [Sat, 8 Dec 2012 01:40:39 +0000 (17:40 -0800)]
Add a FastqName class to create and parse standardized fastq names.

I had a pretty standard naming convention for the fastq file names,
instead of duplicating the code for creating & parsing them,
I thought I should try to localize the code.

So I just added htsworkflow.submission.fastqname
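
A rough sketch of what keeping the create and parse logic in one class can look like. The filename layout, attribute names, and `parse` method here are assumptions for illustration; the real htsworkflow.submission.fastqname convention may differ.

```python
import re


class FastqName:
    """Build and parse fastq names from one shared (hypothetical) layout."""

    # Assumed layout: <library>_<flowcell>_l<lane>_r<read>.fastq
    TEMPLATE = "{lib_id}_{flowcell}_l{lane}_r{read}.fastq"
    PATTERN = re.compile(
        r"^(?P<lib_id>[^_]+)_(?P<flowcell>[^_]+)"
        r"_l(?P<lane>[^_]+)_r(?P<read>\d+)\.fastq$"
    )

    def __init__(self, lib_id, flowcell, lane, read):
        self.lib_id = lib_id
        self.flowcell = flowcell
        self.lane = lane
        self.read = read

    @property
    def filename(self):
        # Creation uses the same field names the parser extracts.
        return self.TEMPLATE.format(
            lib_id=self.lib_id,
            flowcell=self.flowcell,
            lane=self.lane,
            read=self.read,
        )

    @classmethod
    def parse(cls, filename):
        match = cls.PATTERN.match(filename)
        if match is None:
            # A hand-named file that ignores the layout ends up here.
            raise ValueError("not a standardized fastq name: %r" % filename)
        return cls(**match.groupdict())


name = FastqName("12345", "42JUYAAXX", "1", "1")
parsed = FastqName.parse(name.filename)
```

Because the template and the pattern sit next to each other, a change to the naming convention only has to be made in one place.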

Diane Trout [Thu, 29 Nov 2012 00:01:04 +0000 (16:01 -0800)]
Remove prints that were being called in test code.

It messes up my dots. Unfortunately the fastq validator is still
ugly. But at least its messages are a lot shorter.

There was one debugging function ls_tree in simulate_runfolder
that I renamed to print_ls_tree because I'll occasionally remember
to grep for print to find things I shouldn't commit.

Diane Trout [Wed, 28 Nov 2012 19:37:34 +0000 (11:37 -0800)]
Merge branch 'master' of mus.cacr.caltech.edu:htsworkflow

Diane Trout [Wed, 28 Nov 2012 19:34:40 +0000 (11:34 -0800)]
Changed name= to verbose_name= for LibraryType.

Strangely Django <1.4 didn't notice the error when importing
the fixtures. It was trying to use the name "Adapter Type"
as the database name, instead of the actual column name "name"
for the LibraryType table.

Diane Trout [Wed, 28 Nov 2012 19:22:19 +0000 (11:22 -0800)]
Merge branch 'master' of mus.cacr.caltech.edu:htsworkflow

Diane Trout [Wed, 28 Nov 2012 19:19:58 +0000 (11:19 -0800)]
Make extra sure Django's setup test environment is run.

When running under unit2 discover, the mail test was using
a real mail server as it didn't know to run Django's
setup_test_environment. This rather heavy-handedly runs
the django setup/teardown functions for the TestEmailNotify module.

Diane Trout [Wed, 28 Nov 2012 00:37:55 +0000 (16:37 -0800)]
Convert to unittest2

Test cases inherit from either unittest2 or django.test.TestCase
I should be able to use skip tests in the future.

I learned inheriting from django.test.TestCase will properly set up
the database for django tests. (Well at least mostly, I'm having
some possible errors on 1.4)

Diane Trout [Tue, 27 Nov 2012 22:28:08 +0000 (14:28 -0800)]
Merge branch 'master' of mus.cacr.caltech.edu:htsworkflow

Diane Trout [Tue, 27 Nov 2012 22:22:33 +0000 (14:22 -0800)]
Ignoring the missing type RDF error for the page under testing's url.

When testing, the stylesheet gets attached to the page's url. All
the meaningful information about the sample or experiment still
gets added to the right <host>/<category>/<id> pages,
unfortunately that means the page url doesn't have a type which
caused _validate_types to toss an error.

I'd previously fixed it by testing for the error message and
filtering it out from the test code, but that didn't work on
ubuntu 10.04 as the error message changes slightly with the
older version of redland rdf.

This version changes the sparql query to ignore the case
where the predicate is a stylesheet and there's no type.

Diane Trout [Tue, 27 Nov 2012 18:30:36 +0000 (10:30 -0800)]
Add dependencies to the setup.py

I do need to make benderjab public before other people
could install this. Or maybe figure out how to use the optional
dependency mode.

Diane Trout [Tue, 20 Nov 2012 22:37:49 +0000 (14:37 -0800)]
Test presence of species & species name on library index page.

Diane Trout [Tue, 20 Nov 2012 22:36:49 +0000 (14:36 -0800)]
Correctly implement merging Notification & Manager sets

Apparently I was off imagining functions that don't exist.

Diane Trout [Tue, 20 Nov 2012 22:31:05 +0000 (14:31 -0800)]
Merge branch 'master' of mus.cacr.caltech.edu:htsworkflow

Diane Trout [Tue, 20 Nov 2012 22:29:20 +0000 (14:29 -0800)]
Renamed various django tests.py to test_module.py

It appears by default py.test was looking for packages named test_,
since it wasn't finding the tests.py modules. This seemed like
a reasonable alternative convention.

Diane Trout [Tue, 20 Nov 2012 20:51:39 +0000 (12:51 -0800)]
Ignore .tox directory

I was experimenting with the virtualenv testing tool tox.testrun.org
and I might as well ignore its directory

Diane Trout [Tue, 20 Nov 2012 20:46:41 +0000 (12:46 -0800)]
Fix RDF schema problems with lane_number and species.

My RDF schema was using the term "species" both for the species
name and a species class -- which doesn't make sense. This
version of the schema introduces a species_name which can
attach either to the Library object or the Species object.

It's still a little inconsistent as I'm using it for both
"common name" and "scientific name". But hey, it's an improvement.

Also there's a tweak to the library_number type on the library detail
page setting the type to string instead of number, as I decided
it should be treated internally as an opaque identifier.
In theory someone might start naming lanes A,B,C,D or 1T, 1B
(for the top and bottom of a flowcell slide).

Finally I decided that the gel_cut should be of type integer.
Yes, decimal is the "more general type", but I'm using integer in
my sql schema so it's only going to return integers.

Diane Trout [Tue, 20 Nov 2012 20:41:57 +0000 (12:41 -0800)]
Run the library detail page through RDF validation.

Also ignore the missing type error message for http://localhost/
as that resource really shouldn't have a type.

This improved test does catch a few new model inconsistencies
which I'll fix in my next patch.

Diane Trout [Tue, 20 Nov 2012 06:16:08 +0000 (22:16 -0800)]
Change add_default_schema to use pkg_resources feature to find schemas.

I was trying to get py.test to work and it really wants to install
things, and my previous method to find the schema files wasn't working
very well with the egg distribution.

Diane Trout [Tue, 20 Nov 2012 01:09:47 +0000 (17:09 -0800)]
Add dependency information to the setup.py script

Though it's still missing a bit as I don't have benderjab
hosted and librdf needs to be installed separately.

Diane Trout [Tue, 20 Nov 2012 01:04:12 +0000 (17:04 -0800)]
rdf:Resource can be either a resource or a blank node.

Thus we should only toss an error in the case of a node being
a literal.

Diane Trout [Tue, 20 Nov 2012 00:50:49 +0000 (16:50 -0800)]
Uniquely merge BCC and Manager lists for sending notification email.

This uses a set so that each email address receives only one copy of a notification.
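
As an illustration only (the function name and addresses below are made up, and the real code may simply build a plain set), merging two recipient lists without duplicates can look like this:

```python
def merge_recipients(bcc, managers):
    """Return each address once, keeping first-seen order."""
    seen = set()
    merged = []
    for address in list(bcc) + list(managers):
        # The set gives a cheap "already added?" check.
        if address not in seen:
            seen.add(address)
            merged.append(address)
    return merged


recipients = merge_recipients(
    ["a@example.org", "b@example.org"],
    ["b@example.org", "c@example.org"],
)
```

Keeping the order is a choice made for this sketch; if order does not matter, `set(bcc) | set(managers)` is enough.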

Diane Trout [Fri, 16 Nov 2012 00:01:04 +0000 (16:01 -0800)]
Further attempts to validate RDF models.

I had a bug caused by lane numbers being language tagged strings,
and thus not being found by my sparql query.

I found a solution to filter based on just the contents of a string
ignoring the language tag. However I thought not only should I
make it easier to run my RDF model validation code, I should also
double check the literal types.

Previously I just tagged any literal as rdfs:Literal. For ones
that should have a known type, I've changed it to the xmlschema
types.

This patch doesn't actually fix the bug. Just introduces the
diagnostic tool.
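
For context, a sketch of the kind of filter described above. In SPARQL a language tagged literal such as "3"@en is not equal to the plain literal "3", but `str()` returns only the lexical form, so comparing on `str(?number)` ignores the tag. The prefix, predicate name, and helper below are hypothetical, not the real htsworkflow ontology terms.

```python
# Hypothetical namespace and predicate, used for illustration only.
LANE_QUERY = """
PREFIX ex: <http://example.org/ontology#>
SELECT ?lane
WHERE {
  ?lane ex:lane_number ?number .
  FILTER(str(?number) = "%(lane_number)s")
}
"""


def lane_query(lane_number):
    """Fill the lane number into the query text."""
    return LANE_QUERY % {"lane_number": lane_number}


query_text = lane_query(3)
```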

Diane Trout [Wed, 14 Nov 2012 19:30:35 +0000 (11:30 -0800)]
Add error message for typoing a result map filename.

As a gift to me when trying to do something while sleepy.

Diane Trout [Wed, 14 Nov 2012 00:38:20 +0000 (16:38 -0800)]
Merge branch 'master' of mus.cacr.caltech.edu:htsworkflow

Diane Trout [Thu, 11 Oct 2012 23:32:27 +0000 (16:32 -0700)]
Catch another lookup error.

Apparently one of gerald_summary[read][lane] was a dictionary
and so threw a KeyError instead of an IndexError. I might as well
catch both.
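
A small sketch of catching both errors in one place. The `lookup_lane` helper and its return value of None are assumptions; only the `gerald_summary[read][lane]` lookup comes from the message above.

```python
def lookup_lane(gerald_summary, read, lane):
    """Return the summary for a read/lane pair, or None if it is missing."""
    try:
        return gerald_summary[read][lane]
    except (KeyError, IndexError):
        # Lists raise IndexError and dictionaries raise KeyError;
        # either one means the entry is not there.
        return None


from_list = lookup_lane([[10, 20]], 0, 5)
from_dict = lookup_lane([{1: "ok"}], 0, 2)
found = lookup_lane([{1: "ok"}], 0, 1)
```

Both exceptions derive from `LookupError`, so `except LookupError` would also cover the two cases.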

Diane Trout [Thu, 27 Sep 2012 18:46:56 +0000 (11:46 -0700)]
older librdf wanted to include type information when showing query strings.
So I ran everything through fromTypedNode to convert the nodes
to simple python types.

However fromTypedNode wasn't intended to handle non-literal nodes
so I had to adjust it a bit to return resource nodes safely.

Diane Trout [Thu, 27 Sep 2012 17:37:56 +0000 (10:37 -0700)]
When collecting files for a geo submission, group on library id
instead of the previous grouping on lane.

Diane Trout [Tue, 25 Sep 2012 23:18:42 +0000 (16:18 -0700)]
This might actually generate soft file with raw & supplemental data.

To make working with the development server easier, I changed
the submission class to take a host which it will use to generate
the base library url.

When constructing URLs for files, I'm now using the actual path names
instead of synthesizing something based on the submission name.
This is to limit the amount of knowledge that needs to be passed
between the fastq generation code.

For fastq files it looks at the source file to find the flowcell
information. For supplemental files it looks at the submission
class for that analysis directory and grabs the library id
from there.

Diane Trout [Mon, 24 Sep 2012 23:43:33 +0000 (16:43 -0700)]
Merge changing lane_number to string and sequence finding code changes
I started using actual file paths instead of synthetic submission
paths for naming where my sequence files are.

This one still won't generate geo submissions correctly as I'm
pretty sure not all of the queries have been updated yet.

Diane Trout [Mon, 24 Sep 2012 23:37:30 +0000 (16:37 -0700)]
Add a log message for debugging

Diane Trout [Mon, 24 Sep 2012 23:34:44 +0000 (16:34 -0700)]
Define XHTML_RDF_DTD as None when we can't load the DTD

Diane Trout [Mon, 24 Sep 2012 22:28:10 +0000 (15:28 -0700)]
Make the public html pages valid xhtml, and validate more RDFa cases.

Also after I spent time playing with the w3c online validator,
I decided it was best to try and add modest validation to my
unit tests.

So now there's a validate_xhtml function in ethelp.

The one really weird thing is I tried to load the DTD
in the test case, however it looks like librdf clobbered the
XML catalog resolver at some point so the DTD resolver can't
find anything.

Diane Trout [Mon, 24 Sep 2012 22:26:14 +0000 (15:26 -0700)]
remove some dead commented out code.

Diane Trout [Thu, 20 Sep 2012 22:17:53 +0000 (15:17 -0700)]
Fix (some) missing closing tags.

Diane Trout [Thu, 20 Sep 2012 21:35:51 +0000 (14:35 -0700)]
Make a validation error message between different ages of librdf.
result.uri vs result again.

Diane Trout [Thu, 20 Sep 2012 21:26:46 +0000 (14:26 -0700)]
Also make the library index page conform to htsworkflow ontology.

Diane Trout [Thu, 20 Sep 2012 00:08:43 +0000 (17:08 -0700)]
Minor tweaks to deal with the older version of librdf on ubuntu 10.04
things like utf-8 escaping a string, using str(node.uri) instead
of str(node).

Diane Trout [Wed, 19 Sep 2012 23:10:57 +0000 (16:10 -0700)]
Use htsworkflow ontology to validate various RDF using components.
Of course to use the ontology I had to make one first.
Unsurprisingly implementing it touched a bunch of code & templates.

I tried to be more consistent with using mixed-case names for
classes and lower_case names for properties.

There are some inconsistencies, like how I use the terms notes & comments
in different areas. Also, should I be using my own terms or
do better at reusing more standard ontologies?

Diane Trout [Wed, 19 Sep 2012 23:04:35 +0000 (16:04 -0700)]
Refactor property type validator to support multiple classes for domain/range.
Also test to make sure we can have more than one domain/range statement.

Diane Trout [Wed, 19 Sep 2012 23:03:07 +0000 (16:03 -0700)]
Add stub xhtml vocab ontology, to make model validation quieter.
(the stylesheets got attached as a property of the library or flowcell)

Diane Trout [Tue, 18 Sep 2012 23:34:27 +0000 (16:34 -0700)]
Merge ssh://jumpgate.caltech.edu/var/htsworkflow/htsworkflow

Diane Trout [Tue, 18 Sep 2012 23:33:01 +0000 (16:33 -0700)]
email.bcc should be a list, not a nested list.
NOTIFICATION_BCC was already a list of options.

Diane Trout [Tue, 18 Sep 2012 18:36:16 +0000 (11:36 -0700)]
Merge branch 'master' of mus.cacr.caltech.edu:htsworkflow

Diane Trout [Tue, 18 Sep 2012 18:34:25 +0000 (11:34 -0700)]
Improvements to rdfinfer.

Add rule to infer class and subClassOf memberships,
add testing for the class case.

Add code to run all the validation rules.

Diane Trout [Tue, 18 Sep 2012 18:31:49 +0000 (11:31 -0700)]
Change rdfhelp.dump_model so you can specify a destination stream.

Diane Trout [Tue, 18 Sep 2012 18:20:26 +0000 (11:20 -0700)]
Progress using rdf model to link fastqs with flowcell/lib metadata.

I changed how I was using rdf:type -- the most raw data is now
a 'sequencer_result' and now there's a separate file_type
attribute to indicate what kind of result file it is.

I renamed find_missing_targets to update_fastq_targets as
in addition to finding what fastqs we need to generate it'll
also download missing flowcell information.

I'm still having trouble fishing out the fastq files so this isn't
ready yet.

Finally minor tweaks to the soft file formatting to try
and get it to render everything without spurious spaces.

Diane Trout [Tue, 18 Sep 2012 17:55:36 +0000 (10:55 -0700)]
Merge branch 'master' of mus.cacr.caltech.edu:htsworkflow

Diane Trout [Sat, 15 Sep 2012 05:49:29 +0000 (22:49 -0700)]
Start implementing inferring triples.

This includes utilities to import the common schemas, and
a bit of functionality for validating models, in
addition to the rule to compute inverseOf.

Diane Trout [Wed, 12 Sep 2012 18:35:37 +0000 (11:35 -0700)]
Be more defensive if the database is missing some data instead of
crashing on trying to access an empty list.

Diane Trout [Wed, 12 Sep 2012 18:34:54 +0000 (11:34 -0700)]
fix a wrong variable name

Diane Trout [Wed, 12 Sep 2012 18:32:55 +0000 (11:32 -0700)]
Tweaks to make the sample gather query faster.
Instead of creating a new library term for submissions, I
changed it back to using the standard library ontology term.
(making it easier to link things together).

Also apparently you can have nested optional statements.

Diane Trout [Wed, 12 Sep 2012 18:31:19 +0000 (11:31 -0700)]
Add a helpful error message when we need the submission name but it wasn't provided

Diane Trout [Wed, 12 Sep 2012 18:30:26 +0000 (11:30 -0700)]
Another tweak to allow parser detection to work on content-type text/plain urls

Diane Trout [Tue, 11 Sep 2012 23:01:46 +0000 (16:01 -0700)]
Better parser guessing when grabbing plain text from a webserver.
Also fix a typo in an error message

Diane Trout [Mon, 10 Sep 2012 21:16:08 +0000 (14:16 -0700)]
Use the same model for building fastqs as for building soft file
(And let user specify a base filename on command line)

Diane Trout [Mon, 10 Sep 2012 21:15:32 +0000 (14:15 -0700)]
Allow retrying retrieving rdf from website a few times.
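
A generic retry loop in the spirit of this change. The `retry` helper, its parameters, and the choice of IOError as the retryable error are all assumptions; the actual commit may retry differently.

```python
import time


def retry(fetch, attempts=3, delay=0.0):
    """Call fetch() until it succeeds or the attempts are used up."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    last_error = None
    for _ in range(attempts):
        try:
            return fetch()
        except IOError as error:
            # Remember the failure and pause before the next try.
            last_error = error
            time.sleep(delay)
    # Every attempt failed, so report the most recent error.
    raise last_error
```

A caller would pass in a zero-argument function that performs the download, for example a lambda wrapping whatever loads the RDF from the website.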

Diane Trout [Sat, 1 Sep 2012 02:59:52 +0000 (19:59 -0700)]
Import flowcell information into our model; this allows
our super library id & sequence & flowcell info query to work.

Diane Trout [Sat, 1 Sep 2012 02:56:55 +0000 (19:56 -0700)]
Have librdf open the urls in load_into_model instead of using python
As a side effect I ended up discovering that my parser type detection
code for file types was wrong, and now it has test code

I was trying to fix a bug where librdf 1.0.10.1-1.1ubuntu1 was
segfaulting when loading into the model but I couldn't figure
out what was doing it.

Diane Trout [Sat, 1 Sep 2012 02:53:03 +0000 (19:53 -0700)]
Automatically add the flowcell subdirectory to the sequence archive path

Diane Trout [Fri, 31 Aug 2012 21:39:45 +0000 (14:39 -0700)]
Older rdf library doesn't like unicode strings as nodes
so manually encode as utf8

Diane Trout [Fri, 31 Aug 2012 19:29:59 +0000 (12:29 -0700)]
Fix matching scanned sequence files to library IDs so it works for hiseq runs.
The previous version was keying off of flowcell/lane so if you
had multiple libraries from the same flowcell/lane all the sequences
would end up in one of the libraries.

Hopefully this fixes that. Though to do this I ended up changing
the whole structure of condorfastq to be based on updating an RDF model.
This depends on the sequence.py module changes of saving things to
rdf models -- and the new code to infer library ids at that layer.

Diane Trout [Fri, 31 Aug 2012 19:19:42 +0000 (12:19 -0700)]
Inherit from MutableMapping for ResultMap.
This gets me automatic methods.
I did it because my for x in resultmap wasn't working
correctly. It was using a list key (0) instead of a map key
(library id) and tossing a key error.

Also instead of using ResultMap.add_result(key, destination) I can do
ResultMap[key] = destination
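
A sketch of the pattern, written for Python 3 (`collections.abc.MutableMapping`; the Python 2 code of that era would import it from `collections`). The internal attributes are assumptions and not the real ResultMap layout. A class that defines `__getitem__` but no `__iter__` is iterated by calling `__getitem__(0)`, `__getitem__(1)`, and so on, which would explain the key error on 0 described above.

```python
from collections.abc import MutableMapping


class ResultMap(MutableMapping):
    """Map library ids to result directories, keeping insertion order."""

    def __init__(self):
        self._order = []      # library ids in the order they were added
        self._results = {}    # library id -> destination

    def __getitem__(self, key):
        return self._results[key]

    def __setitem__(self, key, value):
        if key not in self._results:
            self._order.append(key)
        self._results[key] = value

    def __delitem__(self, key):
        del self._results[key]
        self._order.remove(key)

    def __iter__(self):
        # Iterate over library ids, not integer positions.
        return iter(self._order)

    def __len__(self):
        return len(self._results)


result_map = ResultMap()
result_map["11111"] = "dir_a"
result_map["22222"] = "dir_b"
```

With these five methods defined, MutableMapping supplies `keys`, `items`, `values`, `get`, `update`, `in`, and the rest.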

Diane Trout [Fri, 31 Aug 2012 19:08:08 +0000 (12:08 -0700)]
Allow option to save/restore a sequence class to a RDF model.
(After doing this I started having dreams of some set of mixins
designed to persist data into different types of storage).

I also renamed the sql save to indicate that it's going to a SQL
database.

Also I renamed one of my simplify Uris to stripNamespace
to make it clearer what it was actually doing.

simplify_uri just returns the end of a uri -- regardless of type.
stripNamespace removes a specific namespace from a uri.
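
To make the difference concrete, here is a sketch built only from the two one-line descriptions above. The signatures, the snake_case name `strip_namespace`, and the return of None on a mismatch are assumptions; the real rdfhelp functions operate on RDF nodes and may behave differently.

```python
def simplify_uri(uri):
    """Return the last path component of a URI, whatever its namespace."""
    return uri.rstrip("/").rsplit("/", 1)[-1]


def strip_namespace(namespace, uri):
    """Remove one specific namespace prefix, or return None if it differs."""
    if not uri.startswith(namespace):
        return None
    return uri[len(namespace):]


library_uri = "http://example.org/library/12345/"
short = simplify_uri(library_uri)
local = strip_namespace("http://example.org/library/", library_uri)
wrong = strip_namespace("http://example.org/flowcell/", library_uri)
```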

Diane Trout [Tue, 28 Aug 2012 23:03:45 +0000 (16:03 -0700)]
Test RDFa encoded information for flowcell & libraries.
This should make sure that I have the right class type on flowcells
so I can reliably fish it out of a triple store.

Diane Trout [Tue, 28 Aug 2012 22:57:14 +0000 (15:57 -0700)]
Merge branch 'master' of mus.cacr.caltech.edu:htsworkflow

Diane Trout [Tue, 28 Aug 2012 22:56:22 +0000 (15:56 -0700)]
sanitize_literal had trouble with empty strings.
This fixes that and tests that edge case.

Diane Trout [Tue, 28 Aug 2012 22:48:50 +0000 (15:48 -0700)]
Attempt to come up with regexp to detect hiseq formatted files.
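
The message does not say which files the regexp targets. One plausible candidate is the CASAVA 1.8 fastq naming, `<sample>_<index>_L<lane>_R<read>_<set>.fastq.gz`; the sketch below assumes that layout and is not the pattern from the commit.

```python
import re

# Assumed CASAVA 1.8 style name, e.g. NA10831_ATCACG_L002_R1_001.fastq.gz
HISEQ_FASTQ = re.compile(
    r"^(?P<sample>.+)_(?P<index>[ACGTN]+|NoIndex)_L(?P<lane>\d{3})"
    r"_R(?P<read>\d)_(?P<set>\d{3})\.fastq(\.gz)?$"
)


def parse_hiseq_fastq(filename):
    """Return the named fields as a dict, or None if the name does not match."""
    match = HISEQ_FASTQ.match(filename)
    return match.groupdict() if match else None


hiseq = parse_hiseq_fastq("NA10831_ATCACG_L002_R1_001.fastq.gz")
older = parse_hiseq_fastq("s_1_1_sequence.txt")
```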

Diane Trout [Thu, 9 Aug 2012 19:53:32 +0000 (12:53 -0700)]
Fix some mismatches between variable names in python code and in templates

Diane Trout [Tue, 7 Aug 2012 02:13:49 +0000 (19:13 -0700)]
Create a lane to file name turtle data file
this required passing the lane ID information back through
the json api.

Diane Trout [Tue, 7 Aug 2012 02:01:51 +0000 (19:01 -0700)]
Simplify code to see if we already have a DataRun created.
Make sure we update the data runs when getting flowcell lane details.
Test to make sure we can't accidentally add more than one data run.
In the test code also change to assert from failUnless.

Diane Trout [Mon, 6 Aug 2012 21:26:26 +0000 (14:26 -0700)]
Merge branch 'master' of mus.cacr.caltech.edu:htsworkflow

Diane Trout [Mon, 6 Aug 2012 21:25:11 +0000 (14:25 -0700)]
Don't link to lane for the raw result files section.
The raw result file code for finding the lane ID was returning
a wrong value (first lane) which was causing trouble on
multi sample hiseq runs.

Diane Trout [Mon, 6 Aug 2012 21:17:01 +0000 (14:17 -0700)]
Merge branch 'master' of mus.cacr.caltech.edu:htsworkflow

Diane Trout [Mon, 6 Aug 2012 21:16:34 +0000 (14:16 -0700)]
Use default printer settings again.

(Removing it broke things)

Diane Trout [Mon, 6 Aug 2012 18:25:24 +0000 (11:25 -0700)]
Test proper setting of sequence project variable

Diane Trout [Thu, 2 Aug 2012 21:52:10 +0000 (14:52 -0700)]
Test more of the sequences class.

(And fix the bugs discovered with better test coverage)

Diane Trout [Thu, 2 Aug 2012 21:51:41 +0000 (14:51 -0700)]
Fix a few more formatting issues with the geo template

Diane Trout [Wed, 1 Aug 2012 23:19:50 +0000 (16:19 -0700)]
Merge branch 'master' of mus.cacr.caltech.edu:htsworkflow

Diane Trout [Wed, 1 Aug 2012 23:18:13 +0000 (16:18 -0700)]
GEO wanted both files for a paired lane to be on a single line.
This required a bit of formatting code to massage the
results of the sparql query into an acceptable form.
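
A sketch of that kind of massaging, assuming the query yields one row per file as (lane key, read number, filename). The row shape, the `pair_rows` name, and the ", " separator are assumptions; the separator GEO expects is not stated here.

```python
def pair_rows(rows):
    """Collapse (lane_key, read, filename) rows into one line per lane."""
    lanes = {}
    for lane_key, read, filename in rows:
        lanes.setdefault(lane_key, {})[read] = filename
    lines = []
    for reads in lanes.values():
        # Sort by read number so read 1 always precedes read 2.
        lines.append(", ".join(reads[read] for read in sorted(reads)))
    return lines


lines = pair_rows([
    ("lane1", 2, "a_r2.fastq"),
    ("lane1", 1, "a_r1.fastq"),
    ("lane2", 1, "b_r1.fastq"),
])
```

Single-ended lanes fall out of the same code as a line with one filename.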

Diane Trout [Fri, 27 Jul 2012 22:35:00 +0000 (15:35 -0700)]
Fix issues introduced when switching to the django templating system for fastq generation.
It needs to know where django is, and I didn't include a short form for
identifying flowcells.

Diane Trout [Wed, 25 Jul 2012 18:53:11 +0000 (11:53 -0700)]
Actually we want to capture the unanalyzed lanes if we can...

Diane Trout [Tue, 24 Jul 2012 19:39:34 +0000 (12:39 -0700)]
Get actual list of sequencers used for a library.
I do have pipeline version numbers available, but with the current
version of the query it would end up duplicating the sequencer
model number. (There are a lot more version combinations than
there are sequencers.)

Diane Trout [Mon, 23 Jul 2012 18:34:36 +0000 (11:34 -0700)]
Update to our current IPs. I'm tired of syncdb overwriting them.

Diane Trout [Thu, 19 Jul 2012 18:23:40 +0000 (11:23 -0700)]
Adapt building qseq or srf archives to use SampleKey identified lanes

Diane Trout [Thu, 19 Jul 2012 18:14:29 +0000 (11:14 -0700)]
Update save_raw_data to SampleKey api

Diane Trout [Thu, 19 Jul 2012 00:01:03 +0000 (17:01 -0700)]
Add in a few more genomes to detection code

Diane Trout [Tue, 17 Jul 2012 22:56:41 +0000 (15:56 -0700)]
Fix library viewing code to read HiSeq run xmls.

Also still need to fix the samples.view code

Diane Trout [Mon, 16 Jul 2012 22:09:00 +0000 (15:09 -0700)]
Remove default sequence-format so the new auto-detector code is actually called

Diane Trout [Sat, 14 Jul 2012 00:31:41 +0000 (17:31 -0700)]
Attempt to guess raw sequence type for a run.
Also will check for the Unaligned directory first
before looking for the BaseCalls directory, as there's still
a BaseCalls dir.
(It does this by checking for the aligned directory first, and
if it exists using the name in the aligned config file to find
the corresponding base call directory)

Diane Trout [Fri, 13 Jul 2012 23:31:48 +0000 (16:31 -0700)]
Fix a refactoring error

Diane Trout [Thu, 12 Jul 2012 18:16:17 +0000 (11:16 -0700)]
Look for RTAConfig.xml file generated by pipeline 1.12 era.

Diane Trout [Thu, 12 Jul 2012 17:59:19 +0000 (10:59 -0700)]
Use lane 1's matrix when pipeline was set to build matrix from all.
(HiSeq's pipeline doesn't make the lane-less matrix file)

Diane Trout [Thu, 12 Jul 2012 17:58:15 +0000 (10:58 -0700)]
Clean up flowcell ID detection and add support for reading HiSeq IDs
Update the quick and dirty report generator for SampleKey API

Diane Trout [Thu, 12 Jul 2012 17:54:37 +0000 (10:54 -0700)]
fix a variable name typo

Diane Trout [Thu, 12 Jul 2012 17:52:53 +0000 (10:52 -0700)]
Re-enable some suppressed tests and update them for the SampleKey API.
Also try to test getting the HiSeq flowcell ID out of RunInfo.xml

Diane Trout [Thu, 12 Jul 2012 17:49:44 +0000 (10:49 -0700)]
whitespace fix.