X-Git-Url: http://woldlab.caltech.edu/gitweb/?p=samtools.git;a=blobdiff_plain;f=samtools.1;h=d2c78f1b32aeb00f4487f7944122ef9c7f5d596d;hp=45e16123948c1e72ba7193a68500e4430cc0d1c4;hb=4a17fa7e1f91b2fe04ad334a63fc2b0d5e859d8a;hpb=b27e00385f41769d03a8cca4dbd71275fc9fa906 diff --git a/samtools.1 b/samtools.1 index 45e1612..d2c78f1 100644 --- a/samtools.1 +++ b/samtools.1 @@ -1,4 +1,4 @@ -.TH samtools 1 "6 July 2009" "samtools-0.1.5" "Bioinformatics tools" +.TH samtools 1 "2 September 2009" "samtools-0.1.6" "Bioinformatics tools" .SH NAME .PP samtools - Utilities for the Sequence Alignment/Map (SAM) format @@ -33,12 +33,11 @@ output (stdout). Several commands can thus be combined with Unix pipes. Samtools always output warning and error messages to the standard error output (stderr). -Samtools is also able to open a BAM (not SAM) file on a remote FTP -server if the BAM file name starts with `ftp://'. Samtools checks the -current working directory for the index file and will download the index -upon absence. Samtools achieves random FTP file access with the `REST' -ftp command. It does not retrieve the entire alignment file unless it is -asked to do so. +Samtools is also able to open a BAM (not SAM) file on a remote FTP or +HTTP server if the BAM file name starts with `ftp://' or `http://'. +Samtools checks the current working directory for the index file and +will download the index upon absence. Samtools does not retrieve the +entire alignment file unless it is asked to do so. .SH COMMANDS AND OPTIONS @@ -73,17 +72,34 @@ Approximately the maximum required memory. [500000000] .TP .B merge -samtools merge [-n] [...] - -Merge multiple sorted alignments. The header of -.I +samtools merge [-h inh.sam] [-n] [...] + +Merge multiple sorted alignments. +The header reference lists of all the input BAM files, and the @SQ headers of +.IR inh.sam , +if any, must all refer to the same set of reference sequences. +The header reference list and (unless overridden by +.BR -h ) +`@' headers of +.I in1.bam will be copied to -.I +.IR out.bam , and the headers of other files will be ignored. .B OPTIONS: .RS .TP 8 +.B -h FILE +Use the lines of +.I FILE +as `@' headers to be copied to +.IR out.bam , +replacing any header lines that would otherwise be copied from +.IR in1.bam . +.RI ( FILE +is actually in SAM format, though any alignment records it may contain +are ignored.) +.TP .B -n The input alignments are sorted by read names rather than by chromosomal coordinates @@ -304,9 +320,7 @@ samtools tview [ref.fasta] Text alignment viewer (based on the ncurses library). In the viewer, press `?' for help and press `g' to check the alignment start from a -region in the format like `chr10:10,000,000'. Note that if the region -showed on the screen contains no mapped reads, a blank screen will be -seen. This is a known issue and will be improved later. +region in the format like `chr10:10,000,000'. .RE @@ -408,6 +422,15 @@ _ Unaligned words used in bam_import.c, bam_endian.h, bam.c and bam_aux.c. .IP o 2 CIGAR operation P is not properly handled at the moment. +.IP o 2 +In merging, the input files are required to have the same number of +reference sequences. The requirement can be relaxed. In addition, +merging does not reconstruct the header dictionaries +automatically. Endusers have to provide the correct header. Picard is +better at merging. +.IP o 2 +Samtools' rmdup does not work for single-end data and does not remove +duplicates across chromosomes. Picard is better. .SH AUTHOR .PP @@ -419,4 +442,4 @@ specification. .SH SEE ALSO .PP -Samtools website: http://samtools.sourceforge.net +Samtools website: