A sampling of resources chosen specifically for this course - not a comprehensive catalog.

Sequencing Technologies
Community Resources
Fastq analysis/manipulation
Alignment and aligners
- Comparison of different aligners
- Aligners
- File formats
  - fastq format
  - The SAM (Sequence Alignment/Map) format specification (pdf)
Alignment analysis
- SAM (Sequence Alignment/Map) format specification (pdf)
- sam/bam tools
- samtools - sam/bam conversion, flag filtering, bam sort/index
- Picard - sam/bam utilities that are read-group aware
- Translate SAM file flags - type in a decimal number to see which flags are set
- SAMstat - produces detailed graphical statistics for sam/bam files
- BEDTools - region overlap, merge, coverage and much more, with bed, bam, vcf and gff support
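
The flag translator listed above can also be approximated offline. The following is a minimal Python sketch, not the linked tool's actual implementation: the bit values and meanings follow the SAM format specification, while the names SAM_FLAGS and decode_sam_flag are hypothetical and chosen only for this illustration.

```python
# Bit values and meanings from the FLAG field of the SAM format specification.
# SAM_FLAGS and decode_sam_flag are illustrative names, not part of any tool.
SAM_FLAGS = [
    (0x1, "read paired"),
    (0x2, "read mapped in proper pair"),
    (0x4, "read unmapped"),
    (0x8, "mate unmapped"),
    (0x10, "read reverse strand"),
    (0x20, "mate reverse strand"),
    (0x40, "first in pair"),
    (0x80, "second in pair"),
    (0x100, "not primary alignment"),
    (0x200, "read fails quality checks"),
    (0x400, "read is PCR or optical duplicate"),
    (0x800, "supplementary alignment"),
]


def decode_sam_flag(flag):
    """Return the meanings of all bits set in a decimal SAM flag."""
    if not 0 <= flag < 4096:
        raise ValueError("SAM flags defined here cover 0 through 4095")
    # Test each defined bit against the flag with a bitwise AND.
    return [meaning for bit, meaning in SAM_FLAGS if flag & bit]


if __name__ == "__main__":
    # 99 = 64 + 32 + 2 + 1
    for meaning in decode_sam_flag(99):
        print(meaning)
```

For example, decode_sam_flag(4) reports only "read unmapped", which is the same bit that samtools flag filtering is commonly used to exclude.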
UCSC Genome Browser
Format converters and generation tools
- SRA (Sequence Read Archive) from NCBI
- UCSC file format conversion scripts - useful for converting wig and bed files to and from their corresponding binary formats.
  - Make sure you download the correct script for your operating system!
  - A directory containing these tools can be found on Stampede at /work/01063/abattenh/local/UCSC_utilities
- Mason - a program for simulating second-generation sequencing reads
Transcriptome analysis
- The Tuxedo pipeline: RNA-seq with TopHat/Cufflinks
Variant calling
- The 1000 Genomes project - catalog of human genetic variants
- Tools
- Broad Institute GATK - complex but powerful; used by 1000 Genomes
- File formats