A sampling of the resources available, selected specifically for this course. It is not a comprehensive catalog.

Community Resources
Sequencing Technologies
Fastq analysis/manipulation
Alignment and aligners
- Comparison of different aligners
- Aligners
- File formats
  - fastq format
  - The SAM (Sequence Alignment Map) format specification (pdf)
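As a quick illustration of the fastq format linked above, here is a minimal Python sketch of a reader. It assumes the common four-line record layout (no wrapped sequence lines) and Phred+33 quality encoding; older Illumina data may use a different offset. The function name `read_fastq` and the sample read are made up for the example.

```python
import io


def read_fastq(handle):
    """Yield (name, sequence, qualities) from four-line FASTQ records.

    Assumes unwrapped records and Phred+33 quality encoding.
    """
    while True:
        header = handle.readline()
        if not header:
            return  # end of file
        if not header.strip():
            continue  # skip blank lines between records
        seq = handle.readline().rstrip("\n")
        plus = handle.readline()
        qual = handle.readline().rstrip("\n")
        if not header.startswith("@") or not plus.startswith("+"):
            raise ValueError("malformed FASTQ record near: " + header.strip())
        if len(seq) != len(qual):
            raise ValueError("sequence and quality lengths differ")
        # Phred+33: quality score = ASCII code of the character minus 33
        yield header[1:].strip(), seq, [ord(c) - 33 for c in qual]


# Hypothetical one-read example held in memory instead of a file
example = io.StringIO("@read1\nACGT\n+\nII!I\n")
records = list(read_fastq(example))
print(records)
```

For real data, the tools listed in this section are a better choice than a hand-written parser; the sketch is only meant to show what the four lines of a record contain.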
Alignment analysis
- SAM (Sequence Alignment Map) format specification (pdf)
- sam/bam tools
  - samtools - sam/bam conversion, flag filtering, bam sort/index
  - Picard - sam/bam utilities that are read-group aware
  - Translate SAM file flags - type in a decimal number to see which flags are set
  - SAMstat - produces detailed graphical statistics for sam/bam files
  - BEDTools - region overlap, merge, coverage and much more, with bed, bam, vcf and gff support
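The flag translation mentioned above can also be done in a few lines of code. The sketch below decodes a decimal SAM flag into the bits defined in the SAM format specification. The function name `decode_sam_flag` and the short descriptions are illustrative choices, not part of any of the tools listed.

```python
# Bit values for the FLAG field, as defined in the SAM format specification.
# The descriptions are informal paraphrases.
SAM_FLAGS = [
    (0x1, "read paired"),
    (0x2, "read mapped in proper pair"),
    (0x4, "read unmapped"),
    (0x8, "mate unmapped"),
    (0x10, "read reverse strand"),
    (0x20, "mate reverse strand"),
    (0x40, "first in pair"),
    (0x80, "second in pair"),
    (0x100, "not primary alignment"),
    (0x200, "read fails quality checks"),
    (0x400, "PCR or optical duplicate"),
    (0x800, "supplementary alignment"),
]


def decode_sam_flag(flag):
    """Return descriptions of all bits set in a decimal SAM flag."""
    if not 0 <= flag <= 0xFFF:
        raise ValueError("SAM flag must be between 0 and 4095")
    return [name for bit, name in SAM_FLAGS if flag & bit]


# 99 = 1 + 2 + 32 + 64
for description in decode_sam_flag(99):
    print(description)
```

The same bit values are what samtools flag filtering operates on, so decoding a flag by hand is a useful check before filtering on it.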
Aligners and SAM/BAM manipulation
UCSC Genome Browser
File formats and conversion
- SAM format specification – http://samtools.github.io/hts-specs/SAMv1.pdf
  - essential reading for format conversions, which ChIP-seq analysis often requires
- Genome browser file formats – http://genome.ucsc.edu/FAQ/FAQformat.html
  - BED, bedGraph, narrowPeak and many more
- SRA (Sequence Read Archive) from NCBI
- UCSC file format conversion scripts - useful for converting wig and bed files to and from their corresponding binary formats
  - Make sure you download the correct script for your operating system!
  - A directory containing these tools can be found on stampede at /work/01063/abattenh/local/UCSC_utilities
- Mason program for simulating second-generation sequencing reads
Transcriptome analysis
- The Tuxedo pipeline: RNA-seq with TopHat/Cufflinks
Variant calling
- The 1000 Genomes project - catalog of human genetic variants
- Tools
  - Broad Institute GATK - complex but powerful; used by 1000 Genomes
- File formats