Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

A healthy taste of resources available, specifically for this course - not a comprehensive catalog.

Table of Contents

Sequencing Technologies

Community Resources

Getting started with Linux and NGS

Fastq analysis/manipulation

Alignment and aligners

  • Jeff Barrick's introduction to NGS presentation
  • Comparison of different aligners
  • Aligners
  • File formats
    • fastq format
    • The SAM (Sequence Alignment Map) format specification (pdf)

Alignment analysis

  • SAM (Sequence Alignment Map) format specification (pdf)
  • sam/bam tools
    • samtools - sam/bam conversion, flag filtering, bam sort/index
    • Picard - sam/bam utilities that are read-group aware
    • Translate SAM file flags - type in a decimal number to see which flags are set
  • SAMstat - produces detailed graphical statistics for sam/bam files.
  • BEDTools - region overlap, merge, coverage & much more, w/bed, bam, vcf, gff support

UCSC Genome Browser

Variant calling

  • The 1000 Genomes project - catalog of human genetic variants
  • Tools
  • Broad institute GATK - complex but powerful; used by 1000 Genomes
  • File formatsVCF (Variant Call

Format

...

Transcriptome analysis

...

converters and generation tools

Other courses with online tutorials

  • 2012 Next-Gen Sequence Analysis Workshop (Michigan State University) has similar tutorials to our course, but also includes introductions to using the Amazon EC2 where you can "rent" Linux machines (useful if you don't have access to TACC), Python, R, ChIP-Seq, etc.
    • maintained mRNA-seq protocols from this workshop here
    • A directory containing these tools can be found on stampede at /work/01063/abattenh/local/UCSC_utilities
  • Mason program for simulating second-generation sequencing reads.

Transcriptome analysis

Variant calling