Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

A healthy taste of resources available, specifically for this course - not a comprehensive catalog.

Table of Contents

Community Resources

...

Linux/TACC

Community Resources

 

Sequencing Technologies

 

Sequencing Technologies

...

      • – long (multi-Kb) reads often used in assemblies

FASTQ analysis/manipulation/QC

Alignment and aligners

Reference genomes

Basic alignment and aligners

  • File formats
    • input: FASTQ format
    • output: the SAM (Sequence Alignment Map) format specification (pdf)

Alignment analysis

  • SAM (Sequence Alignment Map) format specification (pdf)
  • sam/bam tools
    • samtools - sam/bam conversion, flag filtering, bam sort/index
    • Picard - sam/bam utilities that are read-group aware
    • Translate SAM file flags - type in a decimal number to see which flags are set
  • SAMstat - produces detailed graphical statistics for sam/bam files.
  • BEDTools - region overlap, merge, coverage & much more, w/bed, bam, vcf, gff support

...

  • Aligners
  • samtools http://samtools.sourceforge.net/
    • also by Heng Li
  • Picard toolkit – http://broadinstitute.github.io/picard/
  • bedtools http://bedtools.readthedocs.org/en/latest/
    • Swiss army knife for all manner of common BED/BAM file manipulation such as:
      • region filtering
      • generation of per-base genome-wide signal (bedGraph)
      • intersecting peaks with annotation files
      • extracting fasta corresponding to regions
  • Anna
      • fast, sensitive and extremely configurable
  • Comparison of different aligners
    • by Heng Li, developer of bwa, samtools, and many other bioinformatics tools
  • The BioITeam has some TACC-aware alignment scripts you might find useful:
    • bwa alignment
      • /work/01063projects/abattenhBioITeam/codecommon/script/align/align_bwa_illumina.sh
    • bowtie2 alignment
      • /work/01063projects/abattenhBioITeam/codecommon/script/align/align_bowtie2_illumina.sh
    • merging sorted BAM files (read-group aware)
      • /work/01063projects/abattenhBioITeam/codecommon/script/align/merge_sorted_bams.sh
    • kallisto pseudo-alignment to annotated transcripts
      • /work/projects/BioITeam/common/script/run_kallisto.sh
    • also available on many BRCF pods under /mnt/bioi/script.
      • many pre-built references also available in /mnt/bioi/ref_genome
    • email or come talk to me Anna if you have questions or problems

UCSC Genome Browser

Transcriptome-aware aligners

Alignment analysis

File formats and conversion

...

    • Also available as a BioContainers module

UCSC Genome Browser

RNAseq/Transcriptome analysis

Variant calling

Variant calling

...

Genome Annotation