Transcript Quantification with Kallisto

Transcript Quantification with Kallisto

Skip the mapping step with Kallisto  

*Thanks to Anna Battenhouse to the text and figures!

Kallisto is a tool from the Pachter lab that performs quanitfication of transcripts without requiring alignment. 

  • No explicit alignment to reference genome or transciptome 

  • Instead, uses “pseudoalignment” to transcriptome 

    • For each read, determine not where in each transcript it aligns, but rather which transcripts it is compatible with 

    • Simultaneously addresses 2 aspects of “multi-mapping” reads in traditional RNAseq pipelines 

      • Multiple possible genomic loci (addressed during alignment) 

      • Multiple possible transcripts of origin (addressed during quantification) 

  • Pseudoalignments are sufficient to quantify transcript abundances 

    • Expectation Maximization (EM) algorithm is applied to a “simple” RNAseq Likelihood function 

    • Report estimated abundances as Transcripts per Million (TPM) + counts 

No P-value reported or differential expression (DE) support, but… 

  • kallisto re-runs EM on multiple bootstrap re-samples to estimate variance 

  • then kallisto bootstraps are used by add-on sleuth DE package

 

Why Kallisto?

Speed and performance are greatly improved with Kallisto.