Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Let's parse the bam file to pull out just spliced alignments.

Looking at the CIGAR string at the CIGAR string
Expand
AnswerAnswer
Code Block
titleLooking
for spliced alignments
 samtools view accepted_hits.bam | cut -f 1,6 | grep 'N'|head
The

The CIGAR string "66M76N9M" represents a spliced sequence. The codes mean:

  • 66M - the first 66 bases match the reference
  • 76N - there are then 76 bases on the reference with no corresponding bases in the sequence (an intron)
  • 9M - the last 17 bases match the reference

Exercise 2b: What is the read ID for one of these reads with spliced alignment? 

...