Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Here is an honest write-up about the Ts/Tv metricreferencing this 2002 paper by Ebersberger.  Bottom line: between about 2.1 and 2.8 is OK for Ts/Tv (also called Ti/Tv). You can also get some quick stats with some linux one-liners on this page; there are more thorough analysis programs built to work with vcf's.

 

Code Block
titleADVANCED linux 1 liners to look at summary stats (indels and het vs. hom. alleles)
collapsetrue
grep -c INDEL trios_tutorial.raw.vcf  # count the number of INDELs in the data
# expected result: 3432
tacc:$SCRATCH/BDIB_Human_tutorial/single-sample-variant-speedup$ cat trios_tutorial.raw.vcf | awk 'BEGIN {FS=";"} {for (i=1;i<=NF;i++) {if (index($i,"AF1")!=0) {print $i} }}' | \
awk 'BEGIN {FS="="} {print int($2*10)/10}' | sort | uniq -c | sort -n -r | head
  36757 1
  19404 0.5
   1819 0.4
     36 0.6
     17 0.8
     16 0.7
      1 0