Atlassian uses cookies to improve your browsing experience, perform analytics and research, and conduct advertising. Accept all cookies to indicate that you agree to our use of cookies on your device. Atlassian cookies and tracking notice, (opens new window)
/
Practical advice - short read re-sequencing data

    Practical advice - short read re-sequencing data

    May 20, 2012
    • Inconsistent alignment at indels
      1. Example 1: ybaL mutation at 475,292 in REL8593A sample.
      2. Example 2:
    • Misalignment across structural variants
      1. Example 1: gltB mutation at 3,289,962 in REL8593A sample?
      2. Example 2: rbs mutation at 3,289,962 in REL8593A sample?
    • Mismapping of reads not present in reference genome
    • Dark matter: repetitive genomic regions
    • Reference-related issues:
      • Chromosome names MUST MATCH EXACTLY in all input data files - reference genome, genes, SNP databases, etc. Don't assume these all follow one convention. It's common to find chromosomes simply numbered 1, 2, 3, etc. It's also common to find them named "chr1", "chr2", "chr3"...
      • Some files, such as BED files, call the first base of a chromosome "base 0". Others, like SAM/BAM files, call the first base "base 1". The UCSC genome browser maintains a nice list of these details.
      • For the human genome, this web site has a nice cheat sheet in case you get in trouble.
    , multiple selections available,

    Confluence Documentation | Web Privacy Policy | Web Accessibility

    University Wiki Service

    Bioinformatics Team (BioITeam) at the University of Texas
    • File lists
      File lists
       This trigger is hidden
    • How-to articles
      How-to articles
       This trigger is hidden
    Results will update as you type.
    • Bioinformatics Services
    • CBRS Mini Symposium
    • BCG Full Service Pipelines
    • zArchive
      • TACC stuff
      • Old Home page
      • Old BCG Full Service Pipelines
      • Training
        • Bioinformatics Courses and Content
        • SSC Intro to NGS Bioinformatics Course
          • Lonestar Profile
          • Diagram of Lonestar's directories
          • Diagram of running a job on Lonestar
          • Evaluating your raw sequencing data
          • Mapping tutorial
          • Integrative Genomics Viewer (IGV) tutorial
          • Workflow diagram of variant calling
          • Getting started with Unix and Perl
          • Variant calling tutorial
          • Visualize mapped data at UCSC genome browser
          • Annotating Variants
          • Installing Linux tools
          • Shell Script
          • Mapped read data evaluation (SAMtools)
          • Calling variants in diploid or multiploid genomes
          • Variant calling with GATK
          • Genome variation in mixed samples (FreeBayes, deepSNV)
          • Identifying structural variants (SVDetect)
          • Practical advice - short read re-sequencing data
          • SRA toolkit
          • Differential gene expression analysis
          • Differential expression with splice variant analysis aug2012
          • Identifying mutations in microbial genomes (breseq)
          • non-coding RNA analysis
          • Genome Assembly
          • Genome Assembly (velvet)
          • Genome Annotation (Glimmer3)
          • Evaluating & Visualizing assemblies
          • Custom Genome Databases
          • Transcriptome assembly & annotation
          • Scott's list of linux one-liners
          • Exercises
          • Introduction to genome variation
          • instructor action item list
          • General introduction
          • Recap and "for further study"
          • Handling Sequences Overview
          • As you're getting settled
          • Editing files
          • Installing Linux virtual machine on Windows
          • Installing Virtual machine & Linux on Windows
          • Key take home points
          • Linux final
          • Linux start
          • Samtools tricks
          • Start tophat by submitting to lonestar
          • Tutorial - Start diploid mapping for Day 2
          • Using SFTP for file browsing on Linux.
        • BME 383J Course Content
        • Obsolete NGS course materials
        • Relevant classes offered at UT System Schools
        • Short Training Topics
      • Software
      • How to join the BioITeam
      • File lists
      • Bioinformatic Jobs at UT
      • Wish list
      • How-to articles
      • Appsoma-based pipeline development
      Calendars

    You‘re viewing this with anonymous access, so some content might be blocked.
    {"serverDuration": 12, "requestCorrelationId": "242c315db6bc4e1cbe7741bc1bf06bc8"}