Atlassian uses cookies to improve your browsing experience, perform analytics and research, and conduct advertising. Accept all cookies to indicate that you agree to our use of cookies on your device. Atlassian cookies and tracking notice, (opens new window)
/
Genome Variant Analysis Course 2022

    Genome Variant Analysis Course 2022

    Jun 24, 2022

    Course Overview

    The past 2 years this course was presented online due to COVID19, and due in part to positive responses received by participants, this year's course will continue with that model. You should have received zoom contact information in your email which will be used for each day of the course. 

    The course is designed to have 2 ~90 minute sections per day for 5 days, with the goal of teaching you how to preform the standard next-generation sequencing analysis to identify genomic variants. This will be accomplished through: presentations covering information essential to all types of analysis, guided tutorials to reinforce the essential concepts, and optional self guided tutorials to help you learn the skills that are most specific to your own analysis. By the end of this course, we hope to achieve the following goals:

    1. Teach you different ways next generation sequencing libraries are constructed, and the advantages/disadvantages associated with the different types. 
    2. Familiarize you with how the Texas Advanced Computing Center (TACC) can be used to simplify and speed up your data analysis.
    3. Introduce you to common ways of installing programs useful for NGS analysis.
    4. Teach you the basics of read mapping in both individuals and populations, and identifying variants within individuals and rare variants within populations.
    5. Provide reference materials covering a breadth of material sufficient to give you a starting point of where to begin you own data analysis, and enough experience that you can begin that analysis on your own.

    Your Instructor

    Name

    Initials

    Affiliation

    Expertise

    Daniel Deatherage

    DD

    Barrick Lab

    Unix, Python, NGS Library Prep, Capture, Rare Variant Identification

    A nod to the past

    I think it important to acknowledge a great deal of help with creating these web pages and materials from previous instructors of the Intro to NGS Bioinformatics course taught in 2013 and the Genome Variant Analysis Course taught in 2014-2016. Two individuals warrant special mention, the former director of the GSAF Scott Hunicke-Smith, and Jeffrey Barrick were the driving force behind this class for a number of years, and many of the tutorials presented here were originally developed by them or adapted from their work.


    Verifying setup/access week of June 13th:

    In order to ensure as smooth an experience as possible, the week prior to the course, each participant needs to:

    1. Log into TACC.
    2. Provide their TACC ID to Dan.

    Depending on your operating system you should complete either the window or mac tutorial below. If you are having difficulties be sure to email Dan so a zoom session can be scheduled, and avoid having to use class time on administrative things that may not be resolved without additional help from people at TACC who will not be present in the class.

    Windows10

    MacOS

    Course Schedule

    Monday, June 20th. Day 1 – "The Basics"

    Presentation: General Course Introduction

    Tutorial: Introduction to Linux and Stampede2

    Presentation: Experimental Design

    Tutorial: Evaluating Raw Sequencing Data

    Tuesday, June 21st. Day 2 – "Principles of Variant calling"

    Presentation: Read Mapping

    Day 1 catchup

    idev session reminder

    Tutorial: Mapping Reads with bowtie2

    Presentation: Single Nucleotide Variant Calling

    Presentation: Structural Variant Calling

    Tutorial: Using samtools to identify SNVs

    Tutorial: Using SVDetect to identify SV

    Bonus Presentation: Read Mapping Details and File Formats

    Wednesday, June 22nd. Day 3 – Visualization and User specific tutorials

    Presentation: Errors: Where do they come from and how do we identify them as noise rather than signal?

    Bonus Presentation: Alternative Library Prep Methods - for when errors really do matter.

    Tutorial: Visualization: Integrated Genome Viewer Tutorial

    Tutorial: Visualization: Bacterial genome variants the easiest way – breseq


    At this point in the course, you have the basic tools that will help you regardless of what type of research you are involved in. The remainder of the course is full of topics that are more specific to different research areas. They are divided into broad categories to help you decide which ones you want to complete during the remaining time. If you are unsure just ask and I'll help identify ones which may be more applicable to your work.

    Bacterial  Centric Tutorials

    Tutorial: Advanced Breseq

    Tutorial: breseq with multiple references

    Tutorial: Evaluating Error Correction Using Breseq


    Human and Higher Eukaryote Centric Tutorials

    Tutorial: Human Trios Analysis

    Tutorial: Comparing Multiple samples

    Tutorial: GATK

    Tutorial: Exome Capture Metrics – with GATK

    Tutorial: Annotating with Annovar

    Method based Tutorials that may be of help regardless of sample type

    Tutorial: MultiQC - fastQC summary tool for multiple samples

    Tutorial: Read processing with fastp

    Tutorial: Genome Assembly

    Tutorial: Novel DNA identification

    Tutorial: Advanced mapping

    Tutorial: Error Correction (Molecular Indexing)

    Tutorial: Annotating plasmids with pLannotate

    Friday, June 24th. Day 5 – User specific tutorials (continued) and TACC the "normal" way

    The first half of today's class will be done as a continuation of tutorials that you are most interested in. As was the case yesterday, choose your own tutorial, and please don't hesitate to ask what tutorials would be good for you to be working on given your data! After the break, we will be go over a brief review to put things back in prospective and give you a tutorial on how to do things the 'normal way' on TACC which means using the job submission system and commands files before giving you any remaining time to go through tutorials and ask any remaining questions.

    Tutorial: End of class review and data collection

    Presentation: Genome Variant Analysis Review

    , multiple selections available,

    Confluence Documentation | Web Privacy Policy | Web Accessibility

    University Wiki Service

    Bioinformatics Team (BioITeam) at the University of Texas
    • File lists
      File lists
       This trigger is hidden
    • How-to articles
      How-to articles
       This trigger is hidden
    Results will update as you type.
    • Bioinformatics Services
    • CBRS Mini Symposium
    • BCG Full Service Pipelines
    • zArchive
      • TACC stuff
      • Old Home page
      • Old BCG Full Service Pipelines
      • Training
        • Bioinformatics Courses and Content
          • Genome Variant Analysis Course 2014
          • Genome Variant Analysis Course 2015
          • Genome Variant Analysis Course 2016
          • Genome Variant Analysis Course 2017
          • Genome Variant Analysis Course 2019
          • Genome Variant Analysis Course 2020
          • Genome Variant Analysis Course 2021
          • Genome Variant Analysis Course 2022
            • Accessing stampede2 via ssh on MacOs GVA2022
            • Accessing stampede2 via ssh on Windows10 GVA 2022
            • Advanced bowtie2 -- GVA2022
            • Advanced breseq GVA2022
            • Annotating plasmids with pLannotate
            • Annovar 2022
            • Bedtools tutorial -- GVA2022
            • Breseq with multiple references GVA2022
            • Evaluating and processing raw sequencing data GVA2022
            • Exome Capture Metrics GVA2022
            • fastp - GVA2022
            • Genome Analysis Toolkit (GATK) . -- GVA2022
            • Genome Assembly (SPAdes) -- GVA2022
            • GVA2022 - Class Review
            • GVA2022 Day 1 Environment Checkup and Catchup
            • GVA Tutorial Template
            • Human Trios -- GVA2022
            • IGV Tutorial -- GVA2022
            • Launching an iDev session GVA2022
            • Linux and stampede2 Setup -- GVA2022
            • Molecular Index Error correction GVA2022
            • MultiQC - fastQC summary tool -- GVA2022
            • Novel DNA Identification -- GVA2022
            • Prokka Annotations -- GVA2022
            • Read Mapping with bowtie2 Tutorial GVA2022
            • Single Nucleotide Variant (SNV) calling Tutorial GVA2022
            • SSCS vs Trimmed Read Variant calls GVA2022
            • Stampede2 Breseq Tutorial GVA2022
            • Structural Variant (SV) calling with SVdetect 2022
          • Introduction to NGS Short Course 2015
          • Introduction to RAD-seq 2018
          • Introduction to RAD-seq short course 2017
          • Introduction to RNA Seq Course 2014
          • Introduction to RNA Seq Short Course Commands
          • Introduction To TACC Short Course Cheats
          • NGS RNA-seq exploration short course
          • Introduction to TACC (Video Tutorials):
          • Genome Variant Analysis Course 2023
          • Introduction to RNA Seq Course
        • SSC Intro to NGS Bioinformatics Course
      Calendars

    You‘re viewing this with anonymous access, so some content might be blocked.
    {"serverDuration": 12, "requestCorrelationId": "d925cd80218b43a897cf46d525a7e6a4"}