Content Comparison

...

(NB: I had to "fix" bp_genbank2gff3.pl to work on our GenBank file, by removing a line in the Perl script that caused an error. If someone knows of an easier way to get a GFF file from a file downloaded from Genbank, please share!)

Expand

	Side-note on displaying BLAST results as GFF files in IGV or other browsers
	Side-note on displaying BLAST results as GFF files in IGV or other browsers

Another useful trick with either IGV or UCSC: displaying your own BLAST results: BioPerl allows for super-easy conversion from blast output to a gff file; IGV and the UCSC browser both understand GFF files. The short script bl2gff.pl does the conversion.

Let's use the blast result we had from the earlier test for the JAG1 gene to show you how. You'll need to provide the input file - it's the ".oNNNNNN" output file from your blast job.

Code Block

title	Displaying BLAST results in IGV or UCSC


grep '^gi' blast_jag1.o586038 > jag1_blast.out
module load perl
module load bioperl
/corral-repl/utexas/BioITeam/bin/bl2gff.pl jag1_blast.out > jag1_blast.out.gff

The resulting jag1_blast.out.gff can be moved to your local machine and opened in IGV. Load the human reference first though!

Copy files to your desktop

...

Check out the rbsA gene region? What's going on here?
What is going on in the pykF gene region? You might see red read pairs. What does that mean?
Go to coordinate 2,698,092. Compare the bowtie and BWA alignments. Can you explain what's going on here?
Go to coordinate 475,263. ompare the bowtie and BWA alignments. What's the story here?

Advanced exercise: human data scavenger hunt

Data from the CEU trio from the 1000 Genomes Project can be found directly from the Broad's server for IGV.

Find the dbSNP accession number for the SNP apparent in the two 1000 genomes project trios in the intron between exons 8 and 9 of the GABBR1 gene.

Steps:

Download and install the Integrative Genome Viewer from the Broad Institute.
Select "Human hg18" as the reference genome
Get some data: File -> “Load from Server…” -> 1000 genomes -> CEU and YRI trios
Find the gene and the right exons
Zoom in until you find the SNP
Load and look at the SNP track: File -> Load from server -> Annotations -> Variants -> dbSNP

This is whole genome coverage data; later we'll look at exome data.

Expand

	Answer
	Answer

rs29220

Version	Old Version 31	New Version 32
Changes made by	Jeffrey E Barrick	Scott Patrick Hunicke-Smith
Saved on	May 21, 2012	May 21, 2012

Versions Compared

Key

Copy files to your desktop

Advanced exercise: human data scavenger hunt