...
(NB: I had to "fix" bp_genbank2gff3.pl to work on our GenBank file, by removing a line in the Perl script that caused an error. If someone knows of an easier way to get a GFF file from a file downloaded from Genbank, please share!)
| Expand | |||||
|---|---|---|---|---|---|
| |||||
Another useful trick with either IGV or UCSC: displaying your own BLAST results: BioPerl allows for super-easy conversion from blast output to a gff file; IGV and the UCSC browser both understand GFF files. The short script Let's use the blast result we had from the earlier test for the JAG1 gene to show you how. You'll need to provide the input file - it's the ".oNNNNNN" output file from your blast job.
The resulting jag1_blast.out.gff can be moved to your local machine and opened in IGV. Load the human reference first though! |
Copy files to your desktop
...
- Check out the rbsA gene region? What's going on here?
- What is going on in the pykF gene region? You might see red read pairs. What does that mean?
- Go to coordinate 2,698,092. Compare the bowtie and BWA alignments. Can you explain what's going on here?
- Go to coordinate 475,263. ompare the bowtie and BWA alignments. What's the story here?
Advanced exercise: human data scavenger hunt
Data from the CEU trio from the 1000 Genomes Project can be found directly from the Broad's server for IGV.
Find the dbSNP accession number for the SNP apparent in the two 1000 genomes project trios in the intron between exons 8 and 9 of the GABBR1 gene.
Steps:
- Download and install the Integrative Genome Viewer from the Broad Institute.
- Select "Human hg18" as the reference genome
- Get some data: File -> “Load from Server…” -> 1000 genomes -> CEU and YRI trios
- Find the gene and the right exons
- Zoom in until you find the SNP
- Load and look at the SNP track: File -> Load from server -> Annotations -> Variants -> dbSNP
This is whole genome coverage data; later we'll look at exome data.
| Expand | ||||
|---|---|---|---|---|
| ||||
rs29220 |