Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migrated to Confluence 5.3

Overview

...

In the previous tutorial we ran breseq with two different options to get a feel for the type of filtering breseq does by default. Here we will visualize these differences.

Learning Objectives

...

  • Use genome diff files to generate a comparison table to see differences between samples (or in this case different types of analysis).

Tutorial:

Breseq naturally creates numerous intermediate files and directories as it runs to facilitate picking up where it left off if the process is interrupted before it can finish. Therefor Therefore, most of these files can safely be ignored once the run is complete. The exceptions to this are the "output" directory  directory (which contains the human readable files generated during the run), and the "data" directory  directory (which can be used to generate additional figures for mutation determination). Information and directions for creating additional figures can be found in this shameless self promoting link. While the *.html files  files generated by breseq provide a good human visualization of how the mutation was called, .html files are not very useful computationally. Conversely, genome diff files are very useful computationally and human readable (although not as visually stimulating as the . html files).

In our previous tutorial we sent our output to a folder titled "breseq_output" for  for both the with and without_stats conditions. Within each of these folders is a genome diff file titled "output.gd" that  that we will use to generate a new html table comparing the different mutations called in each set of conditions. The command that we will use is installed with the breseq installation in the BioITeam community resources and is can be invoked using "gdtools compare". See if you can generate a file named "stat_comparison.html".

Expand
titleIf you need a hint without the answer click the triangle...
Code Block
gdtools compare -h

the above will display instructions stating that gdtools compare needs a genebank (gbk) reference file, and .gd files to compare. The -o command is used to direct the output to a specific file name and/or location.

Expand
titleIf you are still stuck and want the answer click the triangle...

This is all one line!

Code Block
gdtools compare -o stat_comparison.html -r reference/REL606.6.gbk breseq_output/with_stats/output/output.gd breseq_output/without_stats/output/output.gd

All of the necessary files to complete our analysis have now been generated, but not in the most usable forms. In order to facilitate an easier transfer of the breseq output two small helper scripts have been placed in the $BI/gva_course/mixed_population/helper_scripts folder  folder (export-breseq and  and batch-run.pl). Copy both files into the breseq_output directory  directory and then run the export-breseq script by just typing its name from within the breseq_output directory directory.

Expand
titleClick here for the step by step
Code Block
cp $BI/gva_course/mixed_population/helper_scripts/export-breseq breseq_output
cp $BI/gva_course/mixed_population/helper_scripts/batch_run.pl breseq_output
cd breseq_output
export-breseq

You should now have 2 files that now need to be  to copied to your local computer to visualize the data: 05_Output_Export.tar.gz and  and stat_comparison.html. Alternatively, these files can be downloaded from the barricklab server at: These can be opened in a web browser. Do so and explore the output!