runBWA Pipeline

The runBWA.sh and runBWA_mem.sh pipelines are available on Lonestar at:

/corral-repl/utexas/BioITeam/bin/runBWA.sh

/corral-repl/utexas/BioITeam/bin/runBWA_mem.sh

The pipelines do the following:

  • Split the input data files into smaller chunks.
  • Run multiple BWA instances in parallel (aln+sampe for runBWA.sh, mem for runBWA_mem.sh).
  • Concatenate the results into a single output file.
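
The split and concatenate steps can be sketched in shell roughly as follows. This is a minimal illustration under stated assumptions, not the contents of runBWA.sh: the function names split_fastq and concat_sam are hypothetical, the alignment step is omitted, and the input is assumed to be a standard FASTQ file with four lines per read. For paired-end data, applying the same split to R1 and R2 keeps mates in matching chunks, assuming both files list the reads in the same order.

```shell
#!/usr/bin/env bash
# Sketch only: these function names are hypothetical, not taken from runBWA.sh.

# Split a FASTQ file into at most N chunks, keeping each 4-line record intact.
# Usage: split_fastq reads.fastq N chunk_prefix
split_fastq() {
    local fastq=$1 chunks=$2 prefix=$3
    local total_lines records per_chunk
    total_lines=$(wc -l < "$fastq")
    records=$(( total_lines / 4 ))
    # Ceiling division so that no more than $chunks files are produced.
    per_chunk=$(( (records + chunks - 1) / chunks ))
    [ "$per_chunk" -gt 0 ] || per_chunk=1
    # split names the pieces <prefix>aa, <prefix>ab, ...
    split -l $(( per_chunk * 4 )) "$fastq" "$prefix"
}

# Concatenate per-chunk SAM files: keep the header (lines starting with "@")
# from the first file only, then append the alignment records of every file.
# Usage: concat_sam out.sam chunk1.sam chunk2.sam ...
concat_sam() {
    local out=$1
    shift
    grep '^@' "$1" > "$out" || true        # a SAM file may have no header
    local sam
    for sam in "$@"; do
        grep -v '^@' "$sam" >> "$out" || true
    done
}
```

Splitting on a multiple of four lines avoids cutting a read in half, and taking the header from only the first chunk keeps the merged SAM file from repeating it once per chunk.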

Inputs:

  • R1 fastq file
  • R2 fastq file

  • Prefix of BWA reference index (the absolute path)
  • Number of chunks to split the input into
  • Output directory
  • TACC allocation

Outputs:

  • rs.cat.sam - mapping output in SAM format

Run the pipeline script on the head node; it submits all of its jobs to the compute nodes.