Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Code Block
titlecutadapt command for R1 sequences
/corral-repl/utexas/BioITeam/bin/cutadapt-1.3/bin/cutadapt -m 22 -Oa 10 -a AGATCGGAAGAGCACACGTCTGAACTCCAGTCACGATCGGAAGAGCACACGTCTGAACTCCAGTCAC Sample1_R1.fastq
Code Block
titlecutadapt command for R2 sequences
/corral-repl/utexas/BioITeam/bin/cutadapt-1.3/bin/cutadapt -m 22 -O 10 -a TGATCGTCGGACTGTAGAACTCTGAACGTGTAGA Sample1_R1.fastq  

Notes:

  • The -m 22 option says to discard any sequence that is smaller than 22 bases after trimming. This avoids problems trying to map very short, highly ambiguous sequences.the -O 10 option says not to trim 3' adapter sequences unless at least the first 10 bases of the adapter are ssen at the 3' end of the read. This prevents trimming short 3' sequences that just happen by chance to match the first few adapter sequence bases.

Paired end commands are a little more complicated:  

Code Block
titlecutadapt command for R2 sequences
/corral-repl/utexas/BioITeam/bin/cutadapt-1.3/bin/cutadapt -a GATCGGAAGAGCACACGTCTGAACTCCA -m 22 --paired-output tmp.2.fastq -o tmp.1.fastq  Sample1_R1.fastq Sample1_R2.fastq &
/corral-repl/utexas/BioITeam/bin/cutadapt-1.3/bin/cutadapt -a ATCGTCGGACTGTAGAACTCTGAACGTG -m 22 --paired-output trimmed.1.fastq -o trimmed.2.fastq tmp.2.fastq tmp.1.fastq &
rm tmp.1.fastq tmp.2.fastq
Expand
titleThe gory details on the *-a* adapter sequence argument

Please refer to https://wikis.utexas.edu/display/GSAF/Illumina+-+all+flavors for Illumina library adapter layout.

 

...