    ABYSS

    Jun 14, 2011

    Current version installed on Fourierseq: 1.2.7 (installed by SPHS, 6/1/11)

    Try "man abyss-pe" or "man ABYSS"

    Basics of running (straight from the README):

    • Single-end assembly
      Assemble short reads in a file named reads.fa into contigs in a
      file named contigs.fa with the following command:
      ABYSS -k25 reads.fa -o contigs.fa
      where -k is an appropriate k-mer length. The only method to find the
      optimal value of k is to run multiple trials and inspect the results.
      The following shell snippet will assemble for every value of k from 20
      to 40.
      for k in {20..40}; do
        ABYSS -k$k reads.fa -o contigs-k$k.fa
      done
      The maximum value for k is 64. This limit may be changed at compile
      time using the --enable-maxk option of configure. It may be decreased
      to 32 to decrease memory usage, which is particularly useful for large
      parallel jobs, or increased to 96.
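      One way to "inspect the results" of the trials above is to compare
      basic statistics for each contigs-k*.fa file. The following is only a
      sketch: fasta_stats is a hypothetical helper written for this page, not
      part of ABySS, and it assumes plain (uncompressed) FASTA output. It
      prints the number of contigs, the total length, and the N50 for each
      file.

```shell
#!/bin/sh
# Hypothetical helper (not part of ABySS): print contig count, total length,
# and N50 for one FASTA file.
fasta_stats() {
    # First awk: emit one sequence length per FASTA record.
    awk '
        /^>/ { if (len > 0) print len; len = 0; next }   # header: flush previous record
        { len += length($0) }                            # sequence line: accumulate
        END { if (len > 0) print len }                   # flush the last record
    ' "$1" | sort -rn | awk -v file="$1" '
        { lens[NR] = $1; total += $1 }
        END {
            # N50: length of the contig at which the running sum of the
            # longest-first lengths reaches half of the total length.
            half = total / 2
            for (i = 1; i <= NR; i++) {
                run += lens[i]
                if (run >= half) { n50 = lens[i]; break }
            }
            printf "%s\tcontigs=%d\ttotal=%d\tN50=%d\n", file, NR, total, n50
        }
    '
}

# Summarize every assembly produced by the k sweep.
for f in contigs-k*.fa; do
    [ -e "$f" ] || continue   # skip when the glob matches nothing
    fasta_stats "$f"
done
```

      A higher N50 is not proof of a better assembly, so treat these numbers
      as a first filter before looking at the contigs themselves.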
    • Paired-end assembly
      To assemble paired short reads in two files named reads1.fa and
      reads2.fa into contigs in a file named ecoli-contigs.fa, run the
      command:
      abyss-pe k=25 n=10 in='reads1.fa reads2.fa' name=ecoli
      where `k' is the k-mer length as before.
      `n' is the minimum number of pairs needed to consider joining two
      contigs. The optimal value for `n' must be found by trial.
      `in' specifies the input files to read, which may be in FASTA, FASTQ,
      qseq, export, SAM or BAM format, may be compressed with gz, bz2 or xz,
      and may be tarred.
      The assembled contigs will be stored in ${name}-contigs.fa.
      The suffix of the read identifier for a pair of reads must be one of
      '1' and '2', or 'A' and 'B', or 'F' and 'R', or 'F3' and 'R3', or
      'forward' and 'reverse'. The reads may be interleaved in the same file
      or found in different files. If the mates are in different files, it's
      highly recommended to place each pair of files adjacent on the command
      line and to use an even number of threads. Even if you are running on
      a single-processor machine, using two threads will help performance.
      Do not group together all the files containing the forward reads
      followed by all the files containing the reverse reads.
      Reads without mates should be placed in a file specified by the `se'
      (single-end) parameter. Reads without mates in the paired-end files
      will slow down the paired-end assembler considerably during the
      ParseAligns stage.
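      The file-ordering advice above can be sketched as a small script. This
      is an illustration under assumptions: pair_adjacent is a hypothetical
      helper, not part of ABySS, and the file names (lib1_1.fa, unpaired.fa,
      and so on) are invented. It interleaves a list of forward files with a
      list of reverse files so that each pair sits adjacent in the `in'
      parameter, then prints the resulting abyss-pe command as a dry run
      instead of executing it.

```shell
#!/bin/sh
# Hypothetical helper (not part of ABySS): interleave two space-separated
# file lists so that each forward file is immediately followed by its mate.
# File names containing spaces are not supported by this sketch.
pair_adjacent() {
    fwd=$1
    rev=$2
    out=
    set -- $rev                      # positional parameters now hold the reverse files
    for f in $fwd; do
        [ $# -gt 0 ] || { echo "more forward than reverse files" >&2; return 1; }
        out="${out:+$out }$f $1"     # append "forward reverse" as an adjacent pair
        shift
    done
    [ $# -eq 0 ] || { echo "more reverse than forward files" >&2; return 1; }
    printf '%s\n' "$out"
}

# Assumed file names, for illustration only.
in_files=$(pair_adjacent 'lib1_1.fa lib2_1.fa' 'lib1_2.fa lib2_2.fa')

# Dry run: print the command rather than running the assembler.
# Reads without mates go in the `se' parameter, as described above.
echo abyss-pe k=25 n=10 name=ecoli "in='$in_files'" "se='unpaired.fa'"
```

      Remove the leading echo only after checking that the printed command
      lists each pair of files side by side.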

    Bioinformatics Team (BioITeam) at the University of Texas