Collapse All
|1.0: Overview
The Genome Assembly Evaluation Metrics and Reporting (GAEMR) package is an assembly analysis framework composed a number of integrated modules. These modules can be executed as a single program to generate a complete analysis report, or executed individually to generate specific charts and tables. GAEMR standardizes input by converting a variety of read types to Binary Alignment Map (BAM) format, allowing a single input format to be entered into GAEMR’s analysis pipeline, hence enabling the generation of standard reports.
GAEMR’s analysis philosophy is centered on contiguity, correctness, and completeness -- how many pieces in an assembly composed of, how well those pieces accurately represent the genome sequenced, and how much of that genome is represented by those pieces. By performing over twenty different analyses based on these principles, GAEMR gives a clear picture of the condition of a genome assembly. For a broadly-defined list of these analyses, see the Features section below.
1.1: Features
The features of GAEMR are as follows:
- Modular design that allows all analyses to be created with a single command but is flexible enough to allow users to request specific reports
- Assembly statistics and details
- Read alignment statistics
- Kmer analysis
- Gap end reports
- Reference analysis
- Blast reports
- rRNA details
- HTML and text reporting formats
1.2: System Requirements & Dependencies
GAEMR and its modules are written in the Python programming language and run in a UNIX environment. The third-party packages below are required by GAEMR. Please contact your systems administrator to see if you have them installed.
Programming Language: Python 2.7 with Biopython, Matplotlib, Optparse, NumPy and Pysam modules
Alignment Software: BWA, Bowtie 2, Blast+ and Nucmer
SAM/BAM Utilities: Picard, SAMtools
Classification: RNAmmer, RDP-classifier 2.4, NCBI taxonomy dump
1.3: Installation
Follow the steps below to set up GAEMR:
- Download the GAEMR package from the download site.
- Move or copy the package to the Unix directory where you’d like GAEMR to be installed.
- Inside this directory, unpack the the file by typing “tar -xzvf GAEMR-1.0.0.tar.gz”. A directory called GAEMR-1.0.0 will be created.
- Read and follow the instructions in the README.txt file located inside the GAEMR-1.0.0 directory.
1.4: Next Steps
The GAEMR website is designed to be user-friendly and informative. Throughout the website, you will find words with tool-tip definitions to help users unfamiliar with bioinformatics and genomics jargon. Each page is composed of sections that can be expanded or collapsed to help users focus only on the sections of interest.
The The website is divided into the following pages to make accessing and learning how to use GAEMR as easy as possible:
1.0: Getting Started
This page, where you will find an overview of GAEMR, as well as features, system requirements, dependencies, and installation instructions.
2.0 User’s Guide
The User’s Guide is designed with a ‘how-to’ approach ideal for those just getting started with GAEMR. The guide contains basic usage instructions and tutorials on how to create and understand GAEMR output.
3.0 Reference Manual
The reference manual is intended for those familiar with GAEMR who want to access the full complement of GAEMR arguments or understand and execute individual GAEMR modules. Examples of all tables and charts produced by GAEMR can also be found here.
4.0 Downloads
The downloads section contains available versions of the GAEMR package as well as sample data used in the the User’s Guide tutorials.
1.5: Support
GAEMR is supported through the GAEMR-help mailing list. Changes are reported on the Change Log. Both may be accessed from the GAEMR homepage.