GATK 4 will be the next major version of GATK, bringing together well-established tools from the GATK and Picard codebases under a single simplified, streamlined framework, and enabling selected tools to be run in a massively parallel way on local clusters or in the cloud using Apache Spark. This package contains the latest alpha preview release.
This project is in an alpha development stage and is not yet ready for general use. Available documentation can be found in the GATK 4 Alpha forum and in the source code repositories (see further below).
All POSIX operating systems (Unix, Linux, Mac OS X, etc.) are supported. Microsoft Windows is not supported. The current version requires Java 8. Oracle Java is preferred; OpenJDK is not officially supported. A few tools require R 3.1.3 (mainly for plotting). An R script, install_R_packages.R, is available to install any R libraries not already present on your system. It can be invoked as follows:
Rscript install_R_packages.R
A frontend launching script is provided for convenience; it requires Python version 2.6 or greater.
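The requirements above can be checked before running any tools. The following is a minimal sketch, not part of the GATK package: the function names (java_major_version, r_version, check_prerequisites) are hypothetical, and it assumes a Python 3.7+ interpreter with java and Rscript on the PATH. It does not distinguish Oracle Java from OpenJDK.

```python
import re
import shutil
import subprocess


def java_major_version(version_output):
    """Extract the major Java version from `java -version` output.

    Handles the legacy scheme ("1.8.0_181" -> 8) and the newer scheme
    ("11.0.2" -> 11). Returns None if no version string is found.
    """
    match = re.search(r'version "(\d+)(?:\.(\d+))?', version_output)
    if match is None:
        return None
    first = int(match.group(1))
    second = match.group(2)
    if first == 1 and second is not None:
        return int(second)
    return first


def r_version(version_output):
    """Extract (major, minor, patch) from `Rscript --version` output."""
    match = re.search(r"version (\d+)\.(\d+)\.(\d+)", version_output)
    if match is None:
        return None
    return tuple(int(part) for part in match.groups())


def _run(command):
    """Run a command and return stdout and stderr combined as text."""
    try:
        result = subprocess.run(command, capture_output=True, text=True)
    except OSError:
        return ""
    # `java -version` and older Rscript builds print to stderr.
    return result.stdout + result.stderr


def check_prerequisites():
    """Return a list of human-readable problems; empty if none are found."""
    problems = []

    if shutil.which("java") is None:
        problems.append("java not found on PATH (Java 8 is required)")
    else:
        major = java_major_version(_run(["java", "-version"]))
        if major != 8:
            problems.append("Java 8 is required, found: %s" % major)

    # R is only needed by a few tools, so this is reported as a note.
    if shutil.which("Rscript") is None:
        problems.append("Rscript not found on PATH (needed by a few tools)")
    else:
        found = r_version(_run(["Rscript", "--version"]))
        if found is None:
            problems.append("could not determine the installed R version")

    return problems


if __name__ == "__main__":
    for problem in check_prerequisites():
        print("WARNING: " + problem)
```

Parsing is kept separate from the subprocess calls so that the version logic can be exercised on sample strings without Java or R installed.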
The GATK 4 Alpha package is open source under a BSD (3-clause) license.
The source code for the core GATK tools is available in the broadinstitute/gatk repository on GitHub.
Starting with the beta release, Docker images of all GATK4 versions will be available in the broadinstitute/gatk repository on Docker Hub.