GATK 2.5 was released on April 30, 2013. Highlights are listed below. Read the detailed version history overview here: http://www.broadinstitute.org/gatk/guide/version-history

Reduce Reads

  • DRASTIC improvements in the compression algorithm plus myriad bug fixes. Too many to list here; see detailed version history for more information.

Unified Genotyper

  • Fixed bug for indel calling with really long reads (assigning the wrong genotypes).
  • Automatic contamination fixing now works on reduced reads.
  • Fixed rare bug in the general ploidy SNP likelihood model when there are no informative reads in a pileup.
  • Fixed bug where haplotypes with 0 bases were being created.
  • Fixed problem where our internal PairHMM was generating positive likelihoods.

Haplotype Caller

  • Comprehensive performance improvements to the accuracy of calling both SNPs and indels; runtime is also much improved (but still slower than the Unified Genotyper; we expect it to be faster than UG in the next release though). See detailed version history for more information.
  • Fixed bug for calling on reduced reads (counts were not being assigned correctly).
  • Fixed problem where our internal PairHMM was generating positive likelihoods.
  • Can now write BAMs showing the assembled haplotypes.

Diagnose Targets

  • Significantly refactored this tool; it now works with a "plugin" system (see documentation for more information).
  • Fixed bug where LOW_MEDIAN_COVERAGE was output when no reads are covering the interval.
  • Fixed bug where intervals were skipped when they were not covered by any reads.

Base Recalibrator

  • Fixed the tool to work correctly with empty BQSR tables.
  • Fixed issue where Print Reads was running out of disk space when using the -BQSR option even for small bam files.
  • Fixed bug for RNA seq alignments with Ns.

Select Variants

  • Fixed bug where using the --exclude_sample_file argument was giving bad results.
  • Fixed bug when using the --keepOriginalAC argument which caused it to emit bad VCFs.
  • Fixed bug where maxIndelSize argument wasn't getting applied to deletions.

Variant Annotator

  • Added support for snpEff "GATK compatibility mode".
  • Can now list available annotations by doing java -cp GenomeAnalysisTK.jar org.broadinstitute.sting.tools.ListAnnotations
  • QualByDepth remaps QD values > 40 to a gaussian around 30.
  • Removed several deprecated annotations (AverageAltAlleleLength, MappingQualityZeroFraction, and TechnologyComposition) and others are no longer marked as experimental.

Variant Filtration

  • Don't allow users to specify keys and IDs that contain angle brackets or equals signs (which are not allowed in the VCF specification).
  • Added feature that allows one to filter sites outside of a given mask.

Left Align Variants

  • Renamed to LeftAlignAndTrimVariants.
  • Added ability to trim common bases in front of indels before left-aligning.
  • Added ability to split multiallelic records and then left align them.

Miscellaneous

  • We removed the auto-creation of fai/dict files for fasta references because it was too buggy.
  • Fixed bug where we could fail to find the intersection of unsorted/missorted interval lists.
  • Fixed @PG tag uniqueness issue with BAMs we were producing.
  • Fixed rare bug in GenotypeConcordance for multi-allelic sites.
  • Added check for reads without stored bases (i.e. that use '*') which we do not support.
  • Added support to reduce reads to CallableLoci.
  • Added a new walker to split MNPs into their allelic primitives (SNPs).
  • We no longer allow the use of compressed (.gz) references in the GATK.
  • Picard/Tribble/Variant jars updated to version 1.90.1442.

Geraldine_VdAuwera


FYI -- at time of posting, the release is in the pipe, so the packages should be available for download by the end of the day if all goes well. Version highlights will be available within a couple of days.

Tue 30 Apr 2013

sbaheti


Is there a difference between version GATK 2.5 GATK 2.5-2, if there is any difference then where can i find the version differences b/w the two. Thanks Saurabh

Tue 30 Apr 2013

Geraldine_VdAuwera


No, 2.5-2 was the first publicly released version of GATK 2.5. Sometimes there are jumps between -dash version numbers that are available to the public due to technical reasons.

Tue 30 Apr 2013

droazen


The only difference between GATK 2.5 and 2.5-2 is some post-release cleanup we did related to our test infrastructure. You can see the exact changes in the commit logs to our github repository: [https://github.com/broadgsa/gatk-protected/commits/master](https://github.com/broadgsa/gatk-protected/commits/master) The topmost commit is 2.5-2 (the current release), the one below that is 2.5-1, and version 2.5 is the third commit from the top.

Tue 30 Apr 2013




At a glance



Follow us on Twitter

GATK Dev Team

@gatk_dev

RT @broadinstitute: .@NIH funds available for investigators in need of cloud computing & storage resources. Apply: https://t.co/IoiZMUNBM8
26 Apr 17
@DataKimist Enjoy! And let us know if we can help.
19 Apr 17
@mjpchaisson Not meant that way - just depending on what you're doing you may want to cite earlier framework or lat… https://t.co/QpIbwRf0bC
18 Apr 17
@cabioinformatic For more recent versions see https://t.co/QCbos5KBWw
15 Apr 17
@thatdnaguy @notigor @David_McGaughey @brent_p Indel Realign is redundant with assembly-based realign done by HC, w… https://t.co/77Lyil7BJY
13 Apr 17

Our favorite tweets from others

best error output: Please do NOT post this error to the GATK forum unless you have really tried to fix it yourself.
4 Apr 17
From the @gatk_dev page describing .vcf files: "Don't write home-brewed VCF parsing scripts. It never ends well” https://t.co/28KcRoV14j
28 Feb 17
Our 3-day course on GATK https://t.co/mtN60KRTyS finished - 38 participants very happy! Big thanks to @gatk_dev team for excellent lessons.
24 Feb 17
@froggleston @dgmacarthur Sounds like ExAC is reaching Uber stage. ‘Uber but for pizza’. ‘ExAC but for wheat’.
14 Jan 17
#ESRenpeinture grad school - postdoc - after postdoc https://t.co/o3vQMgBDgk
6 Jan 17
See more of our favorite tweets...
Search blog by tag

appistry ashg ashg16 benchmarks best-practices bug bug-fixed cloud cluster cnv collaboration community compute conference conferences cram cromwell depthofcoverage diagnosetargets error forum gatk3 gatk4 genotype-refinement genotypegvcfs google grch38 gvcf haploid haplotypecaller help hg38 holiday hts htsjdk ibm intel java8 job job-offer jobs license meetings mutect mutect2 ngs outreach pairhmm parallelism patch pdf performance picard pipeline plans ploidy polyploid poster presentations printreads profile promote release release-notes rnaseq runtime saas script sequencing service slides snow speed status support talks team terminology topstory troll tutorial unifiedgenotyper vcf-gz version-highlights wdl workflow workshop xhmm