GATK 2.3 was released on December 17, 2012. Highlights are listed below. Read the detailed version history overview here: http://www.broadinstitute.org/gatk/guide/version-history

Base Quality Score Recalibration

  • Soft clipped bases are no longer counted in the delocalized BQSR.
  • The user can now set the maximum allowable cycle with the --maximum_cycle_value argument.

Unified Genotyper

  • Minor (5%) run time improvements to the Unified Genotyper.
  • Fixed bug for the indel model that occurred when long reads (e.g. Sanger) in a pileup led to a read starting after the haplotype.
  • Fixed bug in the exact AF calculation where log10pNonRefByAllele should really be log10pRefByAllele.

Haplotype Caller

  • Fixed the performance of GENOTYPE_GIVEN_ALLELES mode, which often produced incorrect output when passed complex events.
  • Fixed the interaction with the allele biased downsampling (for contamination removal) so that the removed reads are not used for downstream annotations.
  • Implemented minor (5-10%) run time improvements to the Haplotype Caller.
  • Fixed the logic for determining active regions, which was a bit broken when intervals were used in the system.

Variant Annotator

  • The FisherStrand annotation ignores reduced reads (because they are always on the forward strand).
  • Can now be run multi-threaded with -nt argument.

Reduce Reads

  • Fixed bug where sometime the start position of a reduced read was less than 1.
  • ReduceReads now co-reduces bams if they're passed in toghether with multiple -I.

Combine Variants

  • Fixed the case where the PRIORITIZE option is used but no priority list is given.

Phase By Transmission

  • Fixed bug where the AD wasn't being printed correctly in the MV output file.

Miscellaneous

  • A brand new version of the per site down-sampling functionality has been implemented that works much, much better than the previous version.
  • More efficient initial file seeking at the beginning of the GATK traversal.
  • Fixed the compression of VCF.gz where the output was too big because of unnecessary call to flush().
  • The allele biased downsampling (for contamination removal) has been rewritten to be smarter; also, it no longer aborts if there's a reduced read in the pileup.
  • Added a major performance improvement to the GATK engine that stemmed from a problem with the NanoSchedule timing code.
  • Added checking in the GATK for mis-encoded quality scores.
  • Fixed downsampling in the ReadBackedPileup class.
  • Fixed the parsing of genome locations that contain colons in the contig names (which is allowed by the spec).
  • Made ID an allowable INFO field key in our VCF parsing.
  • Multi-threaded VCF to BCF writing no longer produces an invalid intermediate file that fails on merging.
  • Picard jar remains at version 1.67.1197.
  • Tribble jar updated to version 119.

severin


In the new release for the HaplotypeCaller function does downsampling do anything? Can we still use --enable_experimental_downsampling? What is your recommendation.

Mon 17 Dec 2012

Geraldine_VdAuwera


You no longer need to use `--enable_experimental_downsampling` for anything; the experimental downsampling is now the regular downsampling (see Version history/Version highlights for details) and is used by default by all tools that downsample reads.

Mon 17 Dec 2012




- Recent posts


- Upcoming events

See Events calendar for full list and dates


- Recent events

See Events calendar for full list and dates



- Follow us on Twitter

GATK Dev Team

@gatk_dev

Our thanks to @edgenome for hosting us! A pleasure as always! https://t.co/C9XgB34OBX
22 Jul 17
RT @vchris21: FireCloud. Something to watch out for while working with GATK variant discovery. Very interesting https://t.co/oo8zzbG41f #ga…
19 Jul 17
RT @dr_gabry: Another great GATK workshop in Cambridge @BioInfoCambs @gatk_dev. Looking forward to see you in 2018!!!!!! https://t.co/v55zR
14 Jul 17
@tai_pach @wordpressdotcom The first beta is actually out already :) https://t.co/ILiFifpdcG Formal announcement coming next week.
8 Jul 17
RT @geoffjentry: @BGI_Genomics adopts GATK, @WDL_dev and Cromwell. Cromwell to get AliCloud support. New GATK & WDL outreach effort in Chin…
2 Jul 17

- Our favorite tweets from others

The @gatk_dev team, that delivered an excellent "GATK Best Practices for Variant Discovery" workshop this week, on… https://t.co/Z8GNduuDeJ
20 Jul 17
Amazing session @edgenome with @gatk_dev comes to an end. Enriched learning! Thx #gatk #gatk2017 #Genomics #Edinburgh #Bioinformatics
19 Jul 17
Great day of lectures on GATK Best Practices yesterday. Very clear and useful. Thanks! @gatk_dev @edgenome
18 Jul 17
Another great GATK workshop in Cambridge @BioInfoCambs @gatk_dev. Looking forward to see you in 2018!!!!!! https://t.co/v55zRztpHL
14 Jul 17
Getting started with #GoogleCloud and learnt that #Cromwell + #WDL by @broadinstitute is just awesome. Go #GATK
13 Jun 17
See more of our favorite tweets...