The core GATK Best Practices workflow has historically focused on variant discovery --that is, the existence of genomic variants in one or more samples in a cohorts-- and consistently delivers high quality results when applied appropriately. However, we know that the quality of the individual genotype calls coming out of the variant callers can vary widely based on the quality of the BAM data for each sample. To address the increasing need for more accurate genotypes, especially in the context of analyses performed on families, we have developed a new workflow that uses additional data such as pedigrees and genotype priors to improve the accuracy of genotype calls, to filter genotype calls that are not reliable enough for downstream analysis and to tag possible de novo mutations.

The Genotype Refinement workflow is meant to serve as an optional extension of the variant calling workflow, intended for researchers whose work requires high-quality identification of individual genotypes. It is documented in a new method article, with mathematical details available separately. See the corresponding tutorial for step-by-step instructions on how to actually run it in practice.

Note that although all tools involved in this workflow are already available in GATK 3.2, some functionalities will only be available in the latest development version (see nightly builds in the Downloads section) until the release of version 3.3 (which is imminent).

Let us know how the new workflow works for you; as usual, we crave feedback and are happy to answer any and all questions.

Comment on this article

- Recent posts

- Upcoming events

See Events calendar for full list and dates

- Recent events

See Events calendar for full list and dates

- Follow us on Twitter

GATK Dev Team


RT @evolscientist: #AcademicValentines
15 Feb 19
@camerongenomics Ok will check on those, thanks. Been pretty wrapped up in getting 4.1 out
12 Feb 19
@camerongenomics Ooh... Can you please open an issue in our GitHub repo so our core engineers can discuss?
11 Feb 19
@micknudsen Glad you enjoyed the workshop, our crew had a blast too! Let us know in the forum if you need anything else to get going.
1 Feb 19
GATK 4.1 has been released! See highlights and the start of a multi-part blog series here:
1 Feb 19

- Our favorite tweets from others

Roses are red Violets are blue Please stop using gnomAD as a control group for association studies Unless you have…
14 Feb 19
14 Feb 19
On my way home after a fabulous @gatk_dev workshop in Copenhagen. Looking forward to get started implementing…
1 Feb 19
Beautiful graphic from @lea_starita on explosion of variants of unknown significance as sequencing data has grown!!…
23 Jan 19
Have Cromwell running on AWS Batch, very easy to work with WDL and get things working. Cool stuff!
6 Nov 18

See more of our favorite tweets...