Mutect2 resources guide

Posted by shlee on 31 Jan 2018


A new tutorial for somatic calling

We have a new tutorial, Tutorial#11136, that outlines how to call somatic short variants, i.e. SNVs and indels, with GATK4 Mutect2. The tutorial provides small example data to follow along with.

Mutect2-compatible germline resources

Full-length Mutect2-compatible human germline resources are available on our [FTP server](https://software.broadinstitute.org/gatk/download/bundle) and at gs://gatk-best-practices/. The resources are simplified from the gnomAD resource and retain population allele frequencies. Mutect2 and GetPileupSummaries are the two tools in the workflow that each require a germline resource.
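To make it concrete where the germline resource enters the workflow, here is a minimal Python sketch that assembles the two command lines. It only builds and prints the commands; it does not run GATK. All file and sample names are hypothetical placeholders, and the flags reflect our recollection of GATK4 command lines from around the time of this post, so check each tool's documentation for your version (for example, later releases may expect an intervals argument for GetPileupSummaries or handle sample names differently).

```python
import shlex


def mutect2_command(reference, tumor_bam, tumor_sample, germline_resource, output_vcf):
    """Build a Mutect2 command line that passes a germline resource.

    The resource is expected to carry population allele frequencies.
    """
    return [
        "gatk", "Mutect2",
        "-R", reference,
        "-I", tumor_bam,
        "-tumor", tumor_sample,  # sample-name flag as in early GATK4 releases (assumption)
        "--germline-resource", germline_resource,
        "-O", output_vcf,
    ]


def pileup_summaries_command(tumor_bam, germline_resource, output_table):
    """Build a GetPileupSummaries command line; -V takes the germline resource."""
    return [
        "gatk", "GetPileupSummaries",
        "-I", tumor_bam,
        "-V", germline_resource,
        "-O", output_table,
    ]


if __name__ == "__main__":
    # Hypothetical paths; substitute your own reference, BAM, and resource files.
    calling = mutect2_command(
        "ref.fasta", "tumor.bam", "tumor_sample_name",
        "germline_resource.vcf.gz", "somatic_unfiltered.vcf.gz",
    )
    pileups = pileup_summaries_command(
        "tumor.bam", "germline_resource_common.vcf.gz", "tumor_pileups.table",
    )
    print(shlex.join(calling))
    print(shlex.join(pileups))
```

The two functions take the resource as separate parameters because the two tools need not be given the same file; which resource to pass to each is covered in the tutorial and tool documentation.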

Working WDL scripts

If you want to run the Somatic Short Variant Discovery Best Practices workflow using WDL, be sure to check out the official Mutect2 WDL script in the gatk-workflows repository. @bshifaw and other engineers optimize the scripts in the repository to run efficiently in the cloud. Furthermore, the scripts come with example JSON-format input files filled out with publicly accessible cloud data.

For other Mutect2-related scripts, e.g. towards panel of normals generation, check out the gatk repository's scripts/mutect2_wdl directory. Our developers update these scripts on a continual basis.

For background information

If you are new to somatic calling, be sure to read Article#11127. It gives an overview of what traditional somatic calling entails. For one, somatic calling is NOT simply the difference between two callsets, because germline variant sites are excluded from consideration.

For those switching from GATK3 MuTect2, Blog#10911 will bring you up to speed on the differences.

An off-label tutorial for simple difference calling

If you are interested in simply calling differences between two samples, Blog#11315 outlines an off-label two-pass Mutect2 workflow. Off-label means the workflow is not a part of the Best Practices and is therefore unsupported. However, given enough community interest, we may be convinced to flesh out the workflow further. Please post to the forum to express interest.


