Tandem repeat unit composition and counts per allele
INFO=<ID=STR,Number=0,Type=Flag,Description="Variant is a short tandem repeat">
INFO=<ID=RU,Number=1,Type=String,Description="Tandem repeat unit (bases)">
INFO=<ID=RPA,Number=.,Type=Integer,Description="Number of times tandem repeat unit is repeated, for each allele (including reference)">
This annotation tags variants that fall within tandem repeat sets. It also provides the composition of the tandem repeat units and the number of times they are repeated for each allele (including the REF allele).
A tandem repeat unit is composed of one or more nucleotides that are repeated multiple times in series. Repetitive sequences are difficult to map to the reference because they are associated with multiple alignment possibilities. Knowing the number of repeat units in a set of tandem repeats tells you the number of different positions the tandem repeat can be placed in. The observation of many tandem repeat units multiplies the number of possible representations that can be made of the region.
GATK version 3.8-0-ge9d806836 built at 2017/07/29 01:40:22.