Variant tools import different types of variants as follows:
Type | Reference | Alternative | Imported Variant(s) | Note |
---|---|---|---|---|
SNV | A |
G |
A ,G |
|
TC |
TG |
C ,G |
pos + 1 | |
Deletion | TC |
T |
C,- |
pos + 1 |
TCG |
TG |
C,- |
pos + 1 | |
TCGC |
TC |
GC,- |
pos + 2, * | |
TC |
- or . |
TC,- |
Not VCF compatible | |
Insertion | TCG |
TCAG |
-,A |
pos + 2 |
TC |
TCA |
-,A |
pos + 2 | |
- or . |
A |
-,A |
not VCF compatible | |
MNP | AA |
ATAAC |
A,TAAC |
pos + 1 |
TACT |
TCTA |
ACT,CTA |
pos + 1 | |
Mixed | A |
C,G |
A,C A,G |
Two single nucleotide variants |
TC |
TCGG,T |
-,GG C,- |
A deletion and an insertion |
Note that
-
or .
are treated as missing allele and can be used to import indels.10, ACG, A
will be imported as variant CG,-
at position 11. The Common ending alleles are also removed. We remove common leading alleles greedily to avoid ambiguity. For example, deletion TCGC
->GC
(case * in the table) can be intepretted as a deletion of GC
at pos + 2 and CG
at pos + 1, variant tools uses the first interpretation.vtools import --format ANNOVAR
to import annotation for ANNOVAR might be a good choice.