How are rsIDs mapped to unique variants in Open Targets Genetics?

SirTarget · 16 February 2022 09:32

Hello!

In the Genetics portal FAQs you mention, regarding mapping conflicts:

Many multi-allelic sites can be assigned a single rsID, and some rsIDs can point to different positions in the genome.

This means that rsIDs are not unique to a single variant. We have mapped all rsIDs from GWAS Catalog to unique variants.

A small minority of rsIDs will map to multiple variant IDs (approximately 0.6% of lead variants). When this occurs, variants will be duplicated in the portal.

Where does the mapping from rsID to unique variants happen? I’ve looked at the variant annotation code on GitHub, but I can’t see an explicit disambiguation step there.

To be clear, my question is: when a given rsID maps to more than one locus, how do you pick the single locus it is assigned to in the variant-index dataset?

This question was sent to the Open Targets helpdesk and has been anonymised.

JeremyS · 1 March 2022 10:48

This mapping actually happens in GWAS Catalog, not in Open Targets itself. When we import GWAS with summary statistics from GWAS Catalog, the data have already been harmonised to ensure that the effect allele is clear, that the alleles are reported with respect to the forward strand, and that each row is unique (chromosome, position, effect allele, other allele). In Open Targets Genetics, we use this information to display any effect sizes (beta or OR) with the alternative (non-reference) allele as the effect allele.
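
As a rough illustration of what displaying effects "with the alternative allele as the effect allele" implies, here is a minimal Python sketch. It is not the Open Targets or GWAS Catalog pipeline code; the function names and arguments are hypothetical, and it assumes the row is already on the forward strand with known ref and alt alleles.

```python
def beta_for_alt(ref: str, alt: str, effect_allele: str, beta: float) -> float:
    """Return beta expressed with the alt allele as the effect allele.

    Hypothetical helper: assumes alleles are already on the forward strand.
    """
    if effect_allele == alt:
        return beta   # already reported for the alt allele
    if effect_allele == ref:
        return -beta  # reported for ref, so flip the sign
    raise ValueError("effect allele matches neither ref nor alt")


def odds_ratio_for_alt(ref: str, alt: str, effect_allele: str, odds_ratio: float) -> float:
    """Same idea for an odds ratio: swapping the effect allele inverts the OR."""
    if effect_allele == alt:
        return odds_ratio
    if effect_allele == ref:
        return 1.0 / odds_ratio
    raise ValueError("effect allele matches neither ref nor alt")
```

For example, a row reporting beta = 0.2 for the reference allele would be shown as beta = -0.2 for the alternative allele.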

This FAQ entry is simply to point out that a single rsID may correspond to more than one variant as defined by chr:pos:ref:alt, since there may be more than one alt allele at a single position. This doesn’t mean that a single rsID maps to more than one locus. (We don’t use data from alternative contigs in our pipelines.)
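
To make the one-rsID-to-many-variants case concrete, here is a small Python sketch. The rsID, coordinates, and alleles are made up for illustration, the helper names are hypothetical, and joining the four fields with underscores is an assumption about the variant ID format.

```python
from collections import defaultdict


def variant_id(chrom: str, pos: int, ref: str, alt: str) -> str:
    # Assumed ID format: the chr:pos:ref:alt fields joined with underscores.
    return f"{chrom}_{pos}_{ref}_{alt}"


def variants_by_rsid(rows):
    """Group variant IDs by rsID.

    rows: iterable of (rsid, chrom, pos, ref, alt) tuples.
    """
    grouped = defaultdict(list)
    for rsid, chrom, pos, ref, alt in rows:
        grouped[rsid].append(variant_id(chrom, pos, ref, alt))
    return dict(grouped)


# Made-up multi-allelic site: one rsID, one position, two alt alleles.
rows = [
    ("rs_example", "1", 1000, "A", "G"),
    ("rs_example", "1", 1000, "A", "T"),
]
mapping = variants_by_rsid(rows)
```

Here `mapping["rs_example"]` holds two variant IDs that share the same chromosome and position and differ only in the alt allele, which is the duplication the FAQ describes.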
