Difference in variant and variant_gene files

Dimitris_Zisis · 22 May 2023 15:05

I was wondering how to explain the main difference between variants and v2g files?
In variants there are geneIDs retrieved and most_severe_consequence to explain what are those genes and gene distance.
How those genes are related to the variants ? Is it just the nearest protein coding genes based on distance or there is another reason of associating each variant to each gene in variant file ? I understand that this is completely different from the v2g file in which there are genes retrieved after the application of v2g pipeline.

Xiangyu · 24 May 2023 16:01

Hi Dimitris,

The geneIDs and most_severe_consequence from the variants table are derived from the Variant Effect Predictor (VEP), most of the time, the geneIDs directly overlaps the SNPs, and the effects are in-silico predictions of the consequences of the SNPs, these predictions are based entirely on the DNA sequence alone and does not taken into account any additional functional data (which is what v2g does).

Best wishes,
Xiangyu

Dimitris_Zisis · 25 May 2023 09:10

Thank you very much for the clarification

Topic		Replies	Views
Variant to Gene Query for multiple variants in OT Genetics GraphQL API genetics-portal	4	629	20 August 2024
Difference between variant and gene pages Data issue genetics-portal	2	234	23 November 2023
Get V2G genes/scores from variant rsid? Google BigQuery/Cloud genetics-portal	7	873	20 August 2024
Why do some genes in Open Targets V2G data have 'NA' typeId and no reported distance? GraphQL API	1	50	26 July 2024
Downloading data for V2G pipeline Data Access datadownloads , genetics-portal	1	382	19 January 2023

Difference in variant and variant_gene files

Related topics