We have just released the latest data update to the Open Targets Platform — 22.04.
Key highlights for this release:
- New datasource: gene burden analyses from Regeneron in the GWAS Catalog and the AstraZeneca PheWAS portal
- Integration of structural variants from ClinVar
- Additional information in ChEMBL from DailyMed drug label text-mining
- NLP classification of why clinical trials stopped, which affects scoring
- New data: Gene2phenotype cardiac panel
- Also: renaming Phenodigm to IMPC, removing the PheWAS Catalog datasource
This release integrates 13,829,174 evidence strings to build 7,541,360 target-disease associations between 18,520 diseases and 61,524 targets from the following 22 public resources:
- 1,145,558 genetic evidence from European Variation Archive (EVA)
- 693,567 genetic evidence from Open Targets Genetics Portal
- 3,594 genetic evidence from Gene2Phenotype
- 35,622 genetic evidence from the Genomics England PanelApp
- 1,636 genetic evidence from ClinGen
- 6,001 genetic evidence from Orphanet
- 18,715 genetic evidence from Gene burden
- 6,272 genetic evidence from UniProt Literature
- 11,653 somatic evidence from European Variation Archive (EVA)
- 3,299 somatic evidence from intOGen
- 74,159 somatic evidence from the Cancer Gene Census
- 39,419 somatic evidence from Uniprot
- 594,375 drug evidence from ChEMBL
- 231,165 expression evidence from Expression Atlas
- 10,345 affected pathway evidence from Reactome
- 72,369 affected pathway evidence from SLAPenrich
- 378 affected pathway evidence from PROGENy
- 389 systems biology evidence from SysBio
- 1,299 somatic evidence from the Cancer Genome Interpreter
- 1,846 CRISPR-Cas9 (Cancer Cell Lines) evidence from Behan et al. 2019
- 1,166,212 mouse model evidence from IMPC
- 9,746,463 scientific literature evidence from co-occurence mining in Europe PMC
Additionally, the Platform now allows users to explore data on 12,854 drugs.
For more details, read the 22.04 release blog post.