Score values from Disease->Target vs. Target->Disease

Heather · 21 June 2021 19:24

Hi,
I was looking for the association score of MAPT to neurodegenerative disease. When I search for “Neurodegenerative Disease” (Open Targets Platform) and then filter to MAPT, the score comes up as 0.86. However, when I start from MAPT and filter for neurodegenerative disease (Open Targets Platform) the score comes up as 0.01.
Please could you explain the difference?
Thank you!

ahercules · 23 June 2021 08:59

Hello @Heather!

Welcome to the Open Targets Community!

The difference in the scores presented in the UI is due to how we handle direct and indirect evidence. By this, I mean how we handle evidence for the specific disease/phenotype (direct) and evidence for a child term of the disease/phenotype (indirect).

When you search for “neurodegenerative disease”, we show you the list of associated targets and the overall score and data type scores are based on both direct evidence (e.g. text mining for MAPT and neurodegenerative disease) and indirect evidence (e.g. genetic association evidence for MAPT and Parkinson’s disease, a child term of neurodegenerative disease).

However, when you search for MAPT, we show you the list of associated diseases and phenotypes and the overall score and data type scores are based on direct evidence only (e.g. text mining for MAPT and neurodegenerative disease).

Despite the different scores depending on your search term, the evidence page for MAPT and neurodegenerative disease contains both direct and indirect evidence.

Please let me know if this has answered your question.

Thank you!

Heather · 23 June 2021 23:37

Thanks so much for your reply @ahercules !
I do have a few follow-on questions that I hope you can also help with. Firstly, when I use the GraphQL API (GraphQL Playground) the results I get show “score” and dont specify whether the score is direct or indirect (I assume it is an overall score though). Performing a disease search I get this:
Disease-TargetScore
And when I use the target search I see:
Target-DiseaseScore
But I dont see how I can know which score is direct and which is not - it is possible to know?

The other question is regarding the type of evidence - is it possible to see the scores coming from different evidence sources using the GraphQL API?

Thanks once again,

Heather

ahercules · 25 June 2021 11:55

Hi @Heather!

No, unfortunately it’s not possible to see direct and indirect scores using the API. Similar to what you will see in the UI, if you start your query with the target endpoint (second screenshot), your results will include associations only based on direct evidence and so the score will only be based on direct evidence. If you start your query with the disease endpoint (first screenshot), your results will include associations based on both direct and indirect evidence and the score will be based on both types of evidence.

If you are keen on getting separate direct and indirect association scores, you can use our data downloads and sample Python and R scripts to access and query specific datasets for direct and indirect associations.

Alternatively, you can also use our Google BigQuery instance to explore the direct and indirect datasets for a given target-disease association and export the results in various formats. See below for example queries for EGFR and lung cancer using both the direct and indirect datasets.

Query using the direct dataset: https://console.cloud.google.com/bigquery?sq=352646847630:755651bdf6f345d8a895cce534b53653
Query using the indirect dataset: https://console.cloud.google.com/bigquery?sq=352646847630:5f698db78d23423cb7f6f5a2b8b186cd

If you run both of the above queries, you will see how the chembl and europepmc scores are different depending on whether you are using direct or indirect datasets.

In regards to your second question, if you are using the GraphQL API, you can query for the per data source scores using the following query:

query targetAssociations {
  target(ensemblId:"ENSG00000146648"){
    id
    approvedSymbol
    associatedDiseases{
      count
      rows{
        disease{
          id
          name
        }
        score
        datasourceScores{
          id
          score
        }
      }
    }
  }
}

Sample query for EGFR

Cheers,

Andrew

Topic		Replies	Views
Differences in overall association scoring between target-disease and disease-target General	3	360	28 September 2022
Spurious indirect association/evidence via GraphQL API? Bug reports	6	442	19 January 2023
How does the Platform display direct and indirect evidence? Frequently Asked Questions	7	863	13 May 2022
R script for GraphQL query: query targetDiseaseEvidence GraphQL API	7	989	11 January 2024
Incoherency between score in a disease page and a gene page Bug reports ot-platform	4	25	24 February 2025

Score values from Disease->Target vs. Target->Disease

Related topics