Clinical precedence not capturing entire data

Hi again @Karl_Gemayel,

I guess these are the inconsistencies you commented on the other thread :rofl:

The reason you are seeing discrepancies is because the clinical precedence dataset and the indications dataset basically draw from two different sources:

  • Mechanism of Action / Indication / Drug Warnings, they are reproductions of the widgets you can find on ChEMBL. We download them directly from their database.
  • Clinical Precedence / Known Drugs, are datasets that we create derived from an ad-hoc pipeline that generates the disease/target evidence data from ChEMBL. In some cases it extends the data present in ChEMBL with extra annotation for us such as new mechanisms of action.

The reason why clindamycin palmitate is not in the evidence set is because the target is not human, and therefore it falls out of the Clinical Precedence dataset as well. You can find similar cases also when we are missing the mechanism of action annotation, as for Tozinameran.
Clinical precedence is derived from the evidence mainly for historical reasons (drug annotations were incorporated later). We will discuss in the team how we want to scope this task and I will keep you posted.

Thanks for reporting it!
Irene