Where to find tractability_buckets file in the latest version, for example, version:22.11?

Dear Team,

Where to find tractability_buckets file in the latest version, for example, version:22.11?

http://ftp.ebi.ac.uk/pub/databases/opentargets/platform/latest/input/

tractability_buckets-2021-03-08.tsv

Meanwhile, do we have a webpage for the details for each column name? for example:

[1] “ensembl_gene_id” “symbol”
[3] “accession” “Entry”
[5] “name” “Protein”
[7] “names” “IDG_family”
[9] “IDG_dtoclass” “IDG_tdl”
[11] “IDG_fam” “GO_BioProcess”
[13] “GO_MolFunction” “SM_B1_Approved”
[15] “Drug” “SM_B2_Advanced”
[17] “Clinical” “SM_B3_Phase”
[19] “1” “Clinical_1”
[21] “SM_B4_Structure” “with”
[23] “Ligand” “SM_B5_High-Quality”
[25] “Ligand_1” “SM_B6_High-Quality”
[27] “Pocket” “SM_B7_Med-Quality”
[29] “Pocket_1” “SM_B8_Druggable”
[31] “Family” “Bucket_sum_sm”
[33] “Top_bucket_sm” “drug_chembl_ids_sm”
[35] “drug_names_sm” “clinical_phases_sm”
[37] “PDB_Known_Ligand” “PDB_other”
[39] “High_Quality_ChEMBL_compounds” “DrugEBIlity_score”
[41] “Small_Molecule_Druggable_Genome_Member” “Category_sm”
[43] “Clinical_Precedence_sm” “Discovery_Precedence_sm”
[45] “Predicted_Tractable_sm” “drug_names_dict_sm”
[47] “clinical_phases_dict_sm” “AB_B1_Approved”
[49] “Drug_1” “AB_B2_Advanced”
[51] “Clinical_2” “AB_B3_Phase”
[53] “1_1” “Clinical_3”
[55] “AB_B4_UniProt” “loc”
[57] “high” “conf”
[59] “AB_B5_GO” “CC”
[61] “high_1” “conf_1”
[63] “AB_B6_UniProt” “loc_1”
[65] “med” “conf_2”
[67] “AB_B7_UniProt” “SigP”
[69] “or” “TMHMM”
[71] “AB_B8_GO” “CC_1”
[73] “med_1” “conf_3”
[75] “AB_B9_Human” “Protein_1”
[77] “Atlas” “loc_2”
[79] “Bucket_sum_ab” “Top_bucket_ab”
[81] “drug_chembl_ids_ab” “drug_names_ab”
[83] “clinical_phases_ab” “Uniprot_high_conf_loc”
[85] “GO_high_conf_loc” “Uniprot_med_conf_loc”
[87] “GO_med_conf_loc” “Gene”
[89] “ontology” “(cellular”
[91] “component)” “Transmembrane”
[93] “Signal_peptide” “HPA_main_location”
[95] “Clinical_Precedence_ab” “Predicted_Tractable_ab_High_confidence”
[97] “Predicted_Tractable_ab_Medium_to_low_confidence” “Category_ab”
[99] “drug_names_dict_ab” “clinical_phases_dict_ab”
[101] “PR_B1_Approved” “Drug_2”
[103] “PR_B2_Advanced” “Clinical_4”
[105] “PR_B3_Phase” “1_2”
[107] “Clinical_5” “PR_B4_Literature”
[109] “PR_B5_UniProt” “Ubiquitination”
[111] “PR_B6_Database” “Ubiquitination_1”
[113] “PR_B7_Half-life” “Data”
[115] “PR_B8_Small” “Molecule”
[117] “Binder” “Bucket_sum_PROTAC”
[119] “Top_bucket_PROTAC” “PR_locB_Location”
[121] “Score” “mentioned_in_PROTAC_literature”
[123] “literature_count_PROTAC” “pub_id”
[125] “full_id” “title”
[127] “Max_halflife” “Min_halflife”
[129] “Uniprot_keyword” “Uniprot_PTM”
[131] “Uniprot_CrossLink” “Ub_PhosphoSitePlus”
[133] “Ub_mUbiSiDa_2013” “number_of_ubiquitination_sites”
[135] “Bcell_mean” “NKcell_mean”
[137] “Hepatocytes_mean” “MouseNeuorons_mean”
[139] “count_compound_chembl_ids_PROTAC” “Clinical_Precedence_PROTAC”
[141] “Literature_Precedence_PROTAC” “Discovery_Opportunity_PROTAC”
[143] “Category_PROTAC” “OC_B1_Approved”
[145] “Drug_3” “OC_B2_Advanced”
[147] “Clinical_6” “OC_B3_Phase”
[149] “1_3” “Clinical_7”
[151] “Bucket_sum_othercl” “Top_bucket_othercl”
[153] “drug_chembl_ids_othercl” “drug_names_othercl”
[155] “clinical_phases_othercl” “Clinical_Precedence_othercl”
[157] “Category_othercl” “drug_names_dict_othercl”
[159] “clinical_phases_dict_othercl”

Thanks.

Shicheng

Hello!

If you want to investigate the tractability raw assessments, this data is made available through our FTP. For example, the target annotation of the latest release is based on this file: http://ftp.ebi.ac.uk/pub/databases/opentargets/platform/latest/input/target-inputs/tractability/tractability.tsv

To know more about the provenance of each of those columns, the best place to look at is the source code at tractability_pipeline_v2/README.md at 1712a5e001bc5daedf912f2ed60ca8a7bf97b081 · chembl/tractability_pipeline_v2 · GitHub

Here you can see how each of the buckets are assessed. Please let us know if you have further questions!

Best,
Irene