Where to find tractability_buckets file in the latest version, for example, version:22.11?

Shicheng_Guo · 14 January 2023 00:20

Dear Team,

http://ftp.ebi.ac.uk/pub/databases/opentargets/platform/latest/input/

tractability_buckets-2021-03-08.tsv

Meanwhile, do we have a webpage for the details for each column name? for example:

[1] “ensembl_gene_id” “symbol”
[3] “accession” “Entry”
[5] “name” “Protein”
[7] “names” “IDG_family”
[9] “IDG_dtoclass” “IDG_tdl”
[11] “IDG_fam” “GO_BioProcess”
[13] “GO_MolFunction” “SM_B1_Approved”
[15] “Drug” “SM_B2_Advanced”
[17] “Clinical” “SM_B3_Phase”
[19] “1” “Clinical_1”
[21] “SM_B4_Structure” “with”
[23] “Ligand” “SM_B5_High-Quality”
[25] “Ligand_1” “SM_B6_High-Quality”
[27] “Pocket” “SM_B7_Med-Quality”
[29] “Pocket_1” “SM_B8_Druggable”
[31] “Family” “Bucket_sum_sm”
[33] “Top_bucket_sm” “drug_chembl_ids_sm”
[35] “drug_names_sm” “clinical_phases_sm”
[37] “PDB_Known_Ligand” “PDB_other”
[39] “High_Quality_ChEMBL_compounds” “DrugEBIlity_score”
[41] “Small_Molecule_Druggable_Genome_Member” “Category_sm”
[43] “Clinical_Precedence_sm” “Discovery_Precedence_sm”
[45] “Predicted_Tractable_sm” “drug_names_dict_sm”
[47] “clinical_phases_dict_sm” “AB_B1_Approved”
[49] “Drug_1” “AB_B2_Advanced”
[51] “Clinical_2” “AB_B3_Phase”
[53] “1_1” “Clinical_3”
[55] “AB_B4_UniProt” “loc”
[57] “high” “conf”
[59] “AB_B5_GO” “CC”
[61] “high_1” “conf_1”
[63] “AB_B6_UniProt” “loc_1”
[65] “med” “conf_2”
[67] “AB_B7_UniProt” “SigP”
[69] “or” “TMHMM”
[71] “AB_B8_GO” “CC_1”
[73] “med_1” “conf_3”
[75] “AB_B9_Human” “Protein_1”
[77] “Atlas” “loc_2”
[79] “Bucket_sum_ab” “Top_bucket_ab”
[81] “drug_chembl_ids_ab” “drug_names_ab”
[83] “clinical_phases_ab” “Uniprot_high_conf_loc”
[85] “GO_high_conf_loc” “Uniprot_med_conf_loc”
[87] “GO_med_conf_loc” “Gene”
[89] “ontology” “(cellular”
[91] “component)” “Transmembrane”
[93] “Signal_peptide” “HPA_main_location”
[95] “Clinical_Precedence_ab” “Predicted_Tractable_ab_High_confidence”
[97] “Predicted_Tractable_ab_Medium_to_low_confidence” “Category_ab”
[99] “drug_names_dict_ab” “clinical_phases_dict_ab”
[101] “PR_B1_Approved” “Drug_2”
[103] “PR_B2_Advanced” “Clinical_4”
[105] “PR_B3_Phase” “1_2”
[107] “Clinical_5” “PR_B4_Literature”
[109] “PR_B5_UniProt” “Ubiquitination”
[111] “PR_B6_Database” “Ubiquitination_1”
[113] “PR_B7_Half-life” “Data”
[115] “PR_B8_Small” “Molecule”
[117] “Binder” “Bucket_sum_PROTAC”
[119] “Top_bucket_PROTAC” “PR_locB_Location”
[121] “Score” “mentioned_in_PROTAC_literature”
[123] “literature_count_PROTAC” “pub_id”
[125] “full_id” “title”
[127] “Max_halflife” “Min_halflife”
[129] “Uniprot_keyword” “Uniprot_PTM”
[131] “Uniprot_CrossLink” “Ub_PhosphoSitePlus”
[133] “Ub_mUbiSiDa_2013” “number_of_ubiquitination_sites”
[135] “Bcell_mean” “NKcell_mean”
[137] “Hepatocytes_mean” “MouseNeuorons_mean”
[139] “count_compound_chembl_ids_PROTAC” “Clinical_Precedence_PROTAC”
[141] “Literature_Precedence_PROTAC” “Discovery_Opportunity_PROTAC”
[143] “Category_PROTAC” “OC_B1_Approved”
[145] “Drug_3” “OC_B2_Advanced”
[147] “Clinical_6” “OC_B3_Phase”
[149] “1_3” “Clinical_7”
[151] “Bucket_sum_othercl” “Top_bucket_othercl”
[153] “drug_chembl_ids_othercl” “drug_names_othercl”
[155] “clinical_phases_othercl” “Clinical_Precedence_othercl”
[157] “Category_othercl” “drug_names_dict_othercl”
[159] “clinical_phases_dict_othercl”

Thanks.

Shicheng

irene · 16 January 2023 18:07

Hello!

If you want to investigate the tractability raw assessments, this data is made available through our FTP. For example, the target annotation of the latest release is based on this file: http://ftp.ebi.ac.uk/pub/databases/opentargets/platform/latest/input/target-inputs/tractability/tractability.tsv

To know more about the provenance of each of those columns, the best place to look at is the source code at tractability_pipeline_v2/README.md at 1712a5e001bc5daedf912f2ed60ca8a7bf97b081 · chembl/tractability_pipeline_v2 · GitHub

Here you can see how each of the buckets are assessed. Please let us know if you have further questions!

Best,
Irene

Topic		Replies	Views
Where to access PROTAC data? Data Access	2	296	3 August 2021
Tractability SM bucket (#9 is missing ?) Data issue	3	304	17 August 2021
Opentargets Tractability dataset (from chembl) Data downloads	1	329	20 June 2023
Antibodies tractability pipe and results issues Community Feedback	5	500	22 April 2022
Documentation for reproducing platform target profiles from data downloads? Technical Support	1	311	16 March 2022

Where to find tractability_buckets file in the latest version, for example, version:22.11?

Related topics