Unable to parse JSON files

sigven · 30 April 2021 16:23

Hi,

I am unable to parse the JSON files available for download (using the jsonlite package in R). I have tried drugs, targets, etc., but all return the same error:

Error in parse_con(txt, bigint_as_char) : parse error: trailing garbage
k/drug/moclobemide.html"]}]} {"actionType":"ANTAGONIST","mec
(right here) ------^

ochoa · 30 April 2021 18:18

If you are keen on R, I would suggest trying the sparklyr tutorial in the documentation. That should circumvent some of the issues you are having.

The main reason this is not working for you is that the files are in JSON Lines format, not plain JSON.
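
To illustrate the difference, here is a minimal sketch in R, assuming jsonlite is installed. The two records and the temporary file are made up for the example and are not actual Open Targets data. In JSON Lines every line is a complete JSON document on its own, so a parser that expects a single document per file stops at the start of the second line, while reading line by line works.

```r
library(jsonlite)

# Two made-up records, one JSON object per line (JSON Lines layout).
path <- tempfile(fileext = ".json")
writeLines(c('{"id": "A1", "actionType": "ANTAGONIST"}',
             '{"id": "B2", "actionType": "INHIBITOR"}'), path)

# Treating the whole file as a single JSON document fails;
# the error message is captured here instead of stopping the script.
whole_file <- tryCatch(fromJSON(path), error = function(e) conditionMessage(e))

# Parsing each line as its own JSON document works.
records <- lapply(readLines(path), fromJSON)

# stream_in() reads line-delimited JSON and returns a data frame directly.
df <- stream_in(file(path), verbose = FALSE)
```

Here `whole_file` ends up holding the parse error text, `records` is a list with one element per line, and `df` has one row per record.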

sigven · 30 April 2021 19:17

OK, thanks for the swift response. From the single example using ClinVar, it is not clear how a similar approach should be adapted for targets/molecules/drugs etc. In particular, I do not understand how the local downloads of the parquet files (which I assume are necessary for this to work) should be arranged.

What should be present/downloaded in “/Users/OpenTargetsUser/Datasets” for the example to work?

Although the Spark approach is likely convenient and efficient (my first exposure to parquet, I must admit :-)), showing how the JSON data can be accessed or parsed would in my opinion be highly valuable to the community, considering that this is a well-established format.

sigven · 1 May 2021 07:01

It seems I have now managed to load the JSON data in R (example with targets):

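# Open the part file for reading and parse it as line-delimited JSON
con <- file('targets-part-00000.json', 'r')
target_data <- jsonlite::stream_in(con)
close(con)  # stream_in() does not close a connection that was already open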
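
The file name targets-part-00000.json suggests the download is split into several part files. Below is a minimal sketch for reading all of them into one data frame, assuming every part is in JSON Lines format. The directory, the file names and the records are made up for the demonstration and would need to be replaced by the real download location.

```r
library(jsonlite)

# Hypothetical stand-in for a download directory with two part files.
data_dir <- file.path(tempdir(), "targets_demo")
dir.create(data_dir, showWarnings = FALSE)
writeLines('{"id": "T1", "symbol": "AAA"}',
           file.path(data_dir, "targets-part-00000.json"))
writeLines(c('{"id": "T2", "symbol": "BBB"}',
             '{"id": "T3", "symbol": "CCC"}'),
           file.path(data_dir, "targets-part-00001.json"))

# Collect the part files, read each one with stream_in(), then combine.
part_files <- sort(list.files(data_dir, pattern = "^targets-part-.*\\.json$",
                              full.names = TRUE))
parts <- lapply(part_files, function(f) stream_in(file(f), verbose = FALSE))
target_data <- rbind_pages(parts)
```

rbind_pages() is used rather than rbind() because it tolerates parts whose columns do not match exactly, which can happen when a field is absent from every record in one file.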
