Datasets to be downloaded:
1.Opentargets: /pub/databases/opentargets/genetics/latest/
2.Opentargets non genetics: /pub/databases/opentargets/platform/latest/
2.VGNC: /pub/databases/genenames/vgnc/
3.HGNC: /pub/databases/genenames/hgnc/
@Kirill_Tsukanov From the past two weeks we are facing the issue . We tested again today but still facing the same issue.
Here are the OS details.
NAME=“Red Hat Enterprise Linux”
VERSION=“8.7 (Ootpa)”
ID=“rhel”
ID_LIKE=“fedora”
VERSION_ID=“8.7”
PLATFORM_ID=“platform:el8”
We are using FTPClient in linux for fetching files.
Could you please help us on how to download the files. Let us know if you need any further details.
Looking at the command you’re using, you don’t need to fetch the entire etl/ directory, because its subdirectories parquet/ and json/ contain exactly the same data, just in different formats. So as the first suggestion, try downloading just one of the subdirectories, depending on which format you want. Perhaps reducing the number of files will help with the reliability of the transfer.
Another flag which sometimes help inrease reliability is --passive-ftp. This way, wget needs to open less connections for a download, also potentially increasing stability.
Taking the above suggestions into account, then, this is the command that I suggest you use (assuming you want “json” format; substitute it with “parquet” otherwise):
The job is still running since 14 days and 13298/15703 files succeedded which usually takes one day approximately for the job to complete all the files.
Hi @Vijay, I’m sorry to hear you are still having issues.
At this point I think we’ve tried most options we could. The FTP server is managed by EBI directly, rather than by Open Targets, so as your next step I suggest you contact EBI support: Support and feedback | EMBL-EBI — please explain your problem and the issues you had trying to download the data. (You can send a link to this post to avoid repeating all steps that we tried and the error messages.)