I am trying to load the 22.09 release data into my own instance of the genetics portal. The loader script genetics-backend/loaders/clickhouse/create_and_load_everything_from_scratch.sh
seems to assume that the JSON data for the studies and genes are in these directories:
"${base_path}"/json/lut/study-index/part-*
and
"${base_path}"/json/lut/genes-index/part-*
But there is no json folder in the released data, either on FTP or on Google Cloud.
There ARE directories containing JSON files that seem to be the same, though I have not checked thoroughly:
"${base_path}"/search/study/part-*
and
"${base_path}"/search/genes/part-*
Are those indeed the same, and should I just change the elasticsearch-loader to read from those?
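For what it's worth, the check I have in mind is to list the top-level field names found in the search part files and compare them against what the loader expects. A minimal sketch in Python, assuming each part file holds one JSON object per line (I have not confirmed this); `summarize_keys` and the `max_records` cap are just names I made up for illustration:

```python
import glob
import json
import sys
from collections import Counter


def summarize_keys(pattern, max_records=1000):
    """Count top-level keys over the first max_records JSON lines matching pattern.

    Assumes newline-delimited JSON with one object per line (unverified).
    """
    counts = Counter()
    seen = 0
    for path in sorted(glob.glob(pattern)):
        with open(path, encoding="utf-8") as handle:
            for line in handle:
                line = line.strip()
                if not line:
                    continue  # skip blank lines
                counts.update(json.loads(line).keys())
                seen += 1
                if seen >= max_records:
                    return seen, counts
    return seen, counts


if __name__ == "__main__":
    # Each argument is a quoted glob pattern, e.g. "<base_path>/search/study/part-*"
    for pattern in sys.argv[1:]:
        seen, counts = summarize_keys(pattern)
        print(f"{pattern}: {seen} records sampled")
        for key, n in sorted(counts.items()):
            print(f"  {key}: {n}")
```

Running it with the two search patterns above (quoted, so the shell does not expand them) would print the field names per directory, which I could then compare against the fields the loader expects.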
Thanks
Thon