Export DICOM images via FTPS #190

stefpiatek · 2023-12-18T09:31:47Z

Definition of Done / Acceptance Criteria

De-identified DICOM images will be downloaded from orthanc-anon and sent via FTPS to the DSH under the project slug directory using the pseudonymised id as the filename (e.g. {project-slug}/{study-pseduonymised-id}.zip)

Breakdown of steps for progress:

get pseudonymised identifier for image from DICOM data (currently in patient ID tag you can do this in a REST request: {dicom-server}/instances/{resource-id}/content/0010-0020 gets us just the patient idenfier, see below for details)
get project name slug from PIXL db using the pseudonymised identifier (in pixl db: Image.hashed_identifier)
download zip of DICOM data (see details below)
upload zip of DICOM data to {project-slug}/{study-pseduonymised-id}.zip on the DSH (see details and prototype repo below)
Update the existing image row in the database with the current time added to exported_at

Testing

In an ideal world we'd have a mock DSH in docker compose which we can ftp into. If we can set this up in a day then probably worth it. Have a prototype repo for setting this up so I think this should be quick,

Current State

At the moment DICOM images are pushed via the DICOMweb protocol to an azure DICOM server. Sadly the DSH doesn't have a DICOM server that we can use, so we'll have to use FTPS for 100 days.

Documentation

No response

Dependencies

DSH account for uploading data only for the project UCLH-Foundry/the-rolling-skeleton#77
de-identification done using Image pseudonymisation stored in database #188

Details and Comments

Getting hashed identifier

API documentation to get a raw tag Example using demo instance for tag only: https://orthanc.uclouvain.be/demo/instances/83354841-24928346-008ed987-8032f76c-6363d8eb/content/0010-0020
API documentation for a resource's tags : Example to get all common https://orthanc.uclouvain.be/demo/instances/83354841-24928346-008ed987-8032f76c-6363d8eb/tags?simplify

Downloading dicom data

It looks like its possible to do this using the resourceId that we currently use in the dicomweb request in SendViaStow in orthanc-anon.

All instances of a study can be retrieved as a zip file as follows:

$ curl http://localhost:8042/studies/6b9e19d9-62094390-5f9ddb01-4a191ae7-9766b715/archive > Study.zip

So in python we could do something like this?

# Query orthanc-anon for the study
query = f "{orthanc_url}/studies/{resource_id}/archive"
response_study = requests.get(query, verify=False,   
							  auth=(orthanc_user, orthanc_password))  
if response_study.status_code != 200:  
	raise SpecificException(f"Could not download archive of resource '{resource_id}'")
# get the zip content
zip_content = response_study.content

Copying file

@tcouch uses this curl command to send data to the DSH

curl --netrc --tlsv1.2 --ftp-ssl-reqd --ftp-create-dirs -T $ZIPFILE.zip "ftps://filetransfer.idhs.ucl.ac.uk/"

We can convert this to python that creates a directory in the FTP server if it doesn't exist. 🎉 Prototype repo that can be used as a basis for automated testing 🎉

The text was updated successfully, but these errors were encountered:

docsteveharris · 2024-01-03T11:05:37Z

request from Tim to bear in mind that we'd need to the same for non-imaging tasks

stefpiatek added this to the 100-days milestone Dec 18, 2023

stefpiatek mentioned this issue Dec 18, 2023

Export current extract parquet files to DSH #192

Closed

docsteveharris assigned jeremyestein and milanmlft Jan 3, 2024

stefpiatek assigned ruaridhg and unassigned jeremyestein Jan 15, 2024

milanmlft changed the title ~~Export DICOM images via SFTP~~ Export DICOM images via FTPS Jan 15, 2024

milanmlft mentioned this issue Jan 16, 2024

Upload DICOM images with FTPS #226

Merged

milanmlft closed this as completed in #226 Jan 23, 2024

stefpiatek mentioned this issue Feb 1, 2024

Add FTPS server to system test and test successful upload #268

Merged

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Export DICOM images via FTPS #190

Export DICOM images via FTPS #190

stefpiatek commented Dec 18, 2023 •

edited by milanmlft

Loading

docsteveharris commented Jan 3, 2024

Export DICOM images via FTPS #190

Export DICOM images via FTPS #190

Comments

stefpiatek commented Dec 18, 2023 • edited by milanmlft Loading

Definition of Done / Acceptance Criteria

Testing

Current State

Documentation

Dependencies

Details and Comments

Getting hashed identifier

Downloading dicom data

Copying file

docsteveharris commented Jan 3, 2024

stefpiatek commented Dec 18, 2023 •

edited by milanmlft

Loading