Add section about hdf5 in the headless mode doc. #142

jjerphan · 2020-08-10T20:02:12Z

This change adds a section of interest for Ilastik's headless mode documentation.

See discussions on image.sc forum:

https://forum.image.sc/t/notable-memory-usage-difference-when-running-ilastik-in-headless-mode-on-different-machines/41144/2

See discussions on image.sc forum: https://forum.image.sc/t/notable-memory-usage-difference-when-running-ilastik-in-headless-mode-on-different-machines/41144/2

imagesc-bot · 2020-08-10T20:04:04Z

This pull request has been mentioned on Image.sc Forum. There might be relevant details there:

https://forum.image.sc/t/notable-memory-usage-difference-when-running-ilastik-in-headless-mode-on-different-machines/41144/6

k-dominik

Hi @jjerphan,

thank you very much for the contribution! The text is on point. One thing that is missing is maybe the complete list of methods to convert your data to hdf5 as mentioned in the FAQ.

The only thing I am afraid of is that users find this information too late. When you're in the headless stage you have already undergone quite the suffering. Would you think this could be more findable in the Data Selection documentation? Would you, from a user perspective even read it?

jjerphan · 2020-08-11T14:16:29Z

Hi @k-dominik

One thing that is missing is maybe the complete list of methods to convert your data to hdf5 as mentioned in the FAQ.

That's true. On my side, I am using a simple python script to convert TIF stacks to hdf5 which can be boiled down to:

#! /usr/bin/env python

import argparse
import os
import h5py

from skimage import io

def main():
    parser = argparse.ArgumentParser("TIF Stack to hdf5 converter")

    parser.add_argument("in_tif", help="Input TIF stack (3D image)")
    parser.add_argument("out_folder", help="Output folder")

    args = parser.parse_args()

    tif_stack_file = args.in_tif
    data = io.imread(tif_stack_file)

    os.makedirs(args.out_folder, exist_ok=True)

    # Convert a path like '/path/to/file.name.ext' to 'file.name'
    basename = ".".join(tif_stack_file.split(os.sep)[-1].split(".")[:-1])

    file_name = os.path.join(args.out_folder, f"{basename}.h5")
    hf = h5py.File(file_name, 'w')
    # Chunking for better 3D access then
    hf.create_dataset("dataset", data=data, chunks=True)
    hf.close()

if __name__ == "__main__":
    main()

This example might be handy for people who are more comfortable using scripts that the plugin for example or who would like to automate some processing. I don't what the best ways to list methods is to be honest. Do you have any idea? 🙂

Would you think this could be more findable in the Data Selection documentation? Would you, from a user perspective even read it?

I don't really know: I am the kind of reader who only read the documentation when I have a problem — I don't remember to have read that section for example. From a user perspective, I mainly rely on the indication given by the GUI or CLI or explicit warnings (in logs for example). If I have ant trouble, I am first searching in the forum, in the issues and then in the doc with specific keywords.

But I think that this might really depend on the background of users.

k-dominik · 2020-08-13T15:23:29Z

Hi @jjerphan :)

That's true. On my side, I am using a simple python script to convert TIF stacks to hdf5 which can be boiled down to:

I'd think that people who are capable of writing their own scripts to convert data probably don't need any hints. So I'd probably just add those from the FAQ (which would be the only change I'd suggest for this PR).

But looking at your nice script (Thanks for sharing!) makes me think that we need a place to put things like these... -> #143

As a follow up on this PR I've opened #144 to maybe make all those performance tips more findable.

jjerphan · 2020-08-13T17:24:09Z

Hi @k-dominik

I'd think that people who are capable of writing their own scripts to convert data probably don't need any hints. So I'd probably just add those from the FAQ (which would be the only change I'd suggest for this PR).

Which scripts are you referring to? So that I can add them to this PR. 🙂

k-dominik · 2020-08-14T08:12:39Z

Hi @k-dominik

I'd think that people who are capable of writing their own scripts to convert data probably don't need any hints. So I'd probably just add those from the FAQ (which would be the only change I'd suggest for this PR).

Which scripts are you referring to? So that I can add them to this PR. slightly_smiling_face

I'd propose to keep this separate. I meant the script you have shared here for tif-stack to hdf5 conversion. i wanted to collect some more ideas/opinions on where to put them in #143

jjerphan · 2020-08-14T08:32:25Z

OK I see. Shall I modify or add something to this PR? 🙂

jjerphan · 2020-08-26T17:19:09Z

Up @k-dominik.

k-dominik · 2020-10-16T14:34:44Z

thank you very much for your contribution @jjerphan !

Add section about hdf5 in the headless mode doc.

1b5498e

See discussions on image.sc forum: https://forum.image.sc/t/notable-memory-usage-difference-when-running-ilastik-in-headless-mode-on-different-machines/41144/2

k-dominik reviewed Aug 11, 2020

View reviewed changes

k-dominik mentioned this pull request Aug 13, 2020

Need a place for users to share scripts/tools that employ ilastik #143

Open

k-dominik merged commit 34347f4 into ilastik:master Oct 16, 2020

jjerphan deleted the headless_hdf5_section branch October 16, 2020 21:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add section about hdf5 in the headless mode doc. #142

Add section about hdf5 in the headless mode doc. #142

jjerphan commented Aug 10, 2020

imagesc-bot commented Aug 10, 2020

k-dominik left a comment

jjerphan commented Aug 11, 2020 •

edited

Loading

k-dominik commented Aug 13, 2020

jjerphan commented Aug 13, 2020

k-dominik commented Aug 14, 2020

jjerphan commented Aug 14, 2020

jjerphan commented Aug 26, 2020

k-dominik commented Oct 16, 2020

Add section about hdf5 in the headless mode doc. #142

Add section about hdf5 in the headless mode doc. #142

Conversation

jjerphan commented Aug 10, 2020

imagesc-bot commented Aug 10, 2020

k-dominik left a comment

Choose a reason for hiding this comment

jjerphan commented Aug 11, 2020 • edited Loading

k-dominik commented Aug 13, 2020

jjerphan commented Aug 13, 2020

k-dominik commented Aug 14, 2020

jjerphan commented Aug 14, 2020

jjerphan commented Aug 26, 2020

k-dominik commented Oct 16, 2020

jjerphan commented Aug 11, 2020 •

edited

Loading