
Use int64 when calculating data size in split_acquisition_function #795

Merged · 3 commits · Nov 22, 2023

Conversation

khurram-ghani (Collaborator)

Related issue(s)/PRs: None

Summary

Use int64 to calculate the input tensor size when splitting acquisition functions. Otherwise, we overflow for large tensors, e.g. of shape [30000, 1000, 100].
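The overflow is easy to reproduce with plain arithmetic: the example shape has 30,000 × 1,000 × 100 = 3,000,000,000 elements, which exceeds the `int32` maximum of 2,147,483,647 (and `tf.size` returns `int32` by default). A minimal numpy sketch of the wraparound:

```python
import numpy as np

# Shape from the PR description: [30000, 1000, 100].
shape = (30_000, 1_000, 100)

# int32 arithmetic silently wraps around, just as tf.size's default
# int32 output would; int64 holds the true size.
size_int32 = int(np.prod(shape, dtype=np.int32))  # negative: overflowed
size_int64 = int(np.prod(shape, dtype=np.int64))  # 3_000_000_000

print(size_int32, size_int64)
```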

It is tricky to add a unit test covering the limit, as we run into memory issues. Using mocking or sparse tensors gets a bit messy, so I am skipping unit tests for this simple change.

Fully backwards compatible: yes

PR checklist

  • The quality checks are all passing
  • The bug case / new feature is covered by tests
  • Any new features are well-documented (in docstrings or notebooks)

hstojic (Collaborator) left a comment:

see the comment...

```diff
@@ -48,7 +48,8 @@ def wrapper(x: TensorType) -> TensorType:
         if length == 0:
             return fn(x)

-        elements_per_block = tf.size(x) / length
+        # Use int64 to calculate the input tensor size, otherwise we can overflow for large tensors.
```
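The added code line following the comment is truncated in the hunk above; `tf.size` does accept `out_type=tf.int64`, which is the natural remedy, though the exact merged line is not shown here. A numpy sketch of the wrapper's size arithmetic, with a hypothetical helper name, illustrates why the dtype matters:

```python
import numpy as np

def elements_per_block(shape, length):
    """Hypothetical helper mirroring the wrapper's arithmetic:
    total size of x divided by the leading split dimension.
    int64 keeps the product exact for very large tensors."""
    total = np.prod(shape, dtype=np.int64)  # would wrap around in int32
    return int(total // np.int64(length))

print(elements_per_block((30_000, 1_000, 100), 30_000))
```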
hstojic (Collaborator) commented on the diff:

This is good, but could you please add a few sentences to the docstrings about memory requirements with respect to scaling?

khurram-ghani (Collaborator, Author):

Do you mean the docstring for split_acquisition_function? Could you give an example, please? I could say that the memory usage of tensor x is its flattened size times 8, but that is fairly obvious. Or do you mean the optimizer docstring?

hstojic (Collaborator):

I think it would be more appropriate in the generate_continuous_optimizer docstring, where tiled_candidates becomes very big. I was thinking of an additional note warning that using a large num_initial_samples with a high-dimensional search space and/or large batch sizes will require a lot of memory.
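As a rough illustration of that warning (the sizes below are hypothetical, and float64 storage at 8 bytes per element is assumed):

```python
# Hypothetical sizes: initial samples x batch size x search-space dims.
num_initial_samples = 30_000
batch_size = 1_000
dims = 100

# float64 tensors use 8 bytes per element, so the tiled candidates
# alone need tens of gigabytes at this scale.
gigabytes = num_initial_samples * batch_size * dims * 8 / 1e9
print(f"~{gigabytes:.0f} GB")
```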

khurram-ghani (Collaborator, Author):

I have added a note.

khurram-ghani merged commit ae11f17 into develop on Nov 22, 2023 (12 checks passed).
khurram-ghani deleted the khurram/split_ack_to_int64 branch on November 22, 2023 at 17:43.