
Implementing Putmask #667

Merged: 25 commits into nv-legate:branch-22.12 on Nov 16, 2022
Conversation

@ipdemes (Contributor) commented Oct 24, 2022

No description provided.

@ipdemes added the category:new-feature label (PR introduces a new feature and will be classified as such in release notes) on Oct 24, 2022
@ipdemes self-assigned this on Oct 24, 2022
@ipdemes (Contributor Author) commented Oct 26, 2022

This PR implements the logic for the putmask function.
It also optimizes the advanced-indexing case a[bool_indices] = scalar by calling putmask instead of doing a sparse indirect copy.
The last commit is a replacement for PR #639.
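To make the optimization concrete, here is a small plain-NumPy sketch (array names are illustrative) of the equivalence being exploited: assigning a scalar through a boolean index asks for the same result as putmask, so the assignment can be lowered to a putmask task rather than a sparse indirect copy.

```python
import numpy as np

# putmask writes the scalar into `a` in place wherever the mask is True.
a = np.arange(6).reshape(2, 3)
np.putmask(a, a % 2 == 0, 0)

# The advanced-indexing form requests the same result, so it can reuse
# the putmask path instead of a sparse indirect copy.
b = np.arange(6).reshape(2, 3)
b[b % 2 == 0] = 0

assert np.array_equal(a, b)
```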

cunumeric/module.py (resolved)
};

template <>
struct BoolMaskPolicy<VariantKind::GPU, false> {
Contributor:

does this need to be specific to boolean masks? as written, this policy can't be used without first materializing a boolean mask, even in cases where, for example, the kernel only needs to run where the input is not NaN. it'd be more general if the policy took a predicate that evaluates to a boolean value at each point.

Contributor Author:

Makes sense. I will make it more general and call it parallel_loop, probably.
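A minimal Python sketch of the idea (the real code is a C++/CUDA policy; parallel_loop is only a stand-in for the name mentioned above, and pred/body/kernel are hypothetical): the loop takes a predicate evaluated at each point instead of a pre-materialized boolean mask.

```python
import numpy as np

def parallel_loop(pred, points, body):
    # Run `body` at every point where `pred` holds; no boolean mask
    # array is ever materialized.
    for p in points:
        if pred(p):
            body(p)

# Example: apply a kernel only where the input is not NaN.
data = np.array([1.0, np.nan, 3.0])
out = np.zeros_like(data)

def kernel(i):
    out[i] = data[i] * 2

parallel_loop(lambda i: not np.isnan(data[i]), range(data.size), kernel)
assert np.array_equal(out, np.array([2.0, 0.0, 6.0]))
```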

is_scalar_value = False
if values.shape != self.shape and values.size == 1:
    if values.shape == ():
        values = values._convert_future_to_regionfield(True)
Contributor:

is there a reason that you do this? can't you just promote this scalar store to match the shape of self?

Contributor Author:

replaced this with broadcasting
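In plain NumPy terms, the suggestion amounts to something like the following sketch (cunumeric promotes the scalar store rather than calling np.broadcast_to, but the semantics are the same): the scalar is viewed at the target shape without materializing a full-size buffer.

```python
import numpy as np

value = np.asarray(5)              # 0-d scalar array, shape ()
target_shape = (3, 3)

promoted = np.broadcast_to(value, target_shape)
assert promoted.shape == target_shape
assert promoted.strides == (0, 0)  # stride-0 view over the single element
```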

task.add_input(mask.base)
task.add_input(values.base)
task.add_output(self.base)
task.add_scalar_arg(is_scalar_value, bool) # value is scalar
Contributor:

again, is there a reason that you don't do numpy broadcasting on the scalar value? that would make the code a lot simpler.

Contributor Author:

makes sense. replaced the logic with broadcasting

    task.add_broadcast(values.base)
else:
    task.add_alignment(self.base, values.base)
    task.add_broadcast(self.base, axes=range(1, self.ndim))
Contributor:

I don't understand this broadcast constraint... why is this necessary?

Contributor Author:

removed. I don't need it anymore

using namespace Legion;
using namespace legate;

template <VariantKind KIND, LegateTypeCode CODE, int DIM, int VDIM, bool SCALAR_VALUE = false>
Contributor:

like I said in the other comment, if you do numpy broadcasting on the values, you can remove the SCALAR_VALUE == true case.


if a.dtype != values.dtype:
    values = values._warn_and_convert(a.dtype)
if values.shape != a.shape and values.size != 1:
Contributor:

the first condition should be values.size != size. Here's an example you can handle without wrapping or tiling, simply by using numpy broadcasting:

import numpy as np
a = np.full((3, 3), 3)
np.putmask(a, np.full((3, 3), True), np.full(3, 10))
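For reference, under NumPy's documented putmask semantics (a smaller values array is repeated) and equally under plain broadcasting of the size-3 row against the (3, 3) target, the example above fills every element:

```python
import numpy as np

a = np.full((3, 3), 3)
np.putmask(a, np.full((3, 3), True), np.full(3, 10))
assert (a == 10).all()  # the three-element values row covers every position
```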

Contributor:

unless stated otherwise, it's safe to assume that all related array arguments follow the numpy broadcasting semantics.

Contributor Author:

I added a check for whether the arrays can be broadcast. We will call wrap only in the case when they can't.

num.putmask(num_arr, num_mask, num_val)
assert np.array_equal(np_arr, num_arr)

# val is different shape
Contributor:

you need to subdivide this test into two: 1) when the shapes are different but the sizes are the same, 2) when both the shapes and sizes are different.
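A sketch of how the split might look, using plain NumPy only (the actual test would mirror each call against cunumeric and compare with np.array_equal as above; the shapes here are illustrative):

```python
import numpy as np

# Case 1: shapes differ but sizes match -- values simply fill `a` in order.
a1 = np.zeros((2, 3), dtype=int)
np.putmask(a1, np.ones((2, 3), dtype=bool), np.arange(6).reshape(3, 2))
assert np.array_equal(a1, np.arange(6).reshape(2, 3))

# Case 2: both shapes and sizes differ -- values wrap (are repeated).
a2 = np.zeros((2, 3), dtype=int)
np.putmask(a2, np.ones((2, 3), dtype=bool), np.array([7, 8]))
assert np.array_equal(a2, np.array([[7, 8, 7], [8, 7, 8]]))
```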

Comment on lines +593 to +601
has_set_value = set_value is not None and set_value.size == 1
if has_set_value:
    mask = DeferredArray(
        self.runtime,
        base=key_store,
        dtype=self.dtype,
    )
    rhs.putmask(mask, set_value)
    return False, rhs, rhs, self
Contributor:

I'm not a big fan of this code. the name of the function is _create_indexing_array, but this code makes it do more than just creating indexing arrays. and in fact, the function was already complicated enough to warrant a refactoring, which I was thinking of doing at some point. I guess I'm fine with accepting this edit for this PR, but keep in mind that _create_indexing_array can use some refactoring for better maintainability.

@@ -3506,6 +3506,58 @@ def put(
    a.put(indices=indices, values=values, mode=mode)


def _can_broadcast_to_shape(self: ndarray, shape: NdShape) -> bool:
Contributor:

can't we just use np.broadcast_shapes instead of this custom code?

Contributor Author:

sure, will use it instead with `try/except`
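A minimal sketch of what that helper might reduce to with np.broadcast_shapes (NumPy >= 1.20) and the try/except idiom; the signature here takes a plain shape tuple rather than the ndarray method form in the diff, so the actual code will differ:

```python
import numpy as np

def _can_broadcast_to_shape(arr_shape: tuple, shape: tuple) -> bool:
    # True if an array of shape `arr_shape` broadcasts to exactly `shape`.
    try:
        return np.broadcast_shapes(arr_shape, shape) == shape
    except ValueError:
        return False

assert _can_broadcast_to_shape((3,), (3, 3))       # a row broadcasts across rows
assert not _can_broadcast_to_shape((2,), (3, 3))   # trailing dims are incompatible
```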

@ipdemes merged commit db2a4f8 into nv-legate:branch-22.12 on Nov 16, 2022
@ipdemes deleted the putmask branch on January 12, 2023