Conversation
This PR implements logic for the putmask function.
};

template <>
struct BoolMaskPolicy<VariantKind::GPU, false> {
does this need to be specific to boolean masks? this policy cannot be used without materializing a boolean mask in a case where, for example, the kernel needs to run when the input is not NaN. it'd be more general if this policy took a predicate that evaluates to a boolean value on each point.
Makes sense. I will make it more general and call it parallel_loop, probably.
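The difference the reviewer is pointing at can be illustrated in plain Python. This is a minimal sketch, not the actual cuNumeric kernel: `parallel_loop` here is a hypothetical name echoing the rename proposed above, and the element-wise loop stands in for the device kernel.

```python
import numpy as np

a = np.array([1.0, np.nan, 3.0])

# Boolean-mask approach: the mask is materialized as a full-size temporary
# before the kernel runs.
mask = ~np.isnan(a)
masked = np.where(mask, a * 2, a)

# Predicate approach: the condition is evaluated per element inside the
# loop, so no boolean array is ever stored. `parallel_loop` is a
# hypothetical stand-in for the generalized policy.
def parallel_loop(pred, func, arr):
    out = arr.copy()
    for i, x in enumerate(arr.flat):
        if pred(x):
            out.flat[i] = func(x)
    return out

pred_result = parallel_loop(lambda x: not np.isnan(x), lambda x: x * 2, a)
assert np.allclose(masked, pred_result, equal_nan=True)
```

A boolean-mask policy then becomes the special case where the predicate is just a lookup into an already-materialized mask.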
cunumeric/deferred.py
Outdated
is_scalar_value = False
if values.shape != self.shape and values.size == 1:
    if values.shape == ():
        values = values._convert_future_to_regionfield(True)
is there a reason that you do this? can't you just promote this scalar store to match the shape of self?
replaced this with broadcasting
cunumeric/deferred.py
Outdated
task.add_input(mask.base)
task.add_input(values.base)
task.add_output(self.base)
task.add_scalar_arg(is_scalar_value, bool)  # value is scalar
again, is there a reason that you don't do numpy broadcasting on the scalar value? that would make the code a lot simpler.
makes sense. replaced the logic with broadcasting
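In plain NumPy terms, the suggested promotion of a scalar store looks like `np.broadcast_to`, which expands a 0-d array to a target shape as a stride-0 view without copying. A small sketch of the idea (the 0-d array here is just an analogue of the scalar "future" store, not the actual Legate machinery):

```python
import numpy as np

# A 0-d array stands in for the scalar ("future") store.
scalar = np.asarray(7)

# Broadcasting promotes it to the target shape without a copy: the result
# is a read-only stride-0 view, so no conversion to a region field is needed.
expanded = np.broadcast_to(scalar, (4, 3))

assert expanded.shape == (4, 3)
assert (expanded == 7).all()
assert expanded.strides == (0, 0)  # no data was replicated
```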
cunumeric/deferred.py
Outdated
    task.add_broadcast(values.base)
else:
    task.add_alignment(self.base, values.base)
    task.add_broadcast(self.base, axes=range(1, self.ndim))
I don't understand this broadcast constraint... why is this necessary?
removed. I don't need it anymore
using namespace Legion;
using namespace legate;

template <VariantKind KIND, LegateTypeCode CODE, int DIM, int VDIM, bool SCALAR_VALUE = false>
like I said in the other comment, if you do numpy broadcasting on the values, you can remove the SCALAR_VALUE == true case.
cunumeric/module.py
Outdated
if a.dtype != values.dtype:
    values = values._warn_and_convert(a.dtype)
if values.shape != a.shape and values.size != 1:
the first condition should be values.size != size. Here's an example you can handle without wrapping or tiling but simply using numpy broadcasting:
import numpy as np
a = np.full((3, 3), 3)
np.putmask(a, np.full((3, 3), True), np.full(3, 10))
unless stated otherwise, it's safe to assume that all related array arguments follow the numpy broadcasting semantics.
I added a check for whether the arrays can be broadcast. We will call wrap only in the case when they can't.
tests/integration/test_putmask.py
Outdated
num.putmask(num_arr, num_mask, num_val)
assert np.array_equal(np_arr, num_arr)

# val is different shape
you need to subdivide this test into two: 1) when the shapes are different but the sizes are the same, 2) when both the shapes and sizes are different.
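A sketch of what those two cases look like in plain NumPy (the array contents are chosen here purely for illustration):

```python
import numpy as np

# Case 1: shapes differ but sizes match -- flat positions line up
# one-to-one, so the values land in flat order.
a = np.zeros((2, 3), dtype=int)
vals = np.arange(6).reshape(3, 2)
np.putmask(a, np.ones((2, 3), bool), vals)
assert a.ravel().tolist() == [0, 1, 2, 3, 4, 5]

# Case 2: both shape and size differ -- numpy wraps the values
# around by flat index.
b = np.zeros(5, dtype=int)
np.putmask(b, np.ones(5, bool), [7, 8])
assert b.tolist() == [7, 8, 7, 8, 7]
```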
has_set_value = set_value is not None and set_value.size == 1
if has_set_value:
    mask = DeferredArray(
        self.runtime,
        base=key_store,
        dtype=self.dtype,
    )
    rhs.putmask(mask, set_value)
    return False, rhs, rhs, self
I'm not a big fan of this code. the name of the function is _create_indexing_array, but this code makes it do more than just creating indexing arrays. and in fact, the function was already complicated enough to warrant a refactoring, which I was thinking of doing at some point. I guess I'm fine with accepting this edit for this PR, but keep in mind that _create_indexing_array can use some refactoring for better maintainability.
cunumeric/module.py
Outdated
a.put(indices=indices, values=values, mode=mode)


def _can_broadcast_to_shape(self: ndarray, shape: NdShape) -> bool:
can't we just use np.broadcast_shapes instead of this custom code?
sure, will use it instead with `try`/`except`
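For reference, a sketch of what that replacement might look like: `np.broadcast_shapes` raises `ValueError` for incompatible shapes, so the `try`/`except` covers the failure case. The function name below just mirrors the one in the diff; this is not the actual cunumeric implementation.

```python
import numpy as np

def can_broadcast_to_shape(src_shape, dst_shape):
    # Broadcasting src to dst succeeds iff the combined shape is dst itself.
    try:
        return np.broadcast_shapes(src_shape, dst_shape) == dst_shape
    except ValueError:
        # Incompatible shapes raise ValueError.
        return False

assert can_broadcast_to_shape((3,), (3, 3))
assert can_broadcast_to_shape((), (2, 2))
assert not can_broadcast_to_shape((2,), (3, 3))
assert not can_broadcast_to_shape((3, 3), (3,))
```

The equality check against `dst_shape` matters: `(3, 3)` broadcasts *with* `(3,)` to `(3, 3)`, but cannot be broadcast *to* `(3,)`.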
The version file is, ironically, causing the version string to be incorrect. Issue #666 documents what should be done to fix this properly. In the short term, I need a clean release version for the upcoming initial wheels release, hence the override. Longer term, we should override in CI on the release branch anyway, to switch to a post-release versioning scheme in release branches.