Add NAN handling to convert() needed for some prefix routines with integer outputs. by rkarim2 · Pull Request #502 · nv-legate/cupynumeric

rkarim2 · 2022-08-04T21:06:15Z

No description provided.

cunumeric/deferred.py

cunumeric/eager.py

magnatelee · 2022-08-18T06:50:05Z

src/cunumeric/unary/convert_template.inl

-  type_dispatch(args.in.code(), SourceTypeDispatch<KIND>{}, args);
+  ConvertArgs args{
+    context.outputs()[0], context.inputs()[0], context.scalars()[0].value<ConvertCode>()};
+  op_dispatch(args.nan_op, ConvertDispatch<KIND>{}, args);


Doing this dispatch upfront triples the number of template instantiations, even though the dispatch on nan_op is unnecessary when the source has an integer type (and please remember there are more integer types than floating point types and complex types combined). A more desirable implementation would be to instantiate a special conversion logic only for pairs of types that need it. You can express those pairs using a template like this:

template <LegateTypeCode SRC_TYPE, LegateTypeCode DST_TYPE>
constexpr bool needs_dispatch_on_nan_op =
  (legate::is_floating_point<SRC_TYPE>::value || legate::is_complex<SRC_TYPE>::value) &&
  legate::is_integral<DST_TYPE>::value;

Then you move the dispatch on nan_op to the innermost template and do it only when needs_dispatch_on_nan_op is true.

rkarim2 · 2022-08-26

Resolved in multiple commits; the fix was completed in d20450a.

Adjusted nancumsum/nancumprod implementation to switch to the faster cumsum/cumprod if NAN conversion is already handled by convert.

With the change to convert's templatization, nancumsum/nancumprod must be rerouted to cumsum/cumprod at the Python level, before convert is called, whenever the input type is not float/complex; the reroute is also faster. The tests now cover nancumsum/nancumprod for non-float/complex input types to catch any bugs in this path.
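The routing described here can be sketched in a few lines. This is only an illustration under stated assumptions, not the cuNumeric code: `route_nan_scan` and the string type kinds (`"int"`, `"float"`, `"complex"`) are hypothetical names invented for this example.

```python
# Hypothetical sketch of the Python-level routing; names are not from cuNumeric.

PLAIN_SCAN = {"nancumsum": "cumsum", "nancumprod": "cumprod"}


def route_nan_scan(op, src_kind, dst_kind):
    """Return the name of the scan a nan-scan request should actually run.

    op       -- "nancumsum" or "nancumprod"
    src_kind -- kind of the input type, e.g. "int", "float", "complex"
    dst_kind -- kind of the output type
    """
    plain = PLAIN_SCAN[op]
    if src_kind not in ("float", "complex"):
        # The input cannot hold NaN, so the NaN-aware scan is unnecessary.
        return plain
    if dst_kind == "int":
        # Assumed: convert has already replaced NaNs with the identity
        # while casting to the integer output type.
        return plain
    # Float/complex input and output: NaNs survive, keep the NaN-aware scan.
    return op


print(route_nan_scan("nancumsum", "int", "int"))
print(route_nan_scan("nancumprod", "float", "int"))
print(route_nan_scan("nancumsum", "float", "float"))
```

Only the last case keeps the NaN-aware scan; the first two fall back to the plain one.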

magnatelee · 2022-09-02T17:15:35Z

LGTM. feel free to merge it once you fix the merge conflict

rkarim2 requested a review from magnatelee August 4, 2022 23:19

rkarim2 added 11 commits August 9, 2022 18:03

b683376 · Changes to convert() in python side to allow special treatment for NANs when converting from complex/float to ints. Eager currently incomplete. On C++ side need to add OP handling to allow templatization on NAN identity values (for performance reasons).

8fb671e · Modified the python side of convert for proper OP handling.

Remove old comments.

45fc7dc · Partial NAN_OP implementation for convert() to allow NAN conversion to identity based on operation. Not tested and likely buggy!

76c0eb7 · Bugfix. Added missing header for isnan.

2fd1858 · Multiple Bugfixes. Builds without errors now.

6043bad · Multiple changes:

Bugfixes. Changed the prefix routines to use the convert with NAN handling. Remove unnecessary code from convert. Added nan handling to eager variant of convert. Initial testing seems to work correctly.

23bb30e · Multiple changes:

Modified the test code for scan routines, enabling int type outputs on float/complex inputs with NAN handling. Modified the scan test code's parametrization. Removed commented out unnecessary code from convert. pre-commit fixes.

4af64c1 · Remove tests for NAN to int conversion.

Change input ranges to avoid overflows resulting in NANs (NANs are still tested based on n0)

1988a38 · Change range to avoid overflows in complex types.

c4f4d8b · Reduced test variations and size to speed up CI, reduce test value range to avoid overflows.

07e8340 · Refactor isnan (replace Isnan with is_nan) and remove redundant file.
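The behaviour these commits add to convert(), replacing NaN with the identity of the scan operation when casting float/complex values to integers, can be illustrated with a small pure-Python sketch. It is not the cuNumeric implementation: `convert_to_int`, `NAN_IDENTITY`, and the `"sum"`/`"prod"` keys are hypothetical names, and dropping the imaginary part of complex values is an assumption of this example.

```python
import cmath
import operator
from itertools import accumulate

# Hypothetical mapping: scan operation -> identity element used in place of NaN.
NAN_IDENTITY = {"sum": 0, "prod": 1}


def convert_to_int(values, nan_op=None):
    """Cast float/complex values to int, mapping NaN to the identity of nan_op."""
    out = []
    for v in values:
        if nan_op is not None and cmath.isnan(v):
            # NaN has no integer representation; substitute the value that
            # leaves the running sum or product unchanged.
            out.append(NAN_IDENTITY[nan_op])
        else:
            # Assumption of this sketch: keep only the real part of complex values.
            out.append(int(complex(v).real))
    return out


data = [1.5, float("nan"), 3.0]

as_sum_input = convert_to_int(data, nan_op="sum")
print(as_sum_input)                    # [1, 0, 3]
print(list(accumulate(as_sum_input)))  # [1, 1, 4]

as_prod_input = convert_to_int(data, nan_op="prod")
print(list(accumulate(as_prod_input, operator.mul)))  # [1, 1, 3]
```

Because the NaN is replaced during the cast, a plain cumulative sum or product over the integer output gives the NaN-ignoring result without a separate NaN-aware scan.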

rkarim2 force-pushed the convert_dev branch from 701bb4d to 07e8340 Compare August 10, 2022 01:28

magnatelee requested changes Aug 18, 2022

rkarim2 added 8 commits August 25, 2022 12:27

6fc9bb6 · Merge branch 'branch-22.10' into convert_dev

1960182 · bugfix.

661ee85 · Change to how eager processes nan conversion.

bd361b6 · Set default value for ConvertCode in eager and deferred to remove unnecessary code.

06b58de · Moved the convert OP template dispatch to after src_type to allow disabling unnecessary templates when input is not float/complex (to be disabled in a future commit)

f92ae2d · Removed Unnecessary templates for convert.

Adjusted nancumsum/nancumprod implementation to switch to the faster cumsum/cumprod if NAN conversion is already handled by convert.

d20450a · Fixing typing for convert in thunk to match eager and deferred.

rkarim2 added the category:new-feature label (PR introduces a new feature and will be classified as such in release notes) Aug 31, 2022

magnatelee approved these changes Sep 2, 2022

076e978 · Merge branch 'branch-22.10' into convert_dev

rkarim2 merged commit 569e527 into nv-legate:branch-22.10 Sep 4, 2022

rkarim2 deleted the convert_dev branch September 4, 2022 17:40

rkarim2 merged 20 commits into nv-legate:branch-22.10 from rkarim2:convert_dev