Make reduced-precision cuBLAS mode opt-in #519
manopapad merged 1 commit into nv-legate:branch-22.10
Conversation
```cpp
if (nullptr == disable_tensor_cores) {
  // No request to disable tensor cores so turn them on
  cublasStatus_t status = cublasSetMathMode(cublas_, CUBLAS_TENSOR_OP_MATH);
const char* fast_math = getenv("CUNUMERIC_FAST_MATH");
```
I chose this over CUNUMERIC_DISABLE_TENSOR_CORES, because if we don't set anything then cuBLAS will still use tensor cores, but only where that wouldn't result in precision loss.
```cpp
const char* fast_math = getenv("CUNUMERIC_FAST_MATH");
if (fast_math != nullptr && atoi(fast_math) > 0) {
  // Enable acceleration of single precision routines using TF32 tensor cores.
  cublasStatus_t status = cublasSetMathMode(cublas_, CUBLAS_TF32_TENSOR_OP_MATH);
```
CUBLAS_TENSOR_OP_MATH is deprecated.
Are TF32 tensor cores not used when this math mode isn't set? This table says that CUBLAS_DEFAULT_MATH can use tensor cores whenever possible, which might include TF32 tensor cores:
https://docs.nvidia.com/cuda/cublas/index.html#cublasmath_t
If we want to be absolutely sure about precision, I guess we should set CUBLAS_PEDANTIC_MATH in the else branch.
AFAIU in CUBLAS_PEDANTIC_MATH mode tensor cores are disabled altogether, along with (I assume) other mostly-safe optimizations like operation reordering.
It sounds like CUBLAS_DEFAULT_MATH is guaranteed to respect the (minimum) level of precision prescribed by the operation (e.g. cublasSgemmEx prescribes the use of fp32-wide intermediates) but can adjust upwards if it makes sense, and will use tensor cores where safe. This is in contrast to CUBLAS_TF32_TENSOR_OP_MATH, which allows the use of tf32 intermediates for cublasSgemmEx, and CUBLAS_TENSOR_OP_MATH, which allows the use of fp16 intermediates.
Therefore, I don't think we want CUBLAS_PEDANTIC_MATH to be the default. How about I just change the default to be CUBLAS_DEFAULT_MATH, and activate CUBLAS_PEDANTIC_MATH if the user passes CUNUMERIC_DISABLE_TENSOR_CORES? We can then pass this flag during testing, if necessary.
I just wanted to confirm that the default setting is sufficient to avoid the precision problem we're seeing on A100s. CUBLAS_PEDANTIC_MATH is unnecessary if CUBLAS_DEFAULT_MATH already does the right thing.