[ET-VK] Using TmpTensor for width packed versions of q_linear op shader to reduce memory usage.#7929

Merged

facebook-github-bot merged 8 commits intogh/trivedivivek/53/basefrom

gh/trivedivivek/53/head

Jan 28, 2025

Contributor

trviv commented Jan 24, 2025 •

edited

Loading

Stack from ghstack (oldest at bottom):

This diff introduces the use of temporary tensors to reduce memory usage in the width packed versions of the q_linear op shader.

Differential Revision: D68561647


          [ET-VK] Using TmpTensor for width packed versions of q_linear op shad…

3b95e14

…er to reduce memory usage.

This diff introduces the use of temporary tensors to reduce memory usage in the width packed versions of the q_linear op shader.

Differential Revision: [D68561647](https://our.internmc.facebook.com/intern/diff/D68561647/)

[ghstack-poisoned]

pytorch-bot bot commented Jan 24, 2025 •

edited

Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/7929

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 61cf64e with merge base bdd3d9c ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot added the CLA Signed label

This was referenced Jan 24, 2025

[ET-VK] Minor improvement to q_linear op shader. #7728

Merged

[ET-VK] Using push constants for conv2d pw. #7814

Merged

[ET-VK] Minor improvements to conv2d pw and dw bounds check. #7815

Merged

[ET-VK] Splitting TILE_SIZE to TILE_SIZE_X and TILE_SIZE_Y in conv2d pw. #7816

Merged

[ET-VK] Using shared memory offsetting in conv2d pw and saving ivec3 pos instead of ivec2 to improve performance. #7817

Merged

Contributor

facebook-github-bot commented Jan 24, 2025

This pull request was exported from Phabricator. Differential Revision: D68561647

trviv mentioned this pull request

[ET-VK] Using shared memory to save position in conv2d dw output op. #7923

Merged

facebook-github-bot added the fb-exported label

trviv mentioned this pull request

[ET-VK] Using push constants for conv2d dw. #7928

Merged

trviv added a commit that referenced this pull request


          [ET-VK] Using TmpTensor for width packed versions of q_linear op shad…

865dc96

…er to reduce memory usage.

This diff introduces the use of temporary tensors to reduce memory usage in the width packed versions of the q_linear op shader.

Differential Revision: [D68561647](https://our.internmc.facebook.com/intern/diff/D68561647/)

ghstack-source-id: 262822997
Pull Request resolved: #7929


          Update on "[ET-VK] Using TmpTensor for width packed versions of q_lin…

1f9f2dd

…ear op shader to reduce memory usage."

This diff introduces the use of temporary tensors to reduce memory usage in the width packed versions of the q_linear op shader.

Differential Revision: [D68561647](https://our.internmc.facebook.com/intern/diff/D68561647/)

[ghstack-poisoned]

Contributor

facebook-github-bot commented Jan 24, 2025

This pull request was exported from Phabricator. Differential Revision: D68561647

trviv added a commit that referenced this pull request


          [ET-VK] Using TmpTensor for width packed versions of q_linear op shad…

83a89b9

…er to reduce memory usage.

Pull Request resolved: #7929

This diff introduces the use of temporary tensors to reduce memory usage in the width packed versions of the q_linear op shader.
ghstack-source-id: 262858894
@exported-using-ghexport

Differential Revision: [D68561647](https://our.internmc.facebook.com/intern/diff/D68561647/)

trviv added the topic: not user facing label

SS-JIA approved these changes

View reviewed changes


          Update on "[ET-VK] Using TmpTensor for width packed versions of q_lin…

7e9e9b9

…ear op shader to reduce memory usage."

This diff introduces the use of temporary tensors to reduce memory usage in the width packed versions of the q_linear op shader.

Differential Revision: [D68561647](https://our.internmc.facebook.com/intern/diff/D68561647/)

[ghstack-poisoned]

Contributor

facebook-github-bot commented Jan 24, 2025

This pull request was exported from Phabricator. Differential Revision: D68561647

trviv added a commit that referenced this pull request


          [ET-VK] Using TmpTensor for width packed versions of q_linear op shad…

c6ca568

…er to reduce memory usage.

Pull Request resolved: #7929

This diff introduces the use of temporary tensors to reduce memory usage in the width packed versions of the q_linear op shader.
ghstack-source-id: 262972267
@exported-using-ghexport

Differential Revision: [D68561647](https://our.internmc.facebook.com/intern/diff/D68561647/)


          Update on "[ET-VK] Using TmpTensor for width packed versions of q_lin…

597b534

…ear op shader to reduce memory usage."

This diff introduces the use of temporary tensors to reduce memory usage in the width packed versions of the q_linear op shader.

Differential Revision: [D68561647](https://our.internmc.facebook.com/intern/diff/D68561647/)

[ghstack-poisoned]

Contributor

facebook-github-bot commented Jan 27, 2025

This pull request was exported from Phabricator. Differential Revision: D68561647

trviv added a commit that referenced this pull request


          [ET-VK] Using TmpTensor for width packed versions of q_linear op shad…

017914c

…er to reduce memory usage.

Pull Request resolved: #7929

This diff introduces the use of temporary tensors to reduce memory usage in the width packed versions of the q_linear op shader.
ghstack-source-id: 263214012
@exported-using-ghexport

Differential Revision: [D68561647](https://our.internmc.facebook.com/intern/diff/D68561647/)


          Update on "[ET-VK] Using TmpTensor for width packed versions of q_lin…

05bf256

…ear op shader to reduce memory usage."

This diff introduces the use of temporary tensors to reduce memory usage in the width packed versions of the q_linear op shader.

Differential Revision: [D68561647](https://our.internmc.facebook.com/intern/diff/D68561647/)

[ghstack-poisoned]

Contributor

facebook-github-bot commented Jan 27, 2025

This pull request was exported from Phabricator. Differential Revision: D68561647

trviv added a commit that referenced this pull request


          [ET-VK] Using TmpTensor for width packed versions of q_linear op shad…

52d7cbe

…er to reduce memory usage.

Pull Request resolved: #7929

This diff introduces the use of temporary tensors to reduce memory usage in the width packed versions of the q_linear op shader.
ghstack-source-id: 263238737
@exported-using-ghexport

Differential Revision: [D68561647](https://our.internmc.facebook.com/intern/diff/D68561647/)


          Update on "[ET-VK] Using TmpTensor for width packed versions of q_lin…

0d82b5e

…ear op shader to reduce memory usage."

This diff introduces the use of temporary tensors to reduce memory usage in the width packed versions of the q_linear op shader.

Differential Revision: [D68561647](https://our.internmc.facebook.com/intern/diff/D68561647/)

[ghstack-poisoned]

trviv added a commit that referenced this pull request


          [ET-VK] Using TmpTensor for width packed versions of q_linear op shad…

6a5e2da

…er to reduce memory usage.

Pull Request resolved: #7929

This diff introduces the use of temporary tensors to reduce memory usage in the width packed versions of the q_linear op shader.
ghstack-source-id: 263366400
@exported-using-ghexport

Differential Revision: [D68561647](https://our.internmc.facebook.com/intern/diff/D68561647/)

Contributor

facebook-github-bot commented Jan 28, 2025

This pull request was exported from Phabricator. Differential Revision: D68561647


          Update on "[ET-VK] Using TmpTensor for width packed versions of q_lin…

8859a72

…ear op shader to reduce memory usage."

This diff introduces the use of temporary tensors to reduce memory usage in the width packed versions of the q_linear op shader.

Differential Revision: [D68561647](https://our.internmc.facebook.com/intern/diff/D68561647/)

[ghstack-poisoned]

Contributor

facebook-github-bot commented Jan 28, 2025

This pull request was exported from Phabricator. Differential Revision: D68561647

trviv added a commit that referenced this pull request


          [ET-VK] Using TmpTensor for width packed versions of q_linear op shad…

a47f5ba

…er to reduce memory usage.

Pull Request resolved: #7929

This diff introduces the use of temporary tensors to reduce memory usage in the width packed versions of the q_linear op shader.
ghstack-source-id: 263440664
@exported-using-ghexport

Differential Revision: [D68561647](https://our.internmc.facebook.com/intern/diff/D68561647/)


          Update on "[ET-VK] Using TmpTensor for width packed versions of q_lin…

61cf64e

…ear op shader to reduce memory usage."

This diff introduces the use of temporary tensors to reduce memory usage in the width packed versions of the q_linear op shader.

Differential Revision: [D68561647](https://our.internmc.facebook.com/intern/diff/D68561647/)

[ghstack-poisoned]

Contributor

facebook-github-bot commented Jan 28, 2025

This pull request was exported from Phabricator. Differential Revision: D68561647

trviv added a commit that referenced this pull request


          [ET-VK] Using TmpTensor for width packed versions of q_linear op shad…

f28179e

…er to reduce memory usage.

Pull Request resolved: #7929

This diff introduces the use of temporary tensors to reduce memory usage in the width packed versions of the q_linear op shader.
ghstack-source-id: 263456691
@exported-using-ghexport

Differential Revision: [D68561647](https://our.internmc.facebook.com/intern/diff/D68561647/)

facebook-github-bot merged commit a88e880 into gh/trivedivivek/53/base

47 checks passed

facebook-github-bot deleted the gh/trivedivivek/53/head branch

January 28, 2025 19:37

facebook-github-bot temporarily deployed to cherry-pick-bot

January 28, 2025 19:37

— with

GitHub Actions Inactive

pytorchbot mentioned this pull request

[ET-VK] Using TmpTensor for width packed versions of q_linear op shader to reduce memory usage. #8009

Merged

manuelcandales pushed a commit that referenced this pull request


          [ET-VK] Using TmpTensor for width packed versions of q_linear op shad…

d1104a7

…er to reduce memory usage. (#8009)

Pull Request resolved: #7929

This diff introduces the use of temporary tensors to reduce memory usage in the width packed versions of the q_linear op shader.
ghstack-source-id: 263456691
@exported-using-ghexport

Differential Revision: [D68561647](https://our.internmc.facebook.com/intern/diff/D68561647/)

Co-authored-by: Vivek Trivedi <5340687+trivedivivek@users.noreply.github.com>

trviv added a commit that referenced this pull request


          [ET-VK] Using push constants for conv2d dw. (#8008)

c5fea7e

* [ET-VK] Using shared memory to save position in conv2d dw output op.

Pull Request resolved: #7923

This diff introduces a change to conv2d dw op to save output positions in shared memory, which reduces register usage and improves performance.
ghstack-source-id: 263440666
@exported-using-ghexport

Differential Revision: [D68400890](https://our.internmc.facebook.com/intern/diff/D68400890/)

* [ET-VK] Using push constants for conv2d dw.

Pull Request resolved: #7928

This diff is related to the use of push constants for convolutional dw (depthwise) in Executorch's Vulkan backend. This optimization improves memory usage.
ghstack-source-id: 263440665
@exported-using-ghexport

Differential Revision: [D68493849](https://our.internmc.facebook.com/intern/diff/D68493849/)

* [ET-VK] Using TmpTensor for width packed versions of q_linear op shader to reduce memory usage. (#8009)

Pull Request resolved: #7929

This diff introduces the use of temporary tensors to reduce memory usage in the width packed versions of the q_linear op shader.
ghstack-source-id: 263456691
@exported-using-ghexport

Differential Revision: [D68561647](https://our.internmc.facebook.com/intern/diff/D68561647/)

Co-authored-by: Vivek Trivedi <5340687+trivedivivek@users.noreply.github.com>

---------

Co-authored-by: Vivek Trivedi <5340687+trivedivivek@users.noreply.github.com>

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed fb-exported topic: not user facing