[Executorch][LLM] Use caching allocator for runner #15656
kimishpatel wants to merge 1 commit into gh/kimishpatel/207/base
We observed that on iOS this improves performance by 6%, because the SDPA op makes temporary allocations. There is no significant difference on Android.

Differential Revision: [D86120038](https://our.internmc.facebook.com/intern/diff/D86120038/)
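For context, the benefit described above comes from reusing scratch buffers instead of going back to the system allocator on every op invocation. The following is a minimal sketch of that idea, assuming a size-keyed free list and a `reset()` call between steps. `ToyCachingAllocator` and `simulate_decode` are hypothetical names used only for illustration; this is not the allocator the PR wires into the runner, and it ignores alignment and thread safety.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <map>
#include <vector>

// Hypothetical toy allocator for illustration only (not ExecuTorch's API).
// Blocks are cached by exact size and reused after reset().
class ToyCachingAllocator {
 public:
  ToyCachingAllocator() = default;
  ToyCachingAllocator(const ToyCachingAllocator&) = delete;
  ToyCachingAllocator& operator=(const ToyCachingAllocator&) = delete;

  ~ToyCachingAllocator() {
    // Release both cached and still-outstanding blocks.
    for (auto& entry : free_blocks_) {
      for (void* p : entry.second) std::free(p);
    }
    for (auto& entry : in_use_) std::free(entry.first);
  }

  // Returns a block of `size` bytes, reusing a cached block of the same
  // size when one is available; otherwise falls back to malloc.
  void* allocate(std::size_t size) {
    if (size == 0) return nullptr;
    void* ptr = nullptr;
    auto it = free_blocks_.find(size);
    if (it != free_blocks_.end() && !it->second.empty()) {
      ptr = it->second.back();
      it->second.pop_back();
    } else {
      ptr = std::malloc(size);
      if (ptr == nullptr) return nullptr;
      ++system_allocations_;
    }
    in_use_[ptr] = size;
    return ptr;
  }

  // Moves every outstanding block back into the cache. Intended to be
  // called once the temporaries of a step are no longer needed.
  void reset() {
    for (auto& entry : in_use_) {
      free_blocks_[entry.second].push_back(entry.first);
    }
    in_use_.clear();
  }

  // Number of times the allocator had to call malloc.
  std::size_t system_allocations() const { return system_allocations_; }

 private:
  std::map<std::size_t, std::vector<void*>> free_blocks_;
  std::map<void*, std::size_t> in_use_;
  std::size_t system_allocations_ = 0;
};

// Simulates an op that requests one same-sized scratch buffer per step.
// Returns the total number of malloc calls made by the allocator so far.
std::size_t simulate_decode(ToyCachingAllocator& alloc, int steps,
                            std::size_t scratch_bytes) {
  for (int i = 0; i < steps; ++i) {
    void* scratch = alloc.allocate(scratch_bytes);
    (void)scratch;  // A real op would use the buffer here.
    alloc.reset();
  }
  return alloc.system_allocations();
}
```

In this sketch, repeated steps that request the same scratch size reach `malloc` only on the first step; later steps are served from the cache. How much that matters in practice depends on the platform's system allocator, which is consistent with the iOS and Android results differing.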
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15656
Note: Links to docs will display an error until the docs builds have been completed.
❌ 116 New Failures, 5 Unrelated Failures
As of commit 66a37e3 with merge base 15a0fcd:
NEW FAILURES - The following jobs have failed:
BROKEN TRUNK - The following jobs failed but were present on the merge base:
👉 Rebase onto the `viable/strict` branch to avoid these failures
This comment was automatically generated by Dr. CI and updates every 15 minutes.
We observed that on iOS this improves performance by 6%, because the SDPA op makes temporary allocations. There is no significant difference on Android.

Differential Revision: [D86120038](https://our.internmc.facebook.com/intern/diff/D86120038/)
ghstack-source-id: 321483010
Pull Request resolved: #15656
This PR needs a
Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as