[Cria][Llama runner] Use caching temp allocator #16081
kimishpatel wants to merge 4 commits into gh/kimishpatel/217/base
Using the caching temp allocator improves TITO model performance by more than 6%. Repro instructions will be added here, but the next diff in the stack is required to see the impact.

Differential Revision: [D85532078](https://our.internmc.facebook.com/intern/diff/D85532078/)
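For readers unfamiliar with the idea, here is a minimal sketch of what a caching temp allocator does: instead of returning temporary buffers to the system after each inference step, it keeps them and hands them back out on the next request of the same size. This is an illustration only and not the allocator from this PR; the class name `CachingTempAllocator` and its methods (`allocate`, `reset`, `system_allocations`, `cache_hits`) are hypothetical, and alignment handling is omitted for brevity.

```cpp
#include <cstddef>
#include <cstdlib>
#include <unordered_map>
#include <vector>

// Hypothetical sketch of a caching temp allocator. Not the ExecuTorch API.
class CachingTempAllocator {
 public:
  CachingTempAllocator() = default;
  CachingTempAllocator(const CachingTempAllocator&) = delete;
  CachingTempAllocator& operator=(const CachingTempAllocator&) = delete;

  ~CachingTempAllocator() {
    // Move live blocks into the cache, then release everything to the system.
    reset();
    for (auto& entry : free_blocks_) {
      for (void* block : entry.second) {
        std::free(block);
      }
    }
  }

  // Returns a block of `size` bytes. A cached block of the same size is
  // reused when one is available; otherwise memory comes from std::malloc.
  void* allocate(std::size_t size) {
    if (size == 0) {
      return nullptr;
    }
    void* block = nullptr;
    auto it = free_blocks_.find(size);
    if (it != free_blocks_.end() && !it->second.empty()) {
      block = it->second.back();
      it->second.pop_back();
      ++cache_hits_;
    } else {
      block = std::malloc(size);
      if (block == nullptr) {
        return nullptr;
      }
      ++system_allocations_;
    }
    live_blocks_.push_back(Block{block, size});
    return block;
  }

  // Ends the lifetime of all temporaries (for example, after one decode
  // step). The memory is kept in the cache rather than freed.
  void reset() {
    for (const Block& b : live_blocks_) {
      free_blocks_[b.size].push_back(b.ptr);
    }
    live_blocks_.clear();
  }

  std::size_t system_allocations() const { return system_allocations_; }
  std::size_t cache_hits() const { return cache_hits_; }

 private:
  struct Block {
    void* ptr;
    std::size_t size;
  };

  std::vector<Block> live_blocks_;  // handed out since the last reset()
  std::unordered_map<std::size_t, std::vector<void*>> free_blocks_;  // by size
  std::size_t system_allocations_ = 0;
  std::size_t cache_hits_ = 0;
};
```

In a token-generation loop, the same temporary sizes tend to recur on every step, so after the first step most calls to `allocate` in this sketch would be served from the cache. Whether that accounts for the improvement reported above depends on the actual implementation in this stack.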
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/16081
Note: Links to docs will display an error until the docs builds have been completed.
❌ 3 New Failures as of commit 64a7cc7 with merge base 8af8252.
This comment was automatically generated by Dr. CI and updates every 15 minutes.