[Cria][Lllama runner] Use caching temp allocator#16080
[Cria][Lllama runner] Use caching temp allocator#16080kimishpatel wants to merge 1 commit intogh/kimishpatel/216/basefrom
Conversation
Use of caching allocator improves TITO model performance by 6+ %. Will add repro instructions here but requires next diff to see the impact Differential Revision: [D85532078](https://our.internmc.facebook.com/intern/diff/D85532078/) [ghstack-poisoned]
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/16080
Note: Links to docs will display an error until the docs builds have been completed. ❌ 3 New FailuresAs of commit c6289da with merge base 8af8252 ( NEW FAILURES - The following jobs have failed:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
This PR needs a
|
|
Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as |
Stack from ghstack (oldest at bottom):
Use of caching allocator improves TITO model performance by 6+ %.
Will add repro instructions here but requires next diff to see the impact
Differential Revision: D85532078