Commit daa8239
[NV] DSR1 FP8 GB200 Dynamo TRT (#617)
* initial configs
Signed-off-by: jthomson04 <jothomson@nvidia.com>
* update perf changelog
Signed-off-by: jthomson04 <jothomson@nvidia.com>
* Fix YAML syntax error in perf-changelog.yaml
Add missing space after dash in config-keys entry for dsr1-fp8-gb200-dynamo-trt.
'-config-keys:' should be '- config-keys:'
Co-authored-by: functionstackx <functionstackx@users.noreply.github.com>
* Fix model-prefix typo: dsr1-fp8 -> dsr1
The model-prefix 'dsr1-fp8' is not a supported prefix. The supported
prefixes are 'gptoss' and 'dsr1'. Changed to 'dsr1' to fix the
launch_gb200-nv.sh runner error.
Co-authored-by: functionstackx <functionstackx@users.noreply.github.com>
* fix gb200
Signed-off-by: jthomson04 <jothomson@nvidia.com>
* fix nvidia-master.yaml
Signed-off-by: jthomson04 <jothomson@nvidia.com>
* Update perf-changelog with detailed DSR1 FP8 GB200 scenario descriptions
Co-authored-by: Cameron Quilici <cquil11@users.noreply.github.com>
* update dsr1 fp8 path
Signed-off-by: jthomson04 <jothomson@nvidia.com>
---------
Signed-off-by: jthomson04 <jothomson@nvidia.com>
Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com>
Co-authored-by: functionstackx <functionstackx@users.noreply.github.com>
Co-authored-by: Cameron Quilici <cquil11@users.noreply.github.com>1 parent 70bfef5 commit daa8239
3 files changed
Lines changed: 592 additions & 2 deletions
0 commit comments