cuda-perf

cuda-perf #52

Triggered via schedule March 22, 2026 08:22

dijopaul

⁠ 5d73a1b

main

Status Cancelled

Total duration 2d 0h 23m 44s

Artifacts –

cuda-perf.yml

on: schedule

set-parameters

Matrix: export-models

Matrix: benchmark-cuda

upload-benchmark-results

2m 32s

Annotations

47 errors and 2 warnings

export-models (mistralai/Voxtral-Mini-3B-2507, quantized-int4-weight-only, mistralai_Voxtral-Mini... / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

export-models (mistralai/Voxtral-Mini-3B-2507, non-quantized, mistralai_Voxtral-Mini-3B-2507, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

export-models (openai/whisper-medium, non-quantized, openai_whisper-medium, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

export-models (openai/whisper-small, quantized-int4-tile-packed, openai_whisper-small, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

export-models (openai/whisper-small, quantized-int4-weight-only, openai_whisper-small, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

export-models (openai/whisper-medium, quantized-int4-tile-packed, openai_whisper-medium, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

export-models (openai/whisper-large-v3-turbo, quantized-int4-weight-only, openai_whisper-large-v3... / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

export-models (mistralai/Voxtral-Mini-3B-2507, quantized-int4-tile-packed, mistralai_Voxtral-Mini... / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

export-models (openai/whisper-large-v3-turbo, quantized-int4-tile-packed, openai_whisper-large-v3... / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

export-models (openai/whisper-small, non-quantized, openai_whisper-small, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

export-models (google/gemma-3-4b-it, quantized-int4-weight-only, google_gemma-3-4b-it, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

export-models (openai/whisper-medium, quantized-int4-weight-only, openai_whisper-medium, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

export-models (google/gemma-3-4b-it, non-quantized, google_gemma-3-4b-it, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

export-models (google/gemma-3-4b-it, quantized-int4-tile-packed, google_gemma-3-4b-it, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

export-models (openai/whisper-large-v3-turbo, non-quantized, openai_whisper-large-v3-turbo, 50) / linux-job

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

benchmark-cuda (mistralai/Voxtral-Mini-3B-2507, quantized-int4-weight-only, mistralai_Voxtral-Min... / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (openai/whisper-large-v3-turbo, quantized-int4-tile-packed, openai_whisper-large-v... / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (google/gemma-3-4b-it, quantized-int4-weight-only, google_gemma-3-4b-it, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (openai/whisper-large-v3-turbo, quantized-int4-weight-only, openai_whisper-large-v... / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (openai/whisper-medium, quantized-int4-tile-packed, openai_whisper-medium, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (mistralai/Voxtral-Mini-3B-2507, quantized-int4-tile-packed, mistralai_Voxtral-Min... / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (openai/whisper-medium, non-quantized, openai_whisper-medium, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (mistralai/Voxtral-Mini-3B-2507, non-quantized, mistralai_Voxtral-Mini-3B-2507, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (google/gemma-3-4b-it, non-quantized, google_gemma-3-4b-it, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (openai/whisper-medium, quantized-int4-weight-only, openai_whisper-medium, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (openai/whisper-small, quantized-int4-weight-only, openai_whisper-small, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (openai/whisper-small, non-quantized, openai_whisper-small, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (openai/whisper-large-v3-turbo, non-quantized, openai_whisper-large-v3-turbo, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (openai/whisper-small, quantized-int4-tile-packed, openai_whisper-small, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

benchmark-cuda (google/gemma-3-4b-it, quantized-int4-tile-packed, google_gemma-3-4b-it, 50) / linux-job

The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s

upload-benchmark-results

Could not assume role with OIDC: Not authorized to perform sts:AssumeRoleWithWebIdentity

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

cuda-perf

Canceling since a higher priority waiting request for cuda-perf-main-5d73a1b57e40699ecc5ebd8267521bdae5c4557b-false-true exists

set-parameters

Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@v3, actions/setup-python@v4. Actions will be forced to run with Node.js 24 by default starting June 2nd, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/

upload-benchmark-results

Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@v3, actions/download-artifact@v4, actions/setup-python@v4, aws-actions/configure-aws-credentials@v4. Actions will be forced to run with Node.js 24 by default starting June 2nd, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cuda-perf #52

Summary

cuda-perf #52

Uh oh!

cuda-perf.yml

Annotations