fix: ensure enough tokens for structured output in vLLM test (#591) #595

Merged
planetf1 merged 1 commit into generative-computing:main from planetf1:fix/vllm-truncation-test
Mar 6, 2026

@planetf1 planetf1 commented Mar 6, 2026

Ensure enough tokens for structured output in vLLM test

Type of PR

  • Bug Fix
  • New Feature
  • Documentation
  • Other

Description

Fixes #591

test_generate_from_raw_with_format in test/backends/test_openai_vllm.py is flaky because vLLM's low default token limit can truncate the generated JSON object before it is complete. This PR sets MAX_NEW_TOKENS to 256 so the structured output has enough room to finish.
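The failure mode can be illustrated with a minimal, self-contained sketch. It does not call vLLM or this repository's backend: `fake_generate` is a hypothetical stand-in that cuts its output at a token budget, with one character standing in for one token. The payload and the small limit of 16 are illustrative assumptions, not vLLM's actual default or the test's real data.

```python
import json

# Hypothetical structured output the model is asked to produce (illustrative only).
FULL_OUTPUT = json.dumps(
    {"name": "Ada Lovelace", "email": "ada@example.com", "tags": ["math", "computing"]}
)


def fake_generate(max_new_tokens: int) -> str:
    """Stand-in for a backend call; treats each character as one token."""
    return FULL_OUTPUT[:max_new_tokens]


def is_valid_json(text: str) -> bool:
    """Return True if the text parses as JSON, False if it is malformed or truncated."""
    try:
        json.loads(text)
    except json.JSONDecodeError:
        return False
    return True


# A small budget cuts the object off mid-string, so parsing fails.
print(is_valid_json(fake_generate(16)))   # False
# A budget of 256 leaves room for the whole object.
print(is_valid_json(fake_generate(256)))  # True
```

Real tokenizers do not map one character to one token, so the exact budget needed depends on the model and the schema. The point is only that a test validating structured output needs an explicit limit large enough for the complete object.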

Note: While this change can be merged independently, successful execution in isolated environments depends on the associated PYTHONPATH fix (#592). Combined, these fixes result in a 100% pass rate for vLLM tests on LSF.

Testing

  • Tests added to the respective file if code was changed
  • New code has 100% coverage if code was added
  • Ensure existing tests and GitHub automation pass (a maintainer will kick off the GitHub automation when the rest of the PR is populated)

github-actions Bot commented Mar 6, 2026

The PR description has been updated. Please fill out the template for your PR to be reviewed.

mergify Bot commented Mar 6, 2026

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

  • title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert|release)(?:\(.+\))?:
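As a rough local check, the title condition above can be reproduced in Python. This is a sketch that assumes the `~=` operator behaves like a regular-expression search; `is_conventional` is a hypothetical helper, not part of Mergify or this repository.

```python
import re

# Pattern copied verbatim from the merge-protection rule.
CONVENTIONAL_TITLE = re.compile(
    r"^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert|release)(?:\(.+\))?:"
)


def is_conventional(title: str) -> bool:
    """Return True if the title starts with a conventional-commit type and a colon."""
    return CONVENTIONAL_TITLE.search(title) is not None


# This PR's title starts with "fix:", so it satisfies the pattern.
print(is_conventional("fix: ensure enough tokens for structured output in vLLM test (#591)"))  # True
# An optional scope in parentheses is also accepted.
print(is_conventional("fix(vllm): raise token limit"))  # True
# A title without a recognized type prefix is rejected.
print(is_conventional("Ensure enough tokens for structured output"))  # False
```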

@planetf1 planetf1 marked this pull request as ready for review March 6, 2026 09:20
@planetf1 planetf1 requested a review from a team as a code owner March 6, 2026 09:20
@planetf1 planetf1 added this pull request to the merge queue Mar 6, 2026
Merged via the queue into generative-computing:main with commit ac6a4cf Mar 6, 2026
7 of 8 checks passed

Development

Successfully merging this pull request may close these issues.

test_generate_from_raw_with_format flaky due to missing MAX_NEW_TOKENS

2 participants