
test: backend resource cleanup post-PR #721 (#736)

@planetf1

Context

Part of the Testing Infrastructure & Strategy Overhaul epic (item 7). PR #721 (@avinash2692) introduces cleanup_gpu_backend() and directly fixes the core issues: #625 (vLLM redundant servers), #630 (HF memory leaks), #620 (HF metrics OOM).

Objective

Once PR #721 lands, follow up by removing or consolidating workarounds that are no longer needed. Centralise any remaining platform-specific logic into shared fixtures.
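One possible shape for such a shared fixture is sketched below. It is only an illustration under stated assumptions: the signature of `cleanup_gpu_backend()` from PR #721 is not shown in this issue, so `default_cleanup` is a hypothetical stand-in, and `managed_backend` and `make_hf_backend` are likewise made-up names. The idea is that setup and teardown live in one helper, so individual tests no longer carry their own cleanup workarounds.

```python
import gc
from contextlib import contextmanager
from typing import Callable, Iterator, TypeVar

B = TypeVar("B")


def default_cleanup(backend: object) -> None:
    """Hypothetical stand-in for the cleanup helper introduced in PR #721.

    Assumption: the real helper's signature and behaviour may differ.
    This version only calls an optional close() and forces a GC pass.
    """
    close = getattr(backend, "close", None)
    if callable(close):
        close()
    gc.collect()


@contextmanager
def managed_backend(
    factory: Callable[[], B],
    cleanup: Callable[[B], None] = default_cleanup,
) -> Iterator[B]:
    """Create a backend and guarantee cleanup, even if the test body raises."""
    backend = factory()
    try:
        yield backend
    finally:
        cleanup(backend)


# Sketch of how a conftest.py fixture could wrap it (make_hf_backend is hypothetical):
#
# @pytest.fixture(scope="module")
# def hf_backend():
#     with managed_backend(make_hf_backend) as backend:
#         yield backend
```

Keeping the cleanup callable as a parameter means any remaining platform-specific teardown can be swapped in at a single place rather than repeated per test module.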

Work

Notes

Labels: area/backends (Provider-specific work: Ollama, HF, LiteLLM, OpenAI, Bedrock, vLLM), testing
