[CI/Build] Skip spec decode tests using Triton backend #27888

zhewenl · 2025-10-31T15:40:41Z

Purpose

More details in #27619.

EAGLE speculative decoding fails on AMD GPUs with an HSA_STATUS_ERROR_MEMORY_APERTURE_VIOLATION error. The error occurs as soon as prompt processing starts, after CUDA graph capture has completed successfully.
It can be reproduced with spec decode + EAGLE + the Triton attention backend (the default on AMD), for example:

python3 offline_inference/spec_decode.py --test --method eagle --num_spec_tokens 3 --dataset-name hf --dataset-path philschmid/mt-bench --num-prompts 80 --temp 0 --top-p 1.0 --top-k -1 --tp 1 --enable-chunked-prefill --max-model-len 2048

Error Details

:0:rocdevice.cpp:3675: Callback: Queue 0x7f4db0300000 aborting with error:
HSA_STATUS_ERROR_MEMORY_APERTURE_VIOLATION: The agent attempted to access memory beyond the largest legal address. code: 0x29

(Using a different backend may work, e.g. ROCM_AITER_FA or ROCM_AITER_UNIFIED_ATTN, but TritonAttentionBackend is the default attention backend for AMD: gist:fb8bbb2cbde391905d86908ca4a46c02)
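To check whether the failure is specific to the Triton backend, the reproduction command can be rerun with another backend selected. The sketch below only builds the argv list and the environment. It assumes the backend is chosen through the `VLLM_ATTENTION_BACKEND` environment variable, which is not stated in this PR; `env_with_backend` is a hypothetical helper name.

```python
import os
import shlex

# Reproduction command from the PR description, split into argv form.
REPRO_CMD = shlex.split(
    "python3 offline_inference/spec_decode.py --test --method eagle "
    "--num_spec_tokens 3 --dataset-name hf --dataset-path philschmid/mt-bench "
    "--num-prompts 80 --temp 0 --top-p 1.0 --top-k -1 --tp 1 "
    "--enable-chunked-prefill --max-model-len 2048"
)


def env_with_backend(backend, base_env=None):
    """Return a copy of the environment with the attention backend set.

    Assumption: the backend is selected via VLLM_ATTENTION_BACKEND.
    The caller's environment is never modified in place.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["VLLM_ATTENTION_BACKEND"] = backend
    return env
```

On a ROCm machine the pair can be passed to `subprocess.run(REPRO_CMD, env=env_with_backend("ROCM_AITER_FA"), check=True)` and the result compared against a run with the default backend.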

Test Plan

pytest -v -s tests/v1/e2e/test_spec_decode.py::test_eagle_correctness
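The skip described in the title could be expressed as a small predicate that the test consults before running. This is a minimal sketch and not the diff in this PR: `skip_reason` is a hypothetical helper, and the backend strings are the names used in the description above rather than confirmed identifiers from the test suite.

```python
from typing import Optional

# Assumption: the test identifies backends by these names, taken from the PR text.
TRITON_BACKEND = "TritonAttentionBackend"


def skip_reason(is_rocm: bool, attn_backend: str) -> Optional[str]:
    """Return a skip message for the failing combination, else None.

    Only ROCm + Triton is skipped; other backends on ROCm and all
    backends on other platforms still run.
    """
    if is_rocm and attn_backend == TRITON_BACKEND:
        return (
            "EAGLE spec decode hits HSA_STATUS_ERROR_MEMORY_APERTURE_VIOLATION "
            "with the Triton attention backend on AMD GPUs (see #27619)"
        )
    return None


# Intended use inside a pytest test body:
#     reason = skip_reason(is_rocm, attn_backend)
#     if reason:
#         pytest.skip(reason)
```

Keeping the condition in one function leaves the other backends covered on AMD CI and makes the skip easy to remove once #27619 is resolved.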

CI: https://buildkite.com/vllm/amd-ci/builds/814

Signed-off-by: zhewenli <[email protected]>

mergify · 2025-11-03T23:23:53Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @zhewenl.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

update test

9a4eb88

Signed-off-by: zhewenli <[email protected]>

mergify bot added ci/build speculative-decoding v1 labels Oct 31, 2025

zhewenl added 5 commits October 31, 2025 09:32

update test

ef9e6a4

Signed-off-by: zhewenli <[email protected]>

update test

b117463

Signed-off-by: zhewenli <[email protected]>

update test

3b32cc4

Signed-off-by: zhewenli <[email protected]>

update test

4026bb1

Signed-off-by: zhewenli <[email protected]>

update test

e585be4

Signed-off-by: zhewenli <[email protected]>

mergify bot added the needs-rebase label Nov 3, 2025
