Fix the incompatibility issue caused by top_p=0 when using vLLM for inference (#1265) #1277
Merged
kcz358 merged 2 commits into EvolvingLMMs-Lab:main on Apr 10, 2026
Conversation
top_p=0 when using vLLM for inference (#1265)
Contributor
Author
Hi @Luodian Could you review this PR, please? The code changes are minimal: it mainly converts top_p=0 to top_p=1 when calling the vLLM backend, so that vLLM's greedy decoding can start. Additionally, I added a regression test.
kcz358 reviewed Apr 9, 2026
kcz358 approved these changes Apr 10, 2026
Fixes #1265.
Summary
Fix a vLLM backend compatibility issue where task configs that set `top_p: 0` crash during request construction. This change normalizes `top_p=0` to `top_p=1.0` before building vLLM `SamplingParams`, which preserves the intended greedy decoding behavior while satisfying vLLM's validation rules.

Root Cause
Several task configs in `lmms-eval` use:

- `temperature: 0`
- `top_p: 0`
- `do_sample: false`

This pattern is commonly used to express greedy decoding and works in Hugging Face-style generation paths.
However, the vLLM wrappers were forwarding `top_p` directly into `SamplingParams`, and vLLM requires `top_p` to be in `(0, 1]`. As a result, tasks using `top_p: 0` failed before generation started.

What Changed
Normalize `top_p=0` to `1.0` in the vLLM wrappers: `vllm`, `vllm_chat`, and `vllm_generate`.

Impact
Tasks that rely on greedy-style generation settings with `top_p: 0` can now run on vLLM without crashing. This keeps task YAML unchanged and limits the fix to the vLLM compatibility layer.
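The normalization described above can be sketched as follows. This is a minimal illustration, not the PR's actual code: `build_sampling_kwargs` is a hypothetical helper name, and it assumes the generation kwargs arrive as a plain dict parsed from task YAML.

```python
def build_sampling_kwargs(gen_kwargs: dict) -> dict:
    """Sketch of the vLLM compatibility-layer fix (hypothetical helper).

    vLLM requires top_p in (0, 1], but task configs use top_p: 0 as a
    Hugging Face-style greedy-decoding convention. Mapping it to 1.0 is
    behavior-preserving: with temperature=0, decoding is greedy either way.
    """
    kwargs = dict(gen_kwargs)  # avoid mutating the caller's config
    if kwargs.get("top_p", 1.0) == 0:
        kwargs["top_p"] = 1.0
    return kwargs
```

The sanitized dict would then be passed to the real `SamplingParams` constructor, so validation succeeds without touching any task YAML.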
Validation
Ran:
- `python -m unittest test.models.test_vllm_sampling_params -v`
- `python -m unittest test.models.test_model_registry_v2 -v`
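A regression test for this behavior could look roughly like the sketch below. The names here are hypothetical (the PR's actual test lives in `test.models.test_vllm_sampling_params`), and `normalize_top_p` is an inline stand-in for the wrapper-side fix.

```python
import unittest


def normalize_top_p(top_p: float) -> float:
    # Stand-in for the PR's fix: map the greedy-decoding convention
    # top_p=0 to 1.0, which vLLM's SamplingParams accepts.
    return 1.0 if top_p == 0 else top_p


class TestVllmTopPNormalization(unittest.TestCase):
    def test_zero_is_normalized_to_one(self):
        self.assertEqual(normalize_top_p(0), 1.0)

    def test_valid_values_pass_through(self):
        self.assertEqual(normalize_top_p(0.9), 0.9)
        self.assertEqual(normalize_top_p(1.0), 1.0)
```

Run with `python -m unittest <module> -v`, matching the validation commands above.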