[Feature] Add reasoning_budget to cap thinking tokens via existing reasoning parser by abhinand5 · Pull Request #37112 · vllm-project/vllm

Purpose: Adds reasoning_budget and reasoning_budget_message parameters to cap the number of reasoning tokens generated between the <think> and </think> markers for reasoning models (DeepSeek-R1, Qwen3, etc.). Related ...
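
To illustrate the idea of a reasoning budget, here is a minimal sketch in plain Python. It is not the code from this PR and does not use any vLLM API: the function name `cap_reasoning`, the one-string-per-token stream, and the exact behavior when the budget is hit are all assumptions made for illustration. It filters an already generated token stream, whereas a real engine would enforce the cap during decoding.

```python
from typing import Iterable, Iterator

# Marker strings taken from the PR description; treating each marker as a
# single token is a simplifying assumption.
THINK_START = "<think>"
THINK_END = "</think>"


def cap_reasoning(
    tokens: Iterable[str],
    reasoning_budget: int,
    reasoning_budget_message: str = "",
) -> Iterator[str]:
    """Hypothetical helper: pass tokens through, but close the reasoning
    span once `reasoning_budget` tokens have been emitted inside it."""
    in_reasoning = False  # currently between <think> and </think>
    truncated = False     # budget was hit; dropping leftover reasoning
    used = 0              # reasoning tokens emitted in the current span

    for tok in tokens:
        if truncated:
            # Drop the remaining original reasoning, including the
            # model's own end marker (one was already emitted).
            if tok == THINK_END:
                truncated = False
            continue
        if tok == THINK_START:
            in_reasoning = True
            used = 0
            yield tok
            continue
        if tok == THINK_END:
            in_reasoning = False
            yield tok
            continue
        if in_reasoning:
            if used >= reasoning_budget:
                # Budget exhausted: emit the optional message, then
                # force the end-of-reasoning marker.
                if reasoning_budget_message:
                    yield reasoning_budget_message
                yield THINK_END
                in_reasoning = False
                truncated = True
                continue
            used += 1
        yield tok


stream = ["<think>", "a", "b", "c", "</think>", "answer"]
capped = list(cap_reasoning(stream, 2, "[budget reached]"))
print(capped)
```

In this sketch a span that fits within the budget passes through unchanged, and the message is only emitted when the span is actually cut short. How the PR itself handles these cases is not stated in the description above.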