Commit d7c11b6

karen-sy, alec-flowers, and jthomson04 authored
chore: bump vLLM to 0.12.0 (#4736)
Signed-off-by: alec-flowers <[email protected]>
Signed-off-by: Karen Chung <[email protected]>
Signed-off-by: jthomson04 <[email protected]>
Co-authored-by: alec-flowers <[email protected]>
Co-authored-by: jthomson04 <[email protected]>
1 parent c099252 commit d7c11b6

3 files changed: +7 -8 lines changed

container/Dockerfile.vllm

Lines changed: 3 additions & 4 deletions
@@ -15,14 +15,13 @@ ARG RUNTIME_IMAGE_TAG="12.9.0-runtime-ubuntu24.04"
 ARG CUDA_VERSION="12.9"

 # Make sure to update the dependency version in pyproject.toml when updating this
-ARG VLLM_REF="v0.11.2"
+ARG VLLM_REF="v0.12.0"
 # FlashInfer Ref used to install flashinfer-cubin and flashinfer-jit-cache
-ARG FLASHINF_REF="v0.5.2"
+ARG FLASHINF_REF="v0.5.3"

 # If left blank, then we will fallback to vLLM defaults
 ARG DEEPGEMM_REF=""
-# LMCache version - 0.3.9+ required for vLLM 0.11.2 compatibility
-ARG LMCACHE_REF="0.3.9.post2"
+ARG LMCACHE_REF="0.3.10"

 # sccache configuration - inherit from base build
 ARG USE_SCCACHE

container/deps/vllm/install_vllm.sh

Lines changed: 3 additions & 3 deletions
@@ -11,7 +11,7 @@

 set -euo pipefail

-VLLM_REF="v0.11.2"
+VLLM_REF="v0.12.0"

 # Basic Configurations
 ARCH=$(uname -m)
@@ -22,9 +22,9 @@ INSTALLATION_DIR=/tmp
 TORCH_CUDA_ARCH_LIST="9.0;10.0" # For EP Kernels
 DEEPGEMM_REF=""
 CUDA_VERSION="12.9"
-FLASHINF_REF="v0.5.2"
+FLASHINF_REF="v0.5.3"
 # LMCache version - 0.3.9+ required for vLLM 0.11.2 compatibility
-LMCACHE_REF="0.3.9.post2"
+LMCACHE_REF="0.3.10"

 while [[ $# -gt 0 ]]; do
 case $1 in

pyproject.toml

Lines changed: 1 addition & 1 deletion
@@ -56,7 +56,7 @@ trtllm =[
 vllm = [
 "uvloop",
 "nixl[cu12]<=0.7.1",
-"vllm[flashinfer]==0.11.2",
+"vllm[flashinfer]==0.12.0",
 ]

 sglang = [
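
The Dockerfile comment asks that the pyproject.toml pin be updated together with VLLM_REF, and this commit changes the same version in three places. A minimal sketch of a check that the three pins agree is shown below. It is not part of the commit: the helper names (`collect_versions`, `versions_agree`) and the regular expressions are hypothetical and assume only the line formats visible in the diff above.

```python
import re
from typing import Dict, Optional

# Assumed formats, taken from the diff: `ARG VLLM_REF="v0.12.0"` in the
# Dockerfile, `VLLM_REF="v0.12.0"` in the install script, and
# `"vllm[flashinfer]==0.12.0"` in pyproject.toml.
REF_PATTERN = re.compile(r'^\s*(?:ARG\s+)?VLLM_REF="v?([^"]+)"', re.MULTILINE)
PIN_PATTERN = re.compile(r'"vllm(?:\[[^\]]*\])?==([^"]+)"')


def _first_group(pattern: "re.Pattern[str]", text: str) -> Optional[str]:
    """Return the first captured version string, or None if nothing matches."""
    match = pattern.search(text)
    return match.group(1) if match else None


def collect_versions(dockerfile: str, install_script: str, pyproject: str) -> Dict[str, Optional[str]]:
    """Extract the vLLM version from the text of each of the three files."""
    return {
        "container/Dockerfile.vllm": _first_group(REF_PATTERN, dockerfile),
        "container/deps/vllm/install_vllm.sh": _first_group(REF_PATTERN, install_script),
        "pyproject.toml": _first_group(PIN_PATTERN, pyproject),
    }


def versions_agree(dockerfile: str, install_script: str, pyproject: str) -> bool:
    """True only if all three files declare a version and the versions match."""
    versions = list(collect_versions(dockerfile, install_script, pyproject).values())
    return None not in versions and len(set(versions)) == 1
```

The functions take file contents rather than paths so the check can be exercised without a checkout; a caller would read the three files and fail the build when `versions_agree` returns False.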
