Commit 2b3fcc3

[djl] patch release for 0.32.0 TRT-LLM DLC, update available_images.md (#4612)
1 parent 5b10ae6 commit 2b3fcc3

2 files changed, +4 -4 lines changed

available_images.md (+1)

@@ -193,6 +193,7 @@ Starting LMI V10 (0.28.0), we are changing the name from LMI DeepSpeed DLC to LM
 | Framework | Job Type | Accelerator | Python Version Options | Example URL |
 |-----------------------------------------------------------------------------------------------------------------------------|-----------|-------------|------------------------|-------------------------------------------------------------------------------------------|
 | DJLServing 0.32.0 with LMI Dist 13.0.0, vLLM 0.7.1, HuggingFace Transformers 4.45.2, and HuggingFace Accelerate 1.0.1 | inference | GPU | 3.11 (py311) | 763104351884.dkr.ecr.us-west-2.amazonaws.com/djl-inference:0.32.0-lmi14.0.0-cu126 |
+| DJLServing 0.32.0 with TensorRT-LLM 0.12.0, HuggingFace Transformers 4.44.2, and HuggingFace Accelerate 0.32.1 | inference | GPU | 3.10 (py310) | 763104351884.dkr.ecr.us-west-2.amazonaws.com/djl-inference:0.32.0-tensorrtllm0.12.0-cu125 |
 | DJLServing 0.31.0 with LMI Dist 13.0.0, vLLM 0.6.3.post1, HuggingFace Transformers 4.45.2, and HuggingFace Accelerate 1.0.1 | inference | GPU | 3.11 (py311) | 763104351884.dkr.ecr.us-west-2.amazonaws.com/djl-inference:0.31.0-lmi13.0.0-cu124 |
 | DJLServing 0.30.0 with LMI Dist 12.0.0, vLLM 0.6.2, HuggingFace Transformers 4.45.2, and HuggingFace Accelerate 1.0.1 | inference | GPU | 3.10 (py310) | 763104351884.dkr.ecr.us-west-2.amazonaws.com/djl-inference:0.30.0-lmi12.0.0-cu124 |
 | DJLServing 0.30.0 with TensorRT-LLM 0.12.0, HuggingFace Transformers 4.44.2, and HuggingFace Accelerate 0.33.0 | inference | GPU | 3.10 (py310) | 763104351884.dkr.ecr.us-west-2.amazonaws.com/djl-inference:0.30.0-tensorrtllm0.12.0-cu125 |
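
The Example URL column packs the registry account, region, repository, and a three-part tag into one string. As a rough sketch, not part of this commit, the added row's URI can be split into those pieces as below. The helper name `parse_image_uri` is hypothetical, and the tag layout `<djl version>-<engine and version>-<cuda>` is an assumption inferred from the rows shown in this table; tags with a different number of hyphen-separated parts would raise an error.

```python
import re

# General shape of a private ECR image URI:
#   <account>.dkr.ecr.<region>.amazonaws.com/<repository>:<tag>
ECR_URI = re.compile(
    r"^(?P<account>\d{12})\.dkr\.ecr\.(?P<region>[a-z0-9-]+)\.amazonaws\.com/"
    r"(?P<repository>[^:]+):(?P<tag>.+)$"
)


def parse_image_uri(uri: str) -> dict:
    """Split an image URI into its components (hypothetical helper)."""
    match = ECR_URI.match(uri)
    if match is None:
        raise ValueError(f"not an ECR image URI: {uri}")
    parts = match.groupdict()
    # Assumption: the tag has exactly three hyphen-separated fields,
    # as in the rows of the table above.
    djl_version, engine, cuda = parts["tag"].split("-")
    parts.update(djl_version=djl_version, engine=engine, cuda=cuda)
    return parts


info = parse_image_uri(
    "763104351884.dkr.ecr.us-west-2.amazonaws.com/"
    "djl-inference:0.32.0-tensorrtllm0.12.0-cu125"
)
print(info["region"], info["repository"], info["engine"], info["cuda"])
```

Parsing the tag this way makes it easy to cross-check a table row against the versions named in its Framework column.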

release_images_inference.yml (+3 -4)

@@ -126,10 +126,9 @@ release_images:
 arch_type: "x86"
 inference:
 device_types: [ "gpu" ]
-python_versions: [ "py312" ]
-os_version: "ubuntu22.04"
-lmi_version: "14.0.0"
-cuda_version: "cu126"
+python_versions: [ "py310" ]
+os_version: "ubuntu24.04"
+tensorrtllm_version: "0.12.0"
 example: False
 disable_sm_tag: True
 force_release: False
