feat(vllm_performance): Add GuideLLM experiments #459

christian-pinto · 2026-01-23T13:35:10Z

This PR adds two new experiments to the vllm_performance actuator:

test-endpoint-guidellm-v1
test-deployment-guidellm-v1

The new experiments are 100% compatible with the ones based on vLLM bench. Therefore, the same entity space can be used across vllm bench and GuideLLM. Also, the metrics reported are 100% matching.

At this stage, also for GuideLLM we only support a synthetic (random) dataset. Also, the vLLM experiment shave a burstiness argument the controls the distribution used for generating requests. The default value is 1 and it uses a Poisson distribution. GuideLLM does not support setting the burstiness of the requests. For the sake of guaranteeing using the same space across the two experiments I have decided to still have the burstiness argument in the guidellm experiments and forcing a poisson distribution for the requests generation.

This pr implements #457

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

…m experiments Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

DRL-NextGen · 2026-01-23T13:50:13Z

Checks Summary

Last run: 2026-01-27T15:46:31.450Z

Code Risk Analyzer vulnerability scan found 2 vulnerabilities:

Severity	Identifier	Package	Details	Fix
◻ Unknown	CVE-2025-53000	nbconvert	nbconvert has an uncontrolled search path that leads to unauthorized code execution on Windows GHSA-xm59-rqc7-hhvf nbconvert:7.16.6->ado-core:1.3.3	>7.16.6
◻ Unknown	CVE-2026-0994	protobuf	protobuf affected by a JSON recursion depth bypass GHSA-7gcm-g887-7qv7 protobuf:6.33.4->guidellm:0.5.3,protobuf:6.33.4,vllm:0.14.1,protobuf:6.33.4,ado-core:1.3.3	>6.33.4

Mend Unified Agent vulnerability scan found 3 vulnerabilities:

Severity	Identifier	Package	Details	Fix
❗ Critical	CVE-2025-56005	ply-3.11-py2.py3-none-any.whl	An undocumented and unsafe feature in the PLY (Python Lex-Yacc) library 3.11 allows Remote Code Exec... An undocumented and unsafe feature in the PLY (Python Lex-Yacc) library 3.11 allows Remote Code Execution (RCE) via the "picklefile" parameter in the "yacc()" function. This parameter accepts a ".pkl" file that is deserialized with "pickle.load()" without validation. Because "pickle" allows execution of embedded code via "reduce()", an attacker can achieve code execution by passing a malicious pickle file. The parameter is not mentioned in official documentation or the GitHub repository, yet it is active in the PyPI version. This introduces a stealthy backdoor and persistence risk.	Not Available
🔺 High	CVE-2025-53000	nbconvert-7.16.6-py3-none-any.whl	The nbconvert tool, jupyter nbconvert, converts Jupyter notebooks to various other formats via Jinja... The nbconvert tool, jupyter nbconvert, converts Jupyter notebooks to various other formats via Jinja templates. Versions of nbconvert up to and including 7.16.6 on Windows have a vulnerability in which converting a notebook containing SVG output to a PDF results in unauthorized code execution. Specifically, a third party can create a "inkscape.bat" file that defines a Windows batch script, capable of arbitrary code execution. When a user runs "jupyter nbconvert --to pdf" on a notebook containing SVG output to a PDF on a Windows platform from this directory, the "inkscape.bat" file is run unexpectedly. As of time of publication, no known patches exist.	Not Available
🔺 High	CVE-2026-0994	protobuf-6.33.4-cp39-abi3-manylinux2014_x86_64.whl	A denial-of-service (DoS) vulnerability exists in google.protobuf.json_format.ParseDict() in Python,... A denial-of-service (DoS) vulnerability exists in google.protobuf.json_format.ParseDict() in Python, where the max_recursion_depth limit can be bypassed when parsing nested google.protobuf.Any messages. Due to missing recursion depth accounting inside the internal Any-handling logic, an attacker can supply deeply nested Any structures that bypass the intended recursion limit, eventually exhausting Python’s recursion stack and causing a RecursionError.	Not Available

...rformance/ado_actuators/vllm_performance/vllm_performance_test/execute_guidellm_benchmark.py

plugins/actuators/vllm_performance/ado_actuators/vllm_performance/experiment_executor.py

plugins/actuators/vllm_performance/GUIDELLM_PARAMETER_MAPPING.md

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

plugins/actuators/vllm_performance/ado_actuators/vllm_performance/experiment_executor.py

...rformance/ado_actuators/vllm_performance/vllm_performance_test/execute_guidellm_benchmark.py

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

…ydantic models usage and improved docs Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

michael-johnston · 2026-01-27T11:36:56Z

@christian-pinto there is a fixable ruff problem in vllm_performance causing CI to fail. Can you pull and fix?

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

christian-pinto · 2026-01-27T12:25:36Z

@christian-pinto there is a fixable ruff problem in vllm_performance causing CI to fail. Can you pull and fix?

Just fixed it. It's weird though that the pre-commit hook missed it

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

...rs/vllm_performance/ado_actuators/vllm_performance/vllm_performance_test/benchmark_models.py

...ors/vllm_performance/ado_actuators/vllm_performance/vllm_performance_test/guidellm_models.py

plugins/actuators/vllm_performance/ado_actuators/vllm_performance/experiment_executor.py

…ers model Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

Co-authored-by: Alessandro Pomponio <10339005+AlessandroPomponio@users.noreply.github.com> Signed-off-by: Michael Johnston <66301584+michael-johnston@users.noreply.github.com>

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

website/docs/actuators/vllm_performance.md

...rs/vllm_performance/ado_actuators/vllm_performance/vllm_performance_test/benchmark_models.py

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

christian-pinto added 3 commits January 22, 2026 15:47

feat(vllm_performance): Adding guidellm experiment

18cdd8e

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

feat(vllm_performance): Fixing implementation

ad63a44

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

feat(vllm_performance): Finalized first implementation of the guidell…

4786e5a

…m experiments Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

christian-pinto requested review from AlessandroPomponio and michael-johnston January 23, 2026 13:35

AlessandroPomponio requested changes Jan 23, 2026

View reviewed changes

fix(vllm_performance): Fixed code after PR review

5a7d901

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

AlessandroPomponio requested changes Jan 26, 2026

View reviewed changes

AlessandroPomponio mentioned this pull request Jan 26, 2026

build(deps): update dependencies #462

Closed

christian-pinto added 10 commits January 26, 2026 19:58

fix(vllm_performance): Further fixes after PR review

03826a8

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

fix(vllm_performance): Cleanup actuator dependencies

7ea7218

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

fix(vllm_performance): One last round of fixes

d364abe

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

fix(vllm_performance): Fixed required version of datasets, improved p…

9f67d90

…ydantic models usage and improved docs Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

fix(vllm_performance): Added URL validation logic to vLLM experiment

98db8a6

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

fix(vllm_performance): Just a touch up to the docs

658a513

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

Merge branch 'main' into cp_add_guidellm_experiment

a90fa5a

fix(vllm_performance): Cleaning up pyproject.toml

936e927

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

fix(vllm_performance): README touch up

850cb6a

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

fix(vllm_performance): Using Annotated fields in pydantic models

5d68dec

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

christian-pinto added 2 commits January 27, 2026 12:11

fix(vllm_performance): Improved benchmark parameters extraction

0ae1cf7

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

fix(vllm_performance): Solved Ruff error in experiment_executor

ed19426

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

fix(vllm_performance): Some more fixes, hopefully the last ones

b1bb82a

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

AlessandroPomponio requested changes Jan 27, 2026

View reviewed changes

christian-pinto and others added 4 commits January 27, 2026 13:06

fix(vllm_performance): Improved model validation for BenchmarkParamet…

6b8bf7c

…ers model Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

Apply suggestions from code review

faeaa89

Co-authored-by: Alessandro Pomponio <10339005+AlessandroPomponio@users.noreply.github.com> Signed-off-by: Michael Johnston <66301584+michael-johnston@users.noreply.github.com>

fix(vllm_performance): Further changes after review

8f15aca

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

fix(vllm_performance): I can see the light at the end of the tunnel

4348c08

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

AlessandroPomponio requested changes Jan 27, 2026

View reviewed changes

website/docs/actuators/vllm_performance.md Show resolved Hide resolved

...rs/vllm_performance/ado_actuators/vllm_performance/vllm_performance_test/benchmark_models.py Outdated Show resolved Hide resolved

christian-pinto added 2 commits January 27, 2026 14:59

fix(vllm_performance): I ran out of commit messages

0c6ce6d

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>

Merge branch 'main' into cp_add_guidellm_experiment

13e86c3

AlessandroPomponio approved these changes Jan 27, 2026

View reviewed changes

michael-johnston enabled auto-merge January 27, 2026 15:51

michael-johnston approved these changes Jan 27, 2026

View reviewed changes

michael-johnston added this pull request to the merge queue Jan 27, 2026

Merged via the queue into main with commit 46ffac8 Jan 27, 2026
19 checks passed

michael-johnston deleted the cp_add_guidellm_experiment branch January 27, 2026 16:35

michael-johnston linked an issue Jan 27, 2026 that may be closed by this pull request

feat: add guidellm experiment in vllm_performance #457

Closed

feat(vllm_performance): Add GuideLLM experiments #459

feat(vllm_performance): Add GuideLLM experiments #459

Uh oh!

Conversation

christian-pinto commented Jan 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

DRL-NextGen commented Jan 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

michael-johnston commented Jan 27, 2026

Uh oh!

christian-pinto commented Jan 27, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

christian-pinto commented Jan 23, 2026 •

edited

Loading

DRL-NextGen commented Jan 23, 2026 •

edited

Loading