[WIP][AQUA] Add Supporting Fine-Tuned Models in Multi-Model Deployment #1186
Description
The current implementation of Multi-Model Deployment in AQUA supports only base models. Fine-tuned models, however, are a critical part of many customer workflows, allowing customers to adapt base models to domain-specific use cases.
This PR introduces support for deploying fine-tuned LLMs as part of a multi-model deployment group on the VLLM container.
Implementation
In the first iteration, we will treat each selected model, whether it is a base model or a fine-tuned variant, as an independent entity. Even if multiple fine-tuned models share the same base model, each one will be deployed in its own isolated VLLM instance.
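A deployment group under this scheme could be described by a structure like the following minimal sketch. The field names, model names, and adapter paths here are hypothetical illustrations, not the actual AQUA schema:

```python
# Hypothetical sketch of a multi-model deployment group in which every
# entry, base or fine-tuned, maps to its own isolated VLLM instance.
model_group = [
    {
        "name": "base-llama",           # base model deployed as-is
        "base_model": "base-llama",
        "ft_weights_path": None,        # no adapter: plain base model
    },
    {
        "name": "llama-ft-finance",     # fine-tuned variant
        "base_model": "base-llama",     # shares the base model above...
        "ft_weights_path": "oci://bucket/adapters/finance/",  # hypothetical path
    },
    {
        "name": "llama-ft-legal",
        "base_model": "base-llama",     # ...and so does this one
        "ft_weights_path": "oci://bucket/adapters/legal/",
    },
]

# ...yet each entry still gets its own VLLM instance in this iteration,
# even when the base model repeats across entries.
instances = {entry["name"]: f"instance-{i}" for i, entry in enumerate(model_group)}
```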
On the SMC side, we will leverage VLLM's capability to merge LoRA adapter weights dynamically at runtime. This means each VLLM instance will independently load the base model and its corresponding fine-tuned weights.
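Concretely, each instance would be started roughly like this, using VLLM's standard LoRA serving flags. The base model name, adapter name, and adapter path below are placeholders, not values from this PR:

```shell
# Hypothetical launch of one isolated VLLM instance: the base model is
# loaded once and the LoRA adapter weights are applied at runtime.
vllm serve meta-llama/Llama-3.1-8B-Instruct \
  --enable-lora \
  --lora-modules llama-ft-finance=/opt/adapters/finance
```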
To avoid routing conflicts caused by multiple instances serving the same base model name, we will route the base model name to exactly one instance, but we will not advertise that base model as an endpoint to users (this matches the current behavior of Single Model Deployment).
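The routing rule can be sketched as follows. The endpoint URLs, model names, and function names are illustrative only; this is not the actual gateway code:

```python
# Sketch: advertised model names map to instances; the shared base model
# name resolves to exactly one instance but is never advertised to users.
instances = {
    "llama-ft-finance": "http://instance-0:8000",
    "llama-ft-legal": "http://instance-1:8000",
}
# Base model routed to a single instance only, to avoid name conflicts.
base_model_fallback = ("base-llama", "http://instance-0:8000")

def advertised_models():
    """Only the fine-tuned endpoints are exposed to users."""
    return sorted(instances)

def route(model_name):
    """Resolve a request's model name to a single VLLM instance."""
    if model_name == base_model_fallback[0]:
        return base_model_fallback[1]  # routed internally, not advertised
    return instances[model_name]
```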
This configuration structure prepares us for future enhancements such as stacked fine-tuned deployments, where multiple fine-tuned variants are hosted under a single base model within one VLLM instance. However, that enhancement will initially apply only to single-model deployments.
In a second iteration, we will explore expanding this capability to multi-model deployments, enabling grouped deployment of fine-tuned variants with shared GPU allocation. That enhancement will require additional work across the ADS SDK, AQUA UI, and validation logic.