Add Inference Spec & CI #22

AlejandroEsquivel · 2024-09-17T06:27:26Z

Added:

Added Inference Spec class to support Ray, Sagemaker, FastAPI in one go
Added CI to support to deploy/update validator's inference endpoint on the cluster

Example Running FastAPI with Inference Spec:

pip install git+https://github.com/guardrails-ai/models-host@feat/adding-ray-setup
uvicorn models_host.fastapi:app

Example Running Ray Serve with Inference Spec:

# Refer to models-host repo for instructions if you don't have a GPU on the machine running ray serve
pip install git+https://github.com/guardrails-ai/models-host@feat/adding-ray-setup
serve run models_host.ray_serve:app

Add Inference Spec & CI

795f693

AlejandroEsquivel marked this pull request as draft September 17, 2024 06:28

AlejandroEsquivel marked this pull request as ready for review September 17, 2024 20:56

AlejandroEsquivel requested review from CalebCourier, JosephCatrambone, dtam and zsimjee September 17, 2024 20:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Inference Spec & CI #22

Add Inference Spec & CI #22

Uh oh!

AlejandroEsquivel commented Sep 17, 2024 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add Inference Spec & CI #22

Are you sure you want to change the base?

Add Inference Spec & CI #22

Uh oh!

Conversation

AlejandroEsquivel commented Sep 17, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

AlejandroEsquivel commented Sep 17, 2024 •

edited

Loading