Commit d34b8db

Allow unlimited connections per-worker
Signed-off-by: Samuel Monson <[email protected]>
1 parent a98a644 commit d34b8db

1 file changed: +6 -0 lines changed

src/guidellm/backends/openai.py

Lines changed: 6 additions & 0 deletions
@@ -134,6 +134,12 @@ async def process_startup(self):
     timeout=self.timeout,
     follow_redirects=self.follow_redirects,
     verify=self.verify,
+    # Allow unlimited connections with short keep-alive
+    limits=httpx.Limits(
+        max_connections=None,
+        max_keepalive_connections=None,
+        keepalive_expiry=0.5,
+    ),
 )
 self._in_process = True
