-
Notifications
You must be signed in to change notification settings - Fork 208
Description
Please read this first
- Have you read the docs? Yes
- Have you searched for related issues? yes
Describe the feature
I asked ChatGPT about securing sessionKey in realtime API to enable web-based voice agents with full VAD without needing an intermediate SFU/MCU media server. It would reduce complexity, cost and latency in online agents.
It made a number of suggestions about how this could be handled within the OpenAI API / SDK itself, but nothing that currently works. The suggestions were:
What Would Be Needed
To make this viable, OpenAI would need to:Allow short-lived, thread-scoped access tokens (ideally JWT or presigned URLs).
Support custom headers or query parameters for client-side auth.
Possibly integrate OAuth2 or federated identity systems to delegate auth securely.
ChatGPT also pointed out that:
Because there’s a material security concern with client-controlled models and session keys, OpenAI may be motivated to enhance their system.
So my question is - is this something that OpenAI is looking at?