Skip to content

Secure direct client session #284

@tstewart-klaudhaus

Description

@tstewart-klaudhaus

Please read this first

  • Have you read the docs? Yes
  • Have you searched for related issues? yes

Describe the feature

I asked ChatGPT about securing sessionKey in realtime API to enable web-based voice agents with full VAD without needing an intermediate SFU/MCU media server. It would reduce complexity, cost and latency in online agents.

It made a number of suggestions about how this could be handled within the OpenAI API / SDK itself, but nothing that currently works. The suggestions were:

What Would Be Needed
To make this viable, OpenAI would need to:

Allow short-lived, thread-scoped access tokens (ideally JWT or presigned URLs).

Support custom headers or query parameters for client-side auth.

Possibly integrate OAuth2 or federated identity systems to delegate auth securely.

ChatGPT also pointed out that:

Because there’s a material security concern with client-controlled models and session keys, OpenAI may be motivated to enhance their system.

So my question is - is this something that OpenAI is looking at?

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions