Skip to content

[Bug] Infinite hanging and quota surge with gemini-claude-opus-4-5-thinking in Claude Code #852

@KnightCat9

Description

@KnightCat9

Is it a request payload issue?
[ ] Yes, this is a request payload issue. I am using a client/cURL to send a request payload, but I received an unexpected error.
[x] No, it's another issue.

Describe the bug
When using the specified model in Claude Code, the terminal occasionally hangs in an infinite waiting state. However, in the CLI Proxy API Management Center (CPAMC), the request is marked as "Success," and the quota/usage statistics continue to rise indefinitely. This issue is sporadic but becomes persistent in long-context conversations.

CLI Type
claude code

Model Name
gemini-claude-opus-4-5-thinking (via Antigravity)

LLM Client
claude code

Request Information
Using CLI Proxy API Version: v6.6.80. (No specific cURL provided, issue observed during streaming in Claude Code).

Expected behavior
The response should complete normally, and the usage/quota statistics in CPAMC should stop increasing once the model finishes generating.

Screenshots
N/A

OS Type

OS: Linux Ubuntu
Version: 24.04
Additional context
It is suspected that the 2API format provided by Antigravity is incompatible with the response format required by Claude Code, causing the client to wait indefinitely for a stream end signal that never arrives, while the proxy keeps billing.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions