Dealing with images #201

animanathome · 2025-05-15T21:04:18Z

I'm trying to create an Agent that describes the content of an image. When doing so, I noticed that the Agent uses the text property type in the request and sends the raw image data, which is unexpected.

"data":{"data":{"meta":null,"content":[{"type":"text","text":"\ufffd\ufffd\ufffd\ufffd\u0000\u0010Lavc61.19.101\u0000\ufffd\ufffd\u0000C\u0000\b\f\f\u000e\f\u000e\u0010\u0010\u0010\u0010\u0010\u0010\u0013\u0012\u0013\u0014\u0014\u0014\u0013\u0013\u0013\u0013\u0014\u0014\u0014\u0015\u0015\u0015\u0019\u0019\u0019\u0015\u0015\u0015\u0014\u0....fd\ufffd\ufffd'}]

Instead of using the text property, the Agent should use the dedicated property {type: "image_url", {detail: "high", url: imageAsBase64String }} to make the request. See link for more information.

Am I doing something wrong, or maybe they don't support images yet? When looking at the roadmap, I noticed it's not listed @andrew-lastmile.

The text was updated successfully, but these errors were encountered:

saqadri · 2025-05-15T21:22:39Z

@animanathome thanks for reporting and you're correct, currently images aren't supported properly. There are 2 changes required here --

For AugmentedLLM providers like OpenAI, Anthropic, etc. to handle non-text messages correctly.
To support resources and MCP message types for Image/AudioContent, etc.

cc'ing @StreetLamb if there is a fix we can add to OpenAIAugmentedLLM to unblock the first of these.

StreetLamb · 2025-05-17T04:36:57Z

Hi @animanathome, could you clarify how you’re passing the image to the agent? Are you attempting to return the image as a tool response from an MCP server? If so, one potential blocker is that OpenAI’s tool message currently support only text content. This means you cannot directly pass an image back to the LLM using a tool.

saqadri self-assigned this May 15, 2025

saqadri added the enhancement New feature or request label May 15, 2025

StreetLamb self-assigned this May 16, 2025

StreetLamb linked a pull request May 17, 2025 that will close this issue

Support proper conversion from MCP's image content to OpenAI's image content part #210

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Dealing with images #201

Dealing with images #201

animanathome commented May 15, 2025 •

edited

Loading

saqadri commented May 15, 2025

Uh oh!

StreetLamb commented May 17, 2025 •

edited

Loading

Uh oh!

Dealing with images #201

Dealing with images #201

Comments

animanathome commented May 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

saqadri commented May 15, 2025

Uh oh!

StreetLamb commented May 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

animanathome commented May 15, 2025 •

edited

Loading

StreetLamb commented May 17, 2025 •

edited

Loading