[Inference Providers] Add `image-text-to-image` and `image-text-to-video` support for wavespeed #1882

hanouticelina · 2025-12-15T14:16:19Z

Discussed internally in this thread.
This PR adds support for the image-text-to-image and image-text-to-video tasks for wavespeed provider. These tasks support models that accept both optional image + text input.
For these tasks, we use the same endpoints as for image-to-image and image-to-video, when no image is provided, we send a 1x1 fully transparent image. Note that wavespeed's doesn't follow a consistent endpoint naming pattern that would allow us to have the same logic as for fal in #1879.

I've tested it (i.e. text only input) with FLUX-2 and and the results looks reasonable to me:

"A robot playing chess in a garden"

"A butterfly landing on a flower"

wavespeed_video_placeholder_4.mp4

Wauplin

Hacky but nice it works! 😄

add image-text-to-video and image-text-to-image for wavespeed

8be7973

hanouticelina requested review from Wauplin and apolinario December 15, 2025 14:16

hanouticelina requested review from SBrandeis and julien-c as code owners December 15, 2025 14:16

Wauplin approved these changes Dec 16, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Inference Providers] Add `image-text-to-image` and `image-text-to-video` support for wavespeed #1882

[Inference Providers] Add `image-text-to-image` and `image-text-to-video` support for wavespeed #1882

Uh oh!

hanouticelina commented Dec 15, 2025

Uh oh!

Wauplin left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[Inference Providers] Add image-text-to-image and image-text-to-video support for wavespeed #1882

Are you sure you want to change the base?

[Inference Providers] Add image-text-to-image and image-text-to-video support for wavespeed #1882

Uh oh!

Conversation

hanouticelina commented Dec 15, 2025

Uh oh!

Wauplin left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[Inference Providers] Add `image-text-to-image` and `image-text-to-video` support for wavespeed #1882

[Inference Providers] Add `image-text-to-image` and `image-text-to-video` support for wavespeed #1882