Qwen2.5 VL fail to train due to qwen-vl-utils

### Search before asking

- [x] I have searched the Multimodal Maestro [issues](https://github.com/roboflow/multimodal-maestro/issues) and found no similar bug report.


### Bug

Hi! First of all thank you for this amazing library! 🔥 

I was trying to fine-tune the Qwen2.5VL following the [colab example](https://colab.research.google.com/github/roboflow/maestro/blob/develop/cookbooks/maestro_qwen2_5_vl_object_detection.ipynb#scrollTo=GHZGIRB_eAh3) when the following error appeared:

```bash
File "/home/dredo/anaconda3/envs/occluders/lib/python3.12/site-packages/maestro/trainer/models/qwen_2_5_vl/detection.py", line 14, in detections_to_suffix_formatter
    input_h, input_w = smart_resize(height=image_h, width=image_w, min_pixels=min_pixels, max_pixels=max_pixels)
                       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
TypeError: smart_resize() missing 1 required positional argument: 'factor'
```

I realized that `qwen-vl-utils` library deleted the deafult value of `factor` yesterday in [this commit](https://github.com/QwenLM/Qwen3-VL/commit/0dcc180d854f4b132f8059b10bbf0b5fd5dae9ed). I didn't find why.

The error will only appear when loading a coco dataset as it calls the `smart_resize`  function from `detections_to_suffix_formatter`. 

### Workaround

- `qwen-vl-utils` <= 0.0.11 worked for me

- Add this parameter in the function with the default (`IMAGE_FACTOR = 28`) value they used to have


### Environment

- maestro: `maestro[qwen_2_5_vl]==1.1.0rc3`
- OS: Ubuntu 22.04
- Python: 3.12

### Minimal Reproducible Example

_No response_

### Additional

_No response_

### Are you willing to submit a PR?

- [x] Yes I'd like to help by submitting a PR!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Qwen2.5 VL fail to train due to qwen-vl-utils #227

Search before asking

Bug

Workaround

Environment

Minimal Reproducible Example

Additional

Are you willing to submit a PR?

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Qwen2.5 VL fail to train due to qwen-vl-utils #227

Description

Search before asking

Bug

Workaround

Environment

Minimal Reproducible Example

Additional

Are you willing to submit a PR?

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions