@Silviase
Collaborator

  • add fish helper to provision uv environments with CUDA paths
  • refactor vLLM wrapper/registry for Qwen3 prompts and LoRA handling
  • raise vllm_normal deps to torch>=2.8 and flash-attn>=2.8.3
  • tweak eval scripts to run Qwen3 VL 30B with tensor parallelism

@Silviase
Collaborator Author

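[Notes]

  • glm4v was dropped because it does not support inference with multiple images
  • glm4.5v exceeds 100B parameters, so it has not been verified
  • Ovis2 has a successor, 2.5, so that version was prioritized for verification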

@Silviase Silviase merged commit 85df021 into master Oct 17, 2025
1 check passed
@Silviase Silviase linked an issue Oct 17, 2025 that may be closed by this pull request

Development

Successfully merging this pull request may close these issues.

new model: qwen/qwen3

2 participants