-
Notifications
You must be signed in to change notification settings - Fork 699
Pull requests: open-compass/VLMEvalKit
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Fix] Video-MME-v2: improve acc calculation and data preparation
#1551
opened May 20, 2026 by
EliYuan30
Loading…
fix: guard choices[0] and message=None before content access
#1550
opened May 17, 2026 by
qizwiz
Loading…
[Cleanup] Remove unused Polygon3 dependency (#1528)
#1548
opened May 16, 2026 by
SHAI-Akshay-Tripathi
Contributor
Loading…
[WIP] Fix default judge model selection conflict in run.py and tools.py
#1532
opened May 6, 2026 by
TianhaoLiang2000
Contributor
Loading…
[Fix] Fix judge intermediate result caching and resume support
#1531
opened May 6, 2026 by
TianhaoLiang2000
Contributor
Loading…
[Benchmark] Add support for MMOral-Uni benchmark
#1527
opened Apr 26, 2026 by
isjinghao
Contributor
Loading…
Add LICA-Bench dataset (graphic design VLM evaluation)
#1513
opened Apr 15, 2026 by
purvanshi
Loading…
1 task
[Benchmark] Fix problems found in XSTest, M3oralbench, MSSBench, MMSafetyBench, Flames and SIUO_GEN.
#1503
opened Apr 1, 2026 by
Gugugugugutian
Contributor
Loading…
[Fix] fix MathCanvas-Bench md5 & summarize_mathcanvas_results
#1502
opened Mar 31, 2026 by
shiwk24
Contributor
Loading…
Fix LLaVA model output issues by using official conversation templates
WIP
#1424
opened Feb 2, 2026 by
cdllI
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.