open-compass / VLMEvalKit Public

Notifications You must be signed in to change notification settings
Fork 699
Star 4.1k

Code
Issues 205
Pull requests 37
Actions
Projects
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security and quality
Insights

Pull requests: open-compass/VLMEvalKit

Labels 17 Milestones 1

New pull request New

37 Open 938 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

[Fix] Match verbose Chain-of-Thought in eval

#1552 opened May 21, 2026 by ptonso

Loading…

[Fix] Video-MME-v2: improve acc calculation and data preparation

#1551 opened May 20, 2026 by EliYuan30

Loading…

fix: guard choices[0] and message=None before content access

#1550 opened May 17, 2026 by qizwiz

Loading…

[Cleanup] Remove unused Polygon3 dependency (#1528)

#1548 opened May 16, 2026 by SHAI-Akshay-Tripathi Contributor

Loading…

[Benchmark] Add Spatial-DISE benchmark

#1542 opened May 9, 2026 by shinmohuang

Loading…

[WIP] Fix default judge model selection conflict in run.py and tools.py

#1532 opened May 6, 2026 by TianhaoLiang2000 Contributor

Loading…

[Fix] Fix judge intermediate result caching and resume support

#1531 opened May 6, 2026 by TianhaoLiang2000 Contributor

Loading…

[Benchmark] Add support for MMOral-Uni benchmark

#1527 opened Apr 26, 2026 by isjinghao Contributor

Loading…

[Benchmark] Add support for ReVSI Benchmark

#1526 opened Apr 25, 2026 by eamonn-zh

Loading…

[Benchmark] Add support for Ref-L4_test benchmark

#1525 opened Apr 22, 2026 by rshube

Loading…

[Benchmark] Add support for RefCOCO-M benchmark

#1524 opened Apr 22, 2026 by rshube

Loading…

[Benchmark] Add support for PixmoPoints benchmark

#1523 opened Apr 22, 2026 by rshube

Loading…

[Benchmark] Add support for PixmoCount benchmark

#1522 opened Apr 22, 2026 by rshube

Loading…

Feat/concurrent dispatch

#1514 opened Apr 15, 2026 by Bluear7878

Loading…

Add LICA-Bench dataset (graphic design VLM evaluation)

#1513 opened Apr 15, 2026 by purvanshi

Loading…

1 task

[Benchmark] Add Support for LVOmniBench

#1510 opened Apr 9, 2026 by KD-TAO

Loading…

[Benchmark] Add support for MaXM

#1505 opened Apr 2, 2026 by inigopm Contributor

Loading…

[Benchmark] Fix problems found in XSTest, M3oralbench, MSSBench, MMSafetyBench, Flames and SIUO_GEN.

#1503 opened Apr 1, 2026 by Gugugugugutian Contributor

Loading…

[Fix] fix MathCanvas-Bench md5 & summarize_mathcanvas_results

#1502 opened Mar 31, 2026 by shiwk24 Contributor

Loading…

add Ming-flash-omni-2.0

#1501 opened Mar 30, 2026 by OMRailgun Contributor

Loading…

fix the vladbench download bug

#1497 opened Mar 25, 2026 by Depth2World Contributor

Loading…

[Model] Support for llava hf

#1479 opened Mar 11, 2026 by smgjch

Loading…

[API] Add DeepOCR pipeline API provider

#1473 opened Mar 4, 2026 by leejooan

Loading…

jt video chat v260227

#1460 opened Feb 28, 2026 by jiutiancv Contributor

Loading…

Fix LLaVA model output issues by using official conversation templates WIP

#1424 opened Feb 2, 2026 by cdllI

Loading…

Previous 1 2 Next

Previous Next

ProTip! Filter pull requests by the default branch with base:main.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!