Add Gemini 3.1 Pro, Flash, and Flash Lite benchmark results by DUCKJAIII · Pull Request #4922 · Aider-AI/aider

DUCKJAIII · 2026-03-15T10:05:28Z

Description

This PR adds the Polyglot benchmark results for the latest Google Vertex AI Gemini models, including the new 3.1 preview models.

Models Benchmarked:

vertex_ai/gemini-3.1-pro-preview
vertex_ai/gemini-3.1-flash-lite-preview
vertex_ai/gemini-3-flash

Run Details & Edit Formats

Pro & Flash Models: Explicitly forced to use the diff-fenced edit format, which they handled exceptionally well.
Flash Lite: Allowed to default to the whole edit format to ensure stability and avoid syntax loop traps.
Environment: Ran via Aider's official Docker container. Note: Encountered some expected litellm timeouts / API connection drops due to strict Vertex AI quota limits on the preview endpoints, but the runs were successfully resumed and completed.

Benchmark Results (Pass Rate)

Gemini 3.1 Pro: 94.2%
Gemini 3 Flash: 82.7%
Gemini 3.1 Flash Lite: 68.4%

Let me know if you need me to upload any of the raw run directories for verification!

CLAassistant · 2026-03-15T10:05:34Z

All committers have signed the CLA.

DUCKJAIII · 2026-03-15T13:46:46Z

@CLAassistant recheck

Add Gemini 3.1 Pro, Flash, and Flash Lite benchmark results

d7afbc9

DUCKJAIII force-pushed the add-gemini-3-benchmarks branch from 49f30d9 to d7afbc9 Compare March 15, 2026 13:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Gemini 3.1 Pro, Flash, and Flash Lite benchmark results#4922

Add Gemini 3.1 Pro, Flash, and Flash Lite benchmark results#4922
DUCKJAIII wants to merge 1 commit intoAider-AI:mainfrom
DUCKJAIII:add-gemini-3-benchmarks

DUCKJAIII commented Mar 15, 2026

Uh oh!

CLAassistant commented Mar 15, 2026 •

edited

Loading

Uh oh!

DUCKJAIII commented Mar 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

DUCKJAIII commented Mar 15, 2026

Description

Run Details & Edit Formats

Benchmark Results (Pass Rate)

Uh oh!

CLAassistant commented Mar 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

DUCKJAIII commented Mar 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

CLAassistant commented Mar 15, 2026 •

edited

Loading