Skip to content

Pull requests: UKGovernmentBEIS/inspect_ai

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Inspect View: edit log tags and metadata in the viewer
#4014 opened May 22, 2026 by ransomr Collaborator Loading…
1 of 5 tasks
text_editor: fix corrupted-history failure mode and cap history size
#4010 opened May 22, 2026 by tadamcz Contributor Loading…
feat: add host-mode backend for computer() tool
#4009 opened May 22, 2026 by marov Loading…
1 of 5 tasks
Remote checkpoint directory support
#4007 opened May 21, 2026 by epatey Collaborator Draft
fix(eval): record default epochs reducer consistently
#4001 opened May 21, 2026 by herbert-apollo Contributor Draft
1 of 5 tasks
Thread exclude_fields through eval log readers
#3990 opened May 20, 2026 by RaviTeja-Kondeti Loading…
2 of 5 tasks
Fix Responses API replay for reasoning-only tool calls
#3985 opened May 20, 2026 by ckane Contributor Loading…
1 of 5 tasks
Bound transcript memory for long-running samples
#3971 opened May 18, 2026 by rasmusfaber Contributor Draft
3 of 5 tasks
Bound hook event queue to prevent OOM from slow hooks
#3962 opened May 18, 2026 by rasmusfaber Contributor Draft
1 task done
Atomic log file writes (#2949)
#3950 opened May 15, 2026 by antnewman Contributor Loading…
2 of 5 tasks
fix(model): record cached usage in per-sample and eval model_usage
#3946 opened May 14, 2026 by afspies Contributor Loading…
5 tasks done
fix: warn when model output is truncated by token limit
#3933 opened May 13, 2026 by awesome-pro Loading…
1 of 5 tasks
fix: preserve env_vars across eval-retry
#3932 opened May 13, 2026 by awesome-pro Loading…
1 of 5 tasks
feat(model): per-attempt ModelEvent retry accounting and timing
#3860 opened May 7, 2026 by sjawhar Contributor Loading…
1 task done
Add aggregate(key, agg=...) metric factory (#3735)
#3850 opened May 6, 2026 by antnewman Contributor Loading…
2 of 5 tasks
add exclude_fields to read_eval_log_async and samples_df
#3816 opened May 2, 2026 by RoshDSIT Loading…
1 of 5 tasks
Agent bridge: preserve ChatMessage and ToolCall ids across turns
#3805 opened Apr 30, 2026 by ezra-apollo Loading…
3 tasks done
Fix(scorer): strip % in numeric match for face-value comparisons
#3782 opened Apr 28, 2026 by RecreationalMath Contributor Loading…
2 of 5 tasks
ProTip! What’s not been updated in a month: updated:<2026-04-24.