[WIP] Start on a coding agent demo by msullivan · Pull Request #117 · vercel-labs/ai-python

msullivan · 2026-05-13T00:53:22Z

tau, a crappy coding agent using Textual.
(Even though I don't really like alternate screen coding agents, but.)

vercel · 2026-05-13T00:53:25Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
ai-python	Ready	Preview, Comment	May 13, 2026 6:04pm

socket-security · 2026-05-13T00:54:35Z

Review the following changes in direct dependencies. Learn more about Socket for GitHub.

Diff	Package	Supply Chain Security	Vulnerability	Quality	Maintenance	License
	pypi/anthropic@0.101.0
	pypi/vercel@0.5.8
	pypi/openai@2.36.0
	pypi/textual@8.2.5
	pypi/rich@15.0.0
	pypi/mcp@1.27.1
	pypi/pydantic@2.13.4
	pypi/ruff@0.15.12

View full report

The composer is no longer disabled mid-turn. Submissions go into a pending queue; run_turn is the sole consumer and loops until drained, popping one queued message per turn so user/assistant alternation stays clean.

All interaction with the `ai` library now lives in `chat_loop(app)` at the top of the file. TauApp owns public state (model, agent, messages, pending) that chat_loop reads. run_turn shrinks to a thin worker wrapper around the busy flag.

Streaming deltas only call scroll_end when the transcript was already at the bottom — scroll up to read earlier output and the stream keeps writing offscreen. The scrollbar itself is hidden (scrollbar-size: 0 0) because its per-chunk thumb motion was visually noisy; arrow keys, pageup/pagedown, and mouse wheel still scroll.

tools.py mirrors pi's built-in surface: read, write, edit, bash, grep, find, ls. Same schema shapes and continuation/truncation behavior: - read: 1-indexed offset/limit; head truncation at 2000 lines or 50KB with a 'use offset=N to continue' hint; first-line-too-big escape pointing at sed | head -c. - write/edit/bash: require_approval=True so the agent's default loop gates them behind a ToolApproval hook. - edit: exact-match, must-be-unique str_replace; multiple disjoint edits per call, applied right-to-left against the original file. - bash: tail truncation (errors live at the end), exit-code footer. - grep/find: skip .git/node_modules/etc; respect limit/byte caps. chat_loop now handles ToolEnd, ToolCallResult, and HookEvent — tool calls and their results render as 'tool' bubbles below the streaming text, and pending approvals turn the composer placeholder into a [y/n] prompt. Non-y/n input during an approval falls through to the message queue without resolving the hook.

Replaces the y/n-in-the-composer approval flow with a HookPrompt widget that mounts above the composer when a tool fires its approval hook. Yellow rounded border, shows the tool name + args; single-key shortcuts (y/a approve, n/d deny) resolve the hook. Focus shifts to the prompt automatically and returns to the composer on resolution. Tab cycles between prompt and composer if the user wants to look something up before deciding \u2014 the hook stays pending until y/n. Drops the parallel y/n branch from on_composer_submitted; the prompt owns the decision now.

Only bash still requires approval. File mutations through write/edit are fine — they're targeted, reversible, and the approval prompt became friction more than safety once both fired several times per turn.

… messages When TAU_ADVERTISE=1 is set, appends an instruction to the system prompt asking the model to include a Co-authored-by trailer in any commit messages it writes or suggests. Co-authored-by: anthropic/claude-sonnet-4.6, via tau

Co-authored-by: anthropic/claude-opus-4.6, via tau

Add ruff as a dev dependency with the same lint rules as the top-level project. Fix import sorting, timezone.utc → UTC alias, and line length issues. Co-authored-by: anthropic/claude-opus-4.6, via tau

Add a Static widget below the composer that displays running totals for input, output, cache-read, and cache-write tokens. Updated on each turn and when restoring a session. Co-authored-by: anthropic/claude-opus-4.6, via tau

Derive context estimate from the last assistant turn's input + output tokens and display it as 'ctx: ~N' in the footer bar. Co-authored-by: anthropic/claude-opus-4.6, via tau

Pass providerOptions.gateway.caching=auto as params to agent.run(). Update the usage footer to show uncached input as 'in' and cache-read tokens separately as 'cached'. Co-authored-by: anthropic/claude-opus-4.6, via tau

Co-authored-by: anthropic/claude-opus-4.6, via tau

Bash approval prompts now offer four options: [y] approve once, [n] deny, [!] always approve this exact command, [a] always approve all commands for the rest of the session. Co-authored-by: anthropic/claude-opus-4.6, via tau

File I/O tools (read, grep, find, ls, write, edit) now fire approval hooks. Paths under the working directory are auto-approved. External paths prompt with [y] yes, [n] no, [d] allow dir, [a] always all. Read and write directories are tracked as separate categories. Co-authored-by: anthropic/claude-opus-4.6, via tau

Co-authored-by: anthropic/claude-opus-4.6, via tau

ESC cancels the active worker and dismisses any pending approval prompt. Partial messages are saved before the stream is closed so context isn't lost. Co-authored-by: anthropic/claude-opus-4.6, via tau

Co-authored-by: anthropic/claude-opus-4.6, via tau

Check at_bottom once per event before any mutation. Tool calls, tool results, and new bubbles all respect the follow state now, so scrolled-up users aren't yanked down. Co-authored-by: anthropic/claude-opus-4.6, via tau

Bash now yields output lines as they arrive from the subprocess, emitting PartialToolCallResult events. The ConcatAggregator concatenates all chunks into the final tool result for the model. tau.py doesn't handle the streaming events yet. Co-authored-by: anthropic/claude-opus-4.6, via tau

vercel Bot deployed to Preview May 13, 2026 00:54 View deployment

vercel Bot deployed to Preview May 13, 2026 01:40 View deployment

msullivan force-pushed the tau-agent branch from 2e071d1 to 244f134 Compare May 13, 2026 06:48

msullivan added 17 commits May 13, 2026 00:38

[tau] Start on the coding agent demo

3944056

[tau] Queue user input while a turn is streaming

816e35d

The composer is no longer disabled mid-turn. Submissions go into a pending queue; run_turn is the sole consumer and loops until drained, popping one queued message per turn so user/assistant alternation stays clean.

[tau] Stop gating write and edit behind approval

ca90d55

Only bash still requires approval. File mutations through write/edit are fine — they're targeted, reversible, and the approval prompt became friction more than safety once both fired several times per turn.

[tau] Add session history with persist/resume support

fa28f07

Co-authored-by: anthropic/claude-opus-4.6, via tau

[tau] Add ruff config, lint & format

31efff8

Add ruff as a dev dependency with the same lint rules as the top-level project. Fix import sorting, timezone.utc → UTC alias, and line length issues. Co-authored-by: anthropic/claude-opus-4.6, via tau

[tau] Show cumulative token usage in footer bar

93e8f50

Add a Static widget below the composer that displays running totals for input, output, cache-read, and cache-write tokens. Updated on each turn and when restoring a session. Co-authored-by: anthropic/claude-opus-4.6, via tau

[tau] Show approximate context size in usage footer

99ac2fd

Derive context estimate from the last assistant turn's input + output tokens and display it as 'ctx: ~N' in the footer bar. Co-authored-by: anthropic/claude-opus-4.6, via tau

[tau] Enable gateway caching and improve usage display

c0a33e4

Pass providerOptions.gateway.caching=auto as params to agent.run(). Update the usage footer to show uncached input as 'in' and cache-read tokens separately as 'cached'. Co-authored-by: anthropic/claude-opus-4.6, via tau

[tau] Refactor chat_loop: extract _run_turn for single-turn logic

e5afb6b

Co-authored-by: anthropic/claude-opus-4.6, via tau

[tau] Ring terminal bell on turn completion and approval prompts

14ac685

Co-authored-by: anthropic/claude-opus-4.6, via tau

msullivan force-pushed the tau-agent branch from 244f134 to 14ac685 Compare May 13, 2026 07:38

vercel Bot deployed to Preview May 13, 2026 07:39 View deployment

msullivan added 5 commits May 13, 2026 10:22

[tau] Render assistant messages as markdown via Rich

be58130

Co-authored-by: anthropic/claude-opus-4.6, via tau

[tau] ESC to interrupt running turn

9cfecb9

ESC cancels the active worker and dismisses any pending approval prompt. Partial messages are saved before the stream is closed so context isn't lost. Co-authored-by: anthropic/claude-opus-4.6, via tau

[tau] Remove ctrl+d quit binding

a889f16

Co-authored-by: anthropic/claude-opus-4.6, via tau

[tau] Consistent scroll-follow behavior for all event types

401da53

Check at_bottom once per event before any mutation. Tool calls, tool results, and new bubbles all respect the follow state now, so scrolled-up users aren't yanked down. Co-authored-by: anthropic/claude-opus-4.6, via tau

vercel Bot deployed to Preview May 13, 2026 18:04 View deployment

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Start on a coding agent demo#117

[WIP] Start on a coding agent demo#117
msullivan wants to merge 22 commits into
mainfrom
tau-agent

msullivan commented May 13, 2026 •

edited

Loading

Uh oh!

vercel Bot commented May 13, 2026 •

edited

Loading

Uh oh!

socket-security Bot commented May 13, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

msullivan commented May 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vercel Bot commented May 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

socket-security Bot commented May 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

msullivan commented May 13, 2026 •

edited

Loading

vercel Bot commented May 13, 2026 •

edited

Loading

socket-security Bot commented May 13, 2026 •

edited

Loading