Build a demo-ready Chrome extension in 12 hours that lets users execute web tasks via natural language, with very fast human-like navigation and reliable summaries.
- Product definition:
/Users/marcoshernanz/dev/hackeurope2/PROJECT.md - Implementation roadmap and status:
/Users/marcoshernanz/dev/hackeurope2/PLAN.md
Read these first at the start of every session.
- Hacker News: “Summarize the top 5 hackernews articles.”
- Gmail: “Give me a summary of my last 5 unread emails.”
All technical decisions should optimize these two flows for live demo performance and stability.
- Human-like but extremely fast navigation.
- Overall agent speed.
- Reliable and grounded output.
- Chrome Extension MV3 with background orchestrator + content executor.
- Claude is mandatory model.
- Constrained action schema only; no unconstrained free-form execution.
- Deterministic selectors and domain adapters for HN/Gmail first.
- Tight timeouts, retries, and hard caps to avoid demo stalls.
- Open
PROJECT.mdandPLAN.md. - Pick highest-priority
TODOtask fromPLAN.mdSection 4. - Implement minimal viable slice that keeps end-to-end path runnable.
- Verify with focused checks (unit/integration/manual as applicable).
- Update
PLAN.mdbefore ending turn:- change task statuses (
TODO/IN_PROGRESS/DONE/BLOCKED) - update Section 10
Progress Snapshot - append entry to Section 11
Work Log
- change task statuses (
- Report what changed, what remains, and immediate next task IDs.
After doing any changes (code, config, docs), PLAN.md must be updated in the same session to reflect:
- which parts are done
- which parts remain to do
No exceptions.
- Implementation complete for scoped behavior.
- Basic verification executed and results noted.
- Failure behavior handled (or explicitly documented).
PLAN.mdupdated with status and log entry.
- Keep model context compact (URL/title/candidates/snippets only).
- Prefer short action batches (3-5 steps).
- Use fallback selector chains for Gmail.
- Clip extraction text aggressively to reduce latency.
- Fail partial with explicit message instead of hanging.
- Always preserve a fallback path (
top 3variant). - Keep a deterministic per-domain backup flow available if planner degrades.
- Before demo: run both critical scenarios end-to-end once.
- Broad “agent for all websites” scope creep.
- Heavy UI frameworks or complex refactors during hackathon window.
- Long unbounded loops, weak parsing, or hidden failures.
- Review
PLAN.mdcurrent topTODOs. - Confirm environment and extension build/run command works.
- Execute next P0 task.
- Update
PLAN.mdstatus + work log.