vibn-agent-runner

Author	SHA1	Message	Date
mawkone	4375fbcb22	fix(preview): resolve 'command not found' by prefixing next dev fallback and prompt recipes with npx	2026-06-12 15:29:01 -07:00
mawkone	9b13320253	design(chat): remove un-wired dictation and select-mode buttons from composer to match Base44 simplicity	2026-06-12 11:05:01 -07:00
mawkone	bcf47b5c6c	fix(ai): enforce port 3000 exclusively for visual previews to match UI constraints	2026-06-11 11:23:01 -07:00
mawkone	82a41f7e95	fix(stop+stability): stop button interrupts live generation; classifier, prompt + preview pane improvements Stop button fix: - Plumb AbortSignal end-to-end: callVibnChat → Gemini SDK (config.abortSignal) / OpenAI fetch → executeMcpTool (/api/mcp fetch) - Treat abort as clean user stop (not fatal error); partial reply persisted with '(stopped by user)' Classifier fix: - Add timeout/gateway/5xx/connection-error vocabulary to diagnose intent - Prevents 'I get a gateway timeout' from falling through to feature_build (40 rounds) and looping Prompt / agent behaviour: - Render verification is now scope-aware: small edits stop at green healthCheck; no browser_console/curl audit on healthy server - Sanitize stale '### Phase Checkpoint' walls from loaded history so old threads stop biasing new turns - Next.js dev command updated to --no-turbopack for container stability (per-route lazy compile caused cold-start 503s) - New public page prompt: agent checks middleware allowlist in the same turn - Scope discipline and QA-tool gating carried forward from prior session Code cleanup: - Remove duplicate AgentPhase declaration (TS2440) - Remove dead checkpoint emit branch and orphan 'checkpoint' phase value - Remove unused MAX_TOOL_ROUNDS constant Preview pane (build status): - 4-state machine: initial-load / building (with elapsed timer) / build-failed / not-running - pollMs 0 → 5 000ms so dev-server recovery and build completion auto-update without refresh - anatomy route + use-anatomy type: inFlightBuild gains createdAt for elapsed timer	2026-06-10 21:40:48 -07:00
mawkone	39cb9194a5	feat(verification): acceptance-check layer + executor fix-loop; hide phase-checkpoint walls; guaranteed turn-end summary. Verification gated behind VIBN_VERIFICATION_ENABLED.	2026-06-10 19:43:36 -07:00
mawkone	e9d597de03	fix(governor): classify multi-word greetings ('hey there!', 'good morning') and short verb-less messages as conversational so they don't trigger the agent loop	2026-06-10 17:50:31 -07:00
mawkone	a87faa2353	ux(chat): clean tool-pill results (no raw JSON, Failed/exit verbs); structured build-health status instead of 'didn't reach a clean stopping point'; label active toolbar mode	2026-06-10 17:44:19 -07:00
mawkone	6fe774719a	fix(governor): stop misclassifying 'okay <request>' and investigative questions as conversational; raise status_check/diagnose tool budgets (fixes round_cap cut-offs)	2026-06-10 17:26:41 -07:00
mawkone	a4fe96496a	feat(telemetry): emit per-turn governor summary (stop_reason, rounds, tool_results) for orchestration diagnostics	2026-06-10 17:07:43 -07:00
mawkone	ef539d34a7	chore(telemetry): correct Path Confusion loop breaker implementation	2026-06-10 12:16:42 -07:00
mawkone	e6721a0b72	chore(telemetry): resolve loop crash caused by Temporal Dead Zone hoisting and fix conversational budget mapping	2026-06-10 12:04:13 -07:00
mawkone	8eaa20106a	feat(orchestration): implement state-based loop governor with forced checkpoints, phase events, and robust tool signatures	2026-06-10 11:55:17 -07:00
mawkone	a01f3331df	chore(telemetry): implement robust path-confusion stall guard, persist verify signatures correctly, and redact secrets from telemetry logs	2026-06-10 11:38:49 -07:00
mawkone	019211ecce	chore(telemetry): loosen error normalization to preserve status codes and line numbers for accurate verification signatures	2026-06-10 11:12:29 -07:00
mawkone	d433da56f9	chore(telemetry): resolve universal path normalizer logic and path-confusion tracking	2026-06-10 11:08:42 -07:00
mawkone	ca47d0643d	feat(telemetry): implement phase-based execution loop and adaptive tool budgets	2026-06-09 18:58:12 -07:00
mawkone	7ddcbfe32d	chore(telemetry): add path-confusion loop breaker and strict blank-preview diagnostic protocol	2026-06-09 16:27:09 -07:00
mawkone	7a9c2575f0	chore(telemetry): add path-confusion loop breaker and strict blank-preview diagnostic protocol	2026-06-09 16:10:45 -07:00
mawkone	dd510fe81f	chore(telemetry): verify signature comment and cleanup	2026-06-09 15:35:46 -07:00
mawkone	ef4a06a57c	ux(chat): tune silent-loop status nudge threshold to 6 rounds and strip leaked tool trace payloads from UI	2026-06-09 15:02:34 -07:00
mawkone	8c73f72680	chore(telemetry): jack up MAX_TOOL_ROUNDS to 150 for ultimate custom app-building runway	2026-06-09 14:30:36 -07:00
mawkone	f1d0c9e0b5	chore(telemetry): jack up MAX_TOOL_ROUNDS to 150 for ultimate custom app-building runway	2026-06-09 14:18:44 -07:00
mawkone	ad7d0face8	chore(telemetry): raise MAX_TOOL_ROUNDS to 60 for complete engineering runway	2026-06-09 14:16:31 -07:00
mawkone	1284078799	chore(telemetry): align attached-file reader to flattened project root path	2026-06-09 13:38:32 -07:00
mawkone	de1209afe4	chore(telemetry): refactor stall detector to track real state progress and persist non-null verify signatures across edit rounds	2026-06-09 13:36:30 -07:00
mawkone	6ec312f716	chore(telemetry): flatten the project slug layer and remove cd path instructions from system prompt	2026-06-09 13:28:57 -07:00
mawkone	3679ccf913	chore(telemetry): optimize state-based loop stall detector by tracking tool input signatures and clean up unused helper functions	2026-06-09 12:23:20 -07:00
mawkone	7b6cac5462	chore(telemetry): implement state-based loop governor, 180s tool timeout, visual-qa path fix, and fs_write diff-guard	2026-06-09 12:05:15 -07:00
mawkone	c442921ccb	chore(telemetry): add bulletproof mcp_token sanitization and read-only mode fallback in chat route	2026-06-09 10:47:32 -07:00
mawkone	492404cd14	chore(telemetry): fix agent loops, name mangling, dev server leaks, CWD alignment, and add daily session auditor	2026-06-08 16:09:58 -07:00
mawkone	4d40496739	feat: complete live-verified GTM onboarding flow & places autocomplete search proxies	2026-06-06 17:53:13 -07:00
mawkone	b3dd3714c3	feat(refactor): live zed-style codebase files autocomplete and context attachment	2026-05-21 17:20:31 -07:00
mawkone	8049a7f1ab	feat(refactor): premium zed-style chat UI, collapsible reasoning, and comprehensive strict type sweeps	2026-05-21 17:05:42 -07:00
mawkone	7c45fdc5cc	fix(ai): bump roundSinceText cutoff to 30 to prevent panic loops	2026-05-19 15:26:15 -07:00
mawkone	5573f1e6fa	fix(ai): restore thinking animations for gemini streams	2026-05-19 14:53:24 -07:00
mawkone	d7d4b2d2fe	fix(ai): bump loop-breaker limits from 16 to 30 to permit long autonomous workflows	2026-05-19 14:51:45 -07:00
mawkone	bbcd4ad55e	fix(ai): implement Phase 2 and 3 prompt recommendations from review	2026-05-19 13:47:18 -07:00
mawkone	618f7796b2	feat(ai): optimize tool loops, fix deployments, and integrate new onboarding flow	2026-05-19 12:52:47 -07:00
mawkone	6b8862ef2b	feat(api): comprehensive QA hardening — security gates, chat improvements, beta scaffolds Closes checklist items F-01..F-06, D-01..D-28, S-01..S-10, C-01..C-07, B-01..B-07, R-01..R-02, O-03. Security (28 deletions + 10 auth gates): - Delete 28 unauthenticated debug/cursor/firebase/test routes - Gate ai/chat, ai/conversation, context/summarize, work-completed with withTenantProject/withAuth - Add HMAC-SHA256 signature verification to webhooks/coolify - Switch all admin secret comparisons to timingSafeStringEq Foundations (lib/server/*): - api-handler.ts: withAuth, withTenantProject, withWorkspace, withAdminSecret, withRateLimit - logger.ts: structured request-scoped logging with turnId - audit-log.ts: writeAuditLog helper + audit_log table - rate-limit.ts: Postgres sliding window rate limiter - coolify-webhook.ts: verifyCoolifySignature - timing-safe.ts: timingSafeStringEq Chat hardening (chat/route.ts): - MAX_TOOL_ROUNDS 15 → 8 (C-01) - Loop detection: hard-break at 3 identical fingerprints (was 5) (C-02) - Add 6-consecutive-tool-call hard-break (C-02) - Mode: respond first, act second prompt block (C-03) - SSE heartbeat every 25s via setInterval (C-04) - Per-tool 45s timeout via Promise.race (C-05) - turnId per-turn UUID for log correlation (C-06) - Recovery fires when roundsSinceText >= 4 (C-07) - SSE plan event on plan_task_add/edit (B-05) Beta features: - invites table + GET/POST /api/invites (P4.8) - invites/[token] validate + redeem (P4.8) - fs_project_dev_servers table + lib/server/dev-server-state.ts (P6.B1) - fs_project_secrets table + CRUD routes (P6.D2) - lib/integrations/brief-extract.ts (P3.7) Documentation: - app/api/ROUTES.md: full route map with auth + tenant	2026-05-17 19:17:22 -07:00
mawkone	1545145292	fix(ai): implement two-stage loop detection to warn before hard-stopping (Fix 11)	2026-05-16 12:59:16 -07:00
mawkone	f2c29857b5	fix(ai): relax fs_edit line number enforcement to allow safe oldString replacements	2026-05-16 12:54:15 -07:00
mawkone	ca8a915fe2	fix(ai): sync auto-commit with streamed result to surface commit SHA to UI (Fix 10)	2026-05-16 12:26:54 -07:00
mawkone	89c9b01669	fix(ai): add hard-rule prompt clause forbidding unverified mutation claims (Fix 8)	2026-05-16 12:25:26 -07:00
mawkone	24a7689d69	fix(ai): force recovery summary when final tools fail (Fix 6)	2026-05-16 12:25:00 -07:00
mawkone	36c9dd47fb	fix(ai): implement fixes 4, 5, and 7 to broaden loop detection, tighten silent stretches, and lower tool round caps	2026-05-16 12:24:09 -07:00
mawkone	9f59b584e5	fix(ai): feed verified tool history back into model context to prevent hallucination compounding (Fix 3)	2026-05-16 12:22:58 -07:00
mawkone	f0d7548fe1	feat(ai): persist raw tool execution results in postgres to enable fine-tuning dataset extraction	2026-05-16 11:59:46 -07:00
mawkone	b92939ab86	fix(ai): update system prompt to enforce line-number usage for fs_edit to prevent 404 block mismatches	2026-05-15 16:38:17 -07:00
mawkone	485ed73572	feat(ai): inject dynamic codebase summary into system prompt to eliminate blind structure searches	2026-05-15 16:16:45 -07:00
mawkone	d464ccd19d	feat(ui): hardcode visual preview tab to primary frontend port 3000	2026-05-15 16:03:06 -07:00

1 2

57 Commits