beb0bebe8f
fix(governor): stop misclassifying 'okay <request>' and investigative questions as conversational; raise status_check/diagnose tool budgets (fixes round_cap cut-offs)
2026-06-10 17:26:41 -07:00
6d2ab7bd33
feat(telemetry): emit per-turn governor summary (stop_reason, rounds, tool_results) for orchestration diagnostics
2026-06-10 17:07:43 -07:00
9b6dfcab73
chore(telemetry): correct Path Confusion loop breaker implementation
2026-06-10 12:16:42 -07:00
62d78e088f
chore(telemetry): resolve loop crash caused by Temporal Dead Zone hoisting and fix conversational budget mapping
2026-06-10 12:04:13 -07:00
92ad20946c
feat(orchestration): implement state-based loop governor with forced checkpoints, phase events, and robust tool signatures
2026-06-10 11:55:17 -07:00
8eb941f1d9
chore(telemetry): implement robust path-confusion stall guard, persist verify signatures correctly, and redact secrets from telemetry logs
2026-06-10 11:38:49 -07:00
0e579be136
chore(telemetry): loosen error normalization to preserve status codes and line numbers for accurate verification signatures
2026-06-10 11:12:29 -07:00
6cbe7c2515
chore(telemetry): resolve universal path normalizer logic and path-confusion tracking
2026-06-10 11:08:42 -07:00
b1ad4fb363
feat(telemetry): implement phase-based execution loop and adaptive tool budgets
2026-06-09 18:58:12 -07:00
ee8109c927
chore(telemetry): add path-confusion loop breaker and strict blank-preview diagnostic protocol
2026-06-09 16:27:09 -07:00
69f7e79266
chore(telemetry): add path-confusion loop breaker and strict blank-preview diagnostic protocol
2026-06-09 16:10:45 -07:00
f1e1d1e753
chore(telemetry): verify signature comment and cleanup
2026-06-09 15:35:46 -07:00
5409d651b3
ux(chat): tune silent-loop status nudge threshold to 6 rounds and strip leaked tool trace payloads from UI
2026-06-09 15:02:34 -07:00
173eabd8bf
chore(telemetry): jack up MAX_TOOL_ROUNDS to 150 for ultimate custom app-building runway
2026-06-09 14:30:36 -07:00
65c8dcb6dc
chore(telemetry): jack up MAX_TOOL_ROUNDS to 150 for ultimate custom app-building runway
2026-06-09 14:18:44 -07:00
1301819501
chore(telemetry): raise MAX_TOOL_ROUNDS to 60 for complete engineering runway
2026-06-09 14:16:31 -07:00
341e40c8d5
chore(telemetry): align attached-file reader to flattened project root path
2026-06-09 13:38:32 -07:00
bb63755dcd
chore(telemetry): refactor stall detector to track real state progress and persist non-null verify signatures across edit rounds
2026-06-09 13:36:30 -07:00
5026cc64a3
chore(telemetry): flatten the project slug layer and remove cd path instructions from system prompt
2026-06-09 13:28:57 -07:00
9d32747e07
chore(telemetry): optimize state-based loop stall detector by tracking tool input signatures and clean up unused helper functions
2026-06-09 12:23:20 -07:00
4caf7755bf
chore(telemetry): implement state-based loop governor, 180s tool timeout, visual-qa path fix, and fs_write diff-guard
2026-06-09 12:05:15 -07:00
bf2b4f966a
chore(telemetry): add bulletproof mcp_token sanitization and read-only mode fallback in chat route
2026-06-09 10:47:32 -07:00
1f2fbd1115
chore(telemetry): fix agent loops, name mangling, dev server leaks, CWD alignment, and add daily session auditor
2026-06-08 16:09:58 -07:00
16a545a26a
feat: complete live-verified GTM onboarding flow & places autocomplete search proxies
2026-06-06 17:53:13 -07:00
2647919bd5
feat(refactor): live zed-style codebase files autocomplete and context attachment
2026-05-21 17:20:31 -07:00
50f65e337d
feat(refactor): premium zed-style chat UI, collapsible reasoning, and comprehensive strict type sweeps
2026-05-21 17:05:42 -07:00
1da2a0b166
fix(ai): bump roundSinceText cutoff to 30 to prevent panic loops
2026-05-19 15:26:15 -07:00
0a4b3af327
fix(ai): restore thinking animations for gemini streams
2026-05-19 14:53:24 -07:00
6a083ff261
fix(ai): bump loop-breaker limits from 16 to 30 to permit long autonomous workflows
2026-05-19 14:51:45 -07:00
d21daa8abf
fix(ai): implement Phase 2 and 3 prompt recommendations from review
2026-05-19 13:47:18 -07:00
93087d4f9a
feat(ai): optimize tool loops, fix deployments, and integrate new onboarding flow
2026-05-19 12:52:47 -07:00
5364bd8497
feat(api): comprehensive QA hardening — security gates, chat improvements, beta scaffolds
...
Closes checklist items F-01..F-06, D-01..D-28, S-01..S-10, C-01..C-07,
B-01..B-07, R-01..R-02, O-03.
Security (28 deletions + 10 auth gates):
- Delete 28 unauthenticated debug/cursor/firebase/test routes
- Gate ai/chat, ai/conversation, context/summarize, work-completed with withTenantProject/withAuth
- Add HMAC-SHA256 signature verification to webhooks/coolify
- Switch all admin secret comparisons to timingSafeStringEq
Foundations (lib/server/*):
- api-handler.ts: withAuth, withTenantProject, withWorkspace, withAdminSecret, withRateLimit
- logger.ts: structured request-scoped logging with turnId
- audit-log.ts: writeAuditLog helper + audit_log table
- rate-limit.ts: Postgres sliding window rate limiter
- coolify-webhook.ts: verifyCoolifySignature
- timing-safe.ts: timingSafeStringEq
Chat hardening (chat/route.ts):
- MAX_TOOL_ROUNDS 15 → 8 (C-01)
- Loop detection: hard-break at 3 identical fingerprints (was 5) (C-02)
- Add 6-consecutive-tool-call hard-break (C-02)
- Mode: respond first, act second prompt block (C-03)
- SSE heartbeat every 25s via setInterval (C-04)
- Per-tool 45s timeout via Promise.race (C-05)
- turnId per-turn UUID for log correlation (C-06)
- Recovery fires when roundsSinceText >= 4 (C-07)
- SSE plan event on plan_task_add/edit (B-05)
Beta features:
- invites table + GET/POST /api/invites (P4.8)
- invites/[token] validate + redeem (P4.8)
- fs_project_dev_servers table + lib/server/dev-server-state.ts (P6.B1)
- fs_project_secrets table + CRUD routes (P6.D2)
- lib/integrations/brief-extract.ts (P3.7)
Documentation:
- app/api/ROUTES.md: full route map with auth + tenant
2026-05-17 19:17:22 -07:00
cdddaced30
fix(ai): implement two-stage loop detection to warn before hard-stopping (Fix 11)
2026-05-16 12:59:16 -07:00
c06ab8650b
fix(ai): relax fs_edit line number enforcement to allow safe oldString replacements
2026-05-16 12:54:15 -07:00
f053567741
fix(ai): sync auto-commit with streamed result to surface commit SHA to UI (Fix 10)
2026-05-16 12:26:54 -07:00
3f4a0de2e6
fix(ai): add hard-rule prompt clause forbidding unverified mutation claims (Fix 8)
2026-05-16 12:25:26 -07:00
aff22cba93
fix(ai): force recovery summary when final tools fail (Fix 6)
2026-05-16 12:25:00 -07:00
ad66fe61c9
fix(ai): implement fixes 4, 5, and 7 to broaden loop detection, tighten silent stretches, and lower tool round caps
2026-05-16 12:24:09 -07:00
8bed453f7c
fix(ai): feed verified tool history back into model context to prevent hallucination compounding (Fix 3)
2026-05-16 12:22:58 -07:00
0494a67632
feat(ai): persist raw tool execution results in postgres to enable fine-tuning dataset extraction
2026-05-16 11:59:46 -07:00
7832e83542
fix(ai): update system prompt to enforce line-number usage for fs_edit to prevent 404 block mismatches
2026-05-15 16:38:17 -07:00
67c43028dd
feat(ai): inject dynamic codebase summary into system prompt to eliminate blind structure searches
2026-05-15 16:16:45 -07:00
377725a98f
feat(ui): hardcode visual preview tab to primary frontend port 3000
2026-05-15 16:03:06 -07:00
299addfcad
feat(ai): integrate open-design capabilities (templates, media generation, visual QA)
2026-05-15 11:07:44 -07:00
4862c76a60
feat(ui): add mobile preview device framing and design QA tools
2026-05-15 11:01:49 -07:00
6c85b1db34
fix(mcp): resolve external preview routing failures and correct monorepo git paths
2026-05-14 14:56:29 -07:00
d99e8f96c5
fix(ai): strip deepseek xml tags from chat history & secure git tools
...
This commit addresses the issue where DeepSeek's raw XML markup (like <tool_calls> and <think>) was leaking into chat history, causing hallucinations in subsequent turns. It also patches a vulnerability in the git commit tool where arbitrary shell injection was possible.
Additionally, it includes UX copy and color contrast adjustments for the marketing homepage breadcrumbs.
2026-05-14 11:34:42 -07:00
c8bbae6978
feat(marketing): replace landing page with new site design and animations
2026-05-13 20:50:02 -07:00
8bc8bc36f1
fix(ai): add system prompt guardrail to prevent infinite dev server loops when testing protected auth routes
2026-05-13 15:07:31 -07:00
575c38cb34
chore: convert submodules to standard directories for true monorepo structure
2026-05-13 14:54:23 -07:00