The dominant production failure was a dead dev-server process behind a
'running' DB flag (idle-stop / OOM / crash / host restart), which the UI
trusted and embedded -> permanent 502 until a manual restart.
- dev-container.ts: add isDevServerListening() fast liveness probe; stop the
container entrypoint from auto-running 'npx next dev --webpack' (it competed
with the managed server, forced the wrong bundler/cwd, and doubled memory);
drop the fake state='running' seed row; bump dev container memory 1g -> 2g.
- ensure route: verify a 'running' row is ACTUALLY listening and resurrect it
if dead, instead of trusting the flag; never bounce a healthy server.
- preview page: call ensure on every mount and on refresh (verify + heal),
force an immediate anatomy refetch on (re)start so a dead frame swaps to
'warming up' without the 5s lag.
Backstopped by the partial unique index + startDevServer idempotency, so heals
can never duplicate or thrash a server.