Notice history

Jan 2026

Feb 2026

Mar 2026

Jan 2026

Feb 2026

Mar 2026

Jan 2026

Feb 2026

Mar 2026

Jan 2026

Feb 2026

Mar 2026

Mar 2026

No notices reported this month

Feb 2026

Resolved
February 11, 2026 at 1:36 PM
Resolved
February 11, 2026 at 1:36 PM
This incident has been resolved.
Identified
February 11, 2026 at 1:14 PM
Identified
February 11, 2026 at 1:14 PM
We are continuing to work on a fix for this incident.
Investigating
February 11, 2026 at 1:04 PM
Investigating
February 11, 2026 at 1:04 PM
We are currently investigating this incident.

Postmortem
February 06, 2026 at 2:38 PM
Postmortem
February 06, 2026 at 2:38 PM
This incident was caused by a code refactoring that consolidated references to CLERK_PUBLISHABLE_KEY across several web sites. However, due to an oversight in omitting the variable name in the deployment script, the platform failed to locate the legacy CLERK_PUBLISHABLE_KEY in the production environment, resulting in page failures and inability to use platform.vm0.ai.
The related API services and container services were not affected.
The follow-up remediation plan primarily includes attempting to validate required environment variables during the build phase to prevent problematic code from being deployed. Additionally, introducing e2e testing for platform.vm0.ai to ensure the happy path workflow functions normally.
Resolved
February 06, 2026 at 2:29 PM
Resolved
February 06, 2026 at 2:29 PM
This incident has been resolved.
Identified
February 06, 2026 at 2:18 PM
Identified
February 06, 2026 at 2:18 PM
We determined that the issue originated from a recent deployment of the platform frontend code.
Investigating
February 06, 2026 at 1:28 PM
Investigating
February 06, 2026 at 1:28 PM
We are currently investigating this incident.

Jan 2026

Postmortem
January 19, 2026 at 8:23 AM
Postmortem
January 19, 2026 at 8:23 AM
Postmortem: Claude Code Hanging in Sandbox
Date: 2026-01-19
Severity: P0
Duration: ~4 hours
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
SUMMARY
Production agent runs were failing silently. Claude Code started but never produced output, timing out after 15+ minutes.
Root Cause: stdin was configured as "pipe" but never closed, causing Claude Code to hang waiting for EOF.
Why CI missed it: CI uses mock-claude which doesn't check stdin state. Real Claude Code does.
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
THE BUG
Before (hangs - stdin pipe never closed):
spawn(cmd, args, { stdio: ["pipe", "pipe", "pipe"] })
After (works - stdin is /dev/null, immediate EOF):
spawn(cmd, args, { stdio: ["ignore", "pipe", "pipe"] })
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
HOW WE FOUND IT
1. SSH into sandbox
2. cat /tmp/vm0-agent-*.log → empty (no Claude output)
3. ps aux | grep claude → process alive, using 23% memory
4. ps -p 510 -o wchan → ep_pol (waiting on I/O)
5. ls -la /proc/510/fd/0 → stdin connected to pipe
6. Manual "claude --print hello" → works (TTY mode)
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
WHY NO ROLLBACK
Release included database migration. Forward-fix was safer.
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
INVESTIGATION NOISE
Runner npm publish failure (@vm0/core not built) was unrelated but consumed investigation time.
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
ACTION ITEMS
✅ Fix stdin → "ignore" (#1316)
✅ Add spawn unit tests (#1319)
✅ Fix CI publish jobs (#1306, #1318)
🔲 Add real Claude test in CI (TODO)
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
KEY LESSON
Mock ≠ Real: CI must include at least one test with real Claude Code to catch behavior differences like stdin handling.
Resolved
January 19, 2026 at 7:38 AM
Resolved
January 19, 2026 at 7:38 AM
This incident has been resolved.
Update
January 19, 2026 at 7:03 AM
Update
January 19, 2026 at 7:03 AM
We have confirmed that the issue lies in the way VM0 calls the claude code. The minimal fix has been finalized and is currently being redeployed.
Update
January 19, 2026 at 6:21 AM
Update
January 19, 2026 at 6:21 AM
We have now restored normal operation for both the database and task dispatcher, and are currently working on getting the Claude code in the sandbox back online.
Update
January 19, 2026 at 5:50 AM
Update
January 19, 2026 at 5:50 AM
Locating the problem comes from a recent database change, the team is trying to fix the data that caused the problem
Update
January 19, 2026 at 2:50 AM
Update
January 19, 2026 at 2:50 AM
This glitch comes from a recent runner deployment, and the team is trying to fix the issue
Identified
January 19, 2026 at 2:45 AM
Identified
January 19, 2026 at 2:45 AM
We are continuing to work on a fix for this incident.
Investigating
January 19, 2026 at 2:40 AM
Investigating
January 19, 2026 at 2:40 AM
We are currently investigating this incident.

Jan 2026 to Mar 2026

VM0 - Notice history

All systems operational

Notice history

Mar 2026

Feb 2026

Jan 2026