Previous incidents
May 2026
May 19, 2026
1 incident
CPU functions, GPU functions, and 5 other services are down
Downtime
Resolved May 19, 2026 at 2:04pm UTC
All services are back.
May 7, 2026
3 incidents
Internal data stores are failing
Downtime
Resolved May 8, 2026 at 2:30am UTC
All data stores have recovered.
Container Scheduling and RPC errors
Downtime
Resolved May 8, 2026 at 12:03am UTC
Container scheduling and RPC have recovered.
H200 Scheduling is degraded
Degraded
Resolved May 7, 2026 at 10:32pm UTC
We are still actively working on securing more H200 capacity. We recommend you use RTX6000s or H100s in the meantime. Thank you for your patience!