Discovercloud2030HA Troubleshooting [Tech Ops]
HA Troubleshooting [Tech Ops]

HA Troubleshooting [Tech Ops]

Update: 2025-04-25
Share

Description

This episode of the TechOps series goes into high availability troubleshooting. Not just high availability, not just troubleshooting, but actually talking through what it takes to manage and maintain and fix HA systems. This is part of a longer discussion we've been having and so there's some really interesting ideas in the middle of these discussions that I hope will shape your thinking as you build high availability systems, diagnostics and troubleshooting for people who are in high availability very complex environments.

Transcript: https://otter.ai/u/wM__4w1YIzZnhVdgLuXLsDDu0Ng?utm_source=copy_url

References:
https://status.openai.com/incidents/ctrsv3lwd797\
Comments 
00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

HA Troubleshooting [Tech Ops]

HA Troubleshooting [Tech Ops]

the2030.cloud Podcast