AI Infrastructure

Patching Shouldn’t Kill Production: Dave Bermingham, SIOS Technology | TFiR

0

Most enterprise outages don’t come from hardware failures or cyberattacks. They come from the patch cycle. HA architectures built for disaster recovery leave a critical gap when it comes to planned maintenance — and that gap is where production goes dark.

The Guest: Dave Bermingham, Senior Technical Evangelist at SIOS Technology

Key Takeaways:

  • Most HA failures happen during maintenance windows, not random outages — HA architectures designed for disaster recovery don’t account for planned patching workflows
  • Application-level clustering enables rolling, near-zero downtime updates; hypervisor-level solutions like VMware HA and Hyper-V clustering still require the workload inside the VM to go offline
  • Configuration drift between nodes is a silent killer — servers diverge over time and failovers that worked in the lab behave unexpectedly in production
  • The standby-node-first approach — patch the standby, fail over, patch the original — reduces risk and preserves a fast rollback path
  • A documented, rehearsed patching playbook is the single highest-ROI improvement an IT team can make before the next maintenance window
Read Full Transcript & Technical Deep Dive

How Does JDK 26’s HTTP/3 API Transform Microservices Performance Using UDP | TFiR

Previous article

Project Glasswing Aims to Turn AI From Threat to Shield for Open Source Security

Next article