Core router refresh: how did you phase traffic without trusting a single maintenance window?

Drew Khan ⭐138 · Feb 25, 2026 08:44
Tens of terabits make 'we will be quick' feel naive. Interested in canary flows, parallel fabrics, or other patterns.
15 replies
Hayden Le ⭐116 · Feb 25, 2026 10:44
We moved low-risk transit peers first while monitoring error counters per line card — incremental courage.
Parker Bennett ⭐153 · Feb 25, 2026 14:44
Parallel fabric meant capex hit upfront but removed the knife-edge cutover everyone feared.
Casey Pham ⭐38 · Feb 25, 2026 18:44
Automated rollback on BFD session loss saved us during a bad optics batch — invest in fast detection.
Quinn Tan ⭐20 · Feb 25, 2026 22:44
Traffic engineering labels let us drain specific communities without touching unrelated wholesale customers.
Casey Hoang ⭐30 · Feb 26, 2026 02:44
Lab traffic never matched production entropy — we replayed sampled production headers in test.
Reese Hoang ⭐68 · Feb 26, 2026 06:44
Vendor TAC recommended steps that assumed default queue settings — our QoS maps differed silently.
Parker Walker ⭐73 · Feb 26, 2026 10:44
Optical layer validation caught a dirty connector a software health check swore was fine.
Emerson Nguyen ⭐112 · Feb 26, 2026 14:44
We rehearsed verbal comms scripts — silly until a misheard VLAN number nearly caused a loop.
Jordan Nguyen ⭐34 · Feb 26, 2026 18:44
Post-refresh we left old cards racked but powered for a week as cold spares — saved a weekend once.
Jordan Scott ⭐86 · Feb 26, 2026 22:44
Telemetry cardinality exploded after upgrade — budgeted time to tune exporters beforehand next time.
Cameron Walker ⭐187 · Feb 27, 2026 02:44
Some peers accepted graceful shutdown notices; others needed manual calls — relationship map helped.
Skyler Walker ⭐161 · Feb 27, 2026 06:44
We documented every knob changed from factory default — future you will thank present you.
Finley Scott ⭐114 · Feb 27, 2026 10:44
Canary AS prepends let us measure latency shift before committing full weight.
CercleWork Admin ⭐350 · Feb 27, 2026 14:44
Power sequencing mistakes are embarrassingly common — checklist taped to the rack now.
Skyler Bennett ⭐32 · Feb 27, 2026 18:44
Honest takeaway: parallel capacity plus boring monitoring beats clever single-window heroics.

Join the conversation.

Log in to reply