| |
| ▲ | icedchai 6 days ago | parent | next [-] | | Capacitor problem or not, hardware does fail. Power supplies crap out. SSDs die in strange ways. A failure of a supposedly "redundant" SSD might cause your system to freeze up. | | |
| ▲ | mannyv 5 days ago | parent [-] | | One thing that we ran into back in the day was EEC failure on reboot. We had a few Dell servers that ran great for a year or two. We rebooted one for some reason or another and it refused to POST due to an EEC failure. Hauled down to the colo at 3AM and ripped the fucking ram out of the box and hoped it would restart. Hardware fails. The RAM was fine for years, but something happened to it. Even Dell had no idea and just shipped us another stick, which we stuck in at the next downtime window. To top it off, we dropped the failing RAM into another box at the office and it worked fine. <shrug>. |
| |
| ▲ | garganzol 6 days ago | parent | prev [-] | | Hardware still fails. It isn't a question of "if", it's a question of "when". Nothing lasts forever, the naivety lasts only so long too. | | |
| ▲ | fragmede 5 days ago | parent [-] | | Obviously. But you get duplicate hardware,
set up HA, get vendor support contracts, use multiple colors in disparit location. Cloud providers have figured this out fairly well, as we all did in the aughts. (Well, some of us anyway.) You can definitely determine that's a bunch of really annoying work and just pay a cloud provider to deal with it, or not, and go your own way. But if you want to be credible when saying that hardware fails, maybe people shouldn't use a problem from three decades ago as their example and use anything more recent? | | |
|
|