Designed For Reliability

Grovesy · 2008-07-22

sewiv:
Not quite clear on the WTF here. He flipped the wrong switch, the system went down. He kept his job? Is that the WTF?

No the WTF is that a system purchased because of it's uptime ability had been abused, so much so that the one time it did go down (accidently) it couldn't come back up again.

If the other engineers had fixed up the boot scripts and carefully checked and tested their changes the mainframe should have just come straight back up again.

...

A friend of mine used to manage Tandem machines; he's told me the story several times about his company purchased another company. Some ten years later it came around to shutting down the data centre of the company that had been brought out. They found a Tandem mainframe stuck in an old disused room, they had no idea what it did why it was there and anyone who would have known during the merger had long since gone... It had become part of the furniture. So someone had the bright idea of just shutting it down...

Several hours later the Port of Dover was backed up as a small, but very vital part of the Custom & Exercises system had mysteriously gone missing, all Hell had broken loose at his company as they had been fingered as the people who now ran the service… though no one seemed to be aware that they did, let alone where the machine was.

That Tandem has been sitting there for some ten years crunching data and spitting results back out again…

I want to file it under ‘urban legend’, but I always quite liked the story.

and it's ability to shove things in

Addendum (2008-07-22 10:47): **and it's ability to shove things in --

Ooops I was writing an email while writing this.. must have got the wrong window at some point..

2008-07-22

Back in the 90's, there was a bomb in the world trade centre. Apparently there were a whole bunch of Tandem machines in the basement -- and they all were blown over by the force of the blast. But they kept on working. They are very solid machines indeed.

By the way, I'm 'Chris B' :) ... and yes, I felt very foolish when I flipped the wrong switch. I'd only been at the company for a month or so!

2008-07-22

Didn't TFA mention him already being the redundant-redundant-redundant person?

The guru had his #1 (redundant) and #2 (redundant's redundancy) people both unavailable, so it fell on the redundant-redundancy's redundant machine maintainer to assure redundancy of the redundant system was restored and redundantly available?

Ouch.

2008-07-22

FredSaw:
That assuaged Chris a bit
He would probably have preferred that it assuage his fears. </pedantry>

Let's clean up the language. It should be "buttuaged."

2008-07-22

I work in investment industry and I can assure you that Tandem is still "all the rage". These things power the financial markets, they're at the heart of every major stock and commodity exchange in the world.

I'm sure there are a few Microsoft customers happy to read this story though - "see? good thing we have to reboot every week - otherwise we'd never get to test our startup scripts"

Designed For Reliability

CPU Failure

Holdup at the ATM

Featured Comments