- Feature Articles
- CodeSOD
- Error'd
- Forums
-
Other Articles
- Random Article
- Other Series
- Alex's Soapbox
- Announcements
- Best of…
- Best of Email
- Best of the Sidebar
- Bring Your Own Code
- Coded Smorgasbord
- Mandatory Fun Day
- Off Topic
- Representative Line
- News Roundup
- Editor's Soapbox
- Software on the Rocks
- Souvenir Potpourri
- Sponsor Post
- Tales from the Interview
- The Daily WTF: Live
- Virtudyne
Admin
This could be a tale from almost any IT workplace in the world.
Admin
I've seen something similar to this, but I'm not sure I get what exactly happened.
Did they use the staging server instead of a production server? And why did that bring something down? (the boss is claiming that some business critical app is not in production because of this)
Admin
Admin
I would probably start using that server for any and all early patch testing, making it go down pretty frequently.
Admin
"I'm sorry we decommissioned that server as part of an upgrade to our development/staging environment. Since it was a staging server and nothing critical was supposed to be on it all data and applications deployed to it were lost"
(CAPTCHA damnum)
Admin
It's the gimp in the IT Helpdesk that bypassed the usual process that should be held accountable for this.
Admin
publicly available beta?
Admin
I never understand someone who immediately tries to defend the indefensible.
Fair enough if they admit; "Oh yeah, I kinda took a bit of a shortcut there to meet a deadline, but I know it was shit... I had meant to sort it out properly later and just haven't got round to it... sorry about that", but the ones who fuck up and then completely refuse to acknowledge they fucked up: fire them immediately.
Admin
So the gist of this is to kill all project managers on sight, yes?
New PM: Hi, I'm the new Project Mana- IT: EXTERMINATE! EXTERMINATE! New PM: Ahhhhh! Ahhhh! (They close in on him)
Admin
Eh? Whats it gotta do with the IT Helpdesk guy?
Someone rang him up, and asked him if he gave them permission to do something. Since it probably wasn't the helpdesk's jurisdiction, why shouldn't the guy there just shrug and say "yeah... I don't have a problem with that mate... knock yourself out."
If some random cunt phoned me up and asked if I was happy for him to do something that was absolutely nothing to do with me, I'd just say "yes" in a slightly confused tone, then hang up the phone. If it turned out that the guy then murdered a high ranking politician, I'm pretty sure I'd be in the clear.
Admin
TRWTF here is production clients accessing a staging server. Few things can cause more badness than staging machines communicating with the outside world.
Admin
This is why we can't have nice things. This is also the reason that staging servers get locked down as tightly as production. This is is also the reason that getting an application through the process has more layers than an onion. When people don't follow the rules, more rules typically get added as a knee jerk response to the "how did this happen". It appears the company is about midway in the life cycle.
Stage one of the life cycle it is the wild west and anything goes anywhere. No sign off, no problem. As the company matures they realize that they need things like change management, source control and other industry standards.
Standards begin small in the middle stage. Things can get done at this stage but the users who had grown to appreciate the speed at which things were done in the beginning stage lose faith in the system and begin to circumvent it. This triggers the final stage.
Final stage and nothing can be touched without three signatures. Duties are separated to prevent any one area from having control or authority. Red tape is the final product. At this stage progress grinds to a near standstill.
Bonus stage. Rise of the contractors. Since nothing can be done in the previous stage contract work begins. Work is farmed out to a company in stage 1 or stage 2. The original host company stays alive until they acquire their contract companies.
Admin
Hah, no, only the few with staging servers.
Admin
Victor was upgrading the staging servers. In doing so, he took down a production application that was running on one of the staging servers, not a production server. He didn't know this application was running on the staging server (indeed, it clearly shouldn't have been) because the owner of the application had made an end-run around the production deployment process by going through the IT help desk and nobody there even knew they had to tell anybody about it.
The most gratifying (for Victor and his boss) response is to simply say that (shrug) it's a staging server, and random applications installed there without our knowledge may be taken down at any time. However, it's not a very team-friendly attitude, and people who live in the real world prefer to merely imagine answering like that.
One interesting secondary WTF is that the IT help desk has install rights on a staging server.
Oh, and Victor's group needs an effective way to punish the PM's group for the trouble, because otherwise it will happen again, and again, and again.
Admin
I have a staging area, it's called my development machine :-)
Admin
Several WTFs but the main one, to my mind, is Victor not being notified (even unofficially) that there is a prod app on Staging.
Perhaps TRWTF is the PM not realising that doing the above will result in the prod app not being on a stable server.
Or perhaps TRWTF is the PM having responsibility for this sort of process without understanding the fact that this is a problem in the first place. Or maybe he's had a fight with someone who said "No, you can't do that because blah wibble incomprehensible engineery noise so far over your head you can't even hear it whistle" and won.
At the very least, the timescale of the project being that critical, the fact the deployment needs to be on Staging would have raised enough red flags that notification of this egregious state of affairs would have filtered through to Victor. "Oh boy, this is Staging - how often is that taken down? Every day at 7 am? Sheesh ..."
Admin
Fire them? Oh no, no, no. THOSE are the go-getters. The ones who think outside the box and do whatever it takes to accomplish the mission. They're management material.
Admin
Clearly the project manager made sure every user of TTP knew who to blame if there was a problem; and yet no one leaked what was going on to Victor's team.
And they tell us large conspiracies can't really happen...
Admin
+1
You told'em, dude!
Admin
Admin
...The guy in charge of promotions at my new job is named Vic... and let's just say none of that sounds impossible with our setup...
Admin
Big wtf, but only a small fallout.
We had a group of people from the business side that also knew a little bit about our code base. You could say they blended the line between IT and Business (though they firmly resided in the Business side with all the power that comes with that).
Well they developed a fun little application in Python that would run through all the data from sensors scattered throughout the US. This data was then stored historically to later be trended on.
They needed about 4 terabytes of space for their database, but none of theirs were provisioned for that much. So they poked around until they found a large unsuspecting database, and they used that.
Needless to say that was one of our testing grounds used by just about every group under the sun. One day one of those groups noticed the database was getting pretty big, and they requested all the data be truncated so they could start a new project using that space.
Well emails go out, and everyone signs off. Problem is the original group wasn't on the email chain, because no one knew they were using the database.
Well a couple of truncates later 7 months of production sensor data is gone that can never be recovered (This server only managed to get so much space provisioned for a test stack because it had no backups or redundancies)
Admin
I'll take devil's advocate here.
This business critical app is probably generating actual revenue. R&D was down to the wire and had to get creative. They bypassed the bureaucracy and procedures and got their work done in record time.
The loose ends were tied up by IT after the fact.
Sure it's not ideal. Ideally you'd know about every hiccup months in advance with time to plan for them and get all the necessary approvals. In the real world sometimes you have to get things done creatively.
The real WTF is the confrontational attitude of the PM. He got it done but he must have known it was bad. I hope his undiplomatic tone is simply embellishment to make a better story.
Admin
This is how "The Process" gets created...
The Process be praised! All glory to the Process!
Admin
That was exactly my thought. Aww, looks like the STAGING server had to go down again. Gee, what a shame.
Admin
Seems to me the real WTF is TTP's PM promising a go-live of April 1st, and not "x nmber of days after Development and UAT are finished."
When there's a dependency, set expectations based on that dependency.
Admin
In our client location, we cannot send anything to staging. there are 3 servers named Monika, Phibi and Rachel.
Monika is development. Phibi is test (UAT) Rachel is production.
I can only put stuff on Monika.
Admin
Nagesh, your naming standard is bad and you should feel bad. Unless Monika makes great cake, Phibi is stupid and Rachel has great tits. Then you're spot on.
Admin
what part of client location is not understanding by you? i am not naming the machines. some person with addictive tv disorders is doing that.
Admin
R&D complaining about lost revenue?
I thought the whole purpose of R&D was to burn through revenue generated by the business units and replace it with government grant money to help avoid excessive taxes through creative accounting.
Admin
So someone at your company has the entire DVD box set of the Russian Friends knockoff "Comrades"
Admin
The WTF is that there no such thing as development servers, testing servers or staging servers.
They are all PRODUCTION servers - for different groups.
Servers being used by developers are production servers - for the developers.
Servers being used for testing are production servers - for developers (and may QA).
Servers being used or staging are production servers - for QA.
If not, then a lot of people are not being productive and should be fired.
Admin
Admin
Admin
Victor's boss is nearly too good to be real...
Admin
Admin
"TTP" stands for "The TTP Project".
Admin
There's nothing wrong with bypassing the process to achieve a critical business goal but there's a right way and a wrong way to do so and they chose the wrong way.
The right way is to push the need to get the application into production ASAP up the chain of command to where someone in Victor's chain of command can decree "this has to go live on April 1st - you are authorised to cut whatever corners you need to from your usual process to achieve this but you should treat this as your top priority". If the PM responsible wasn't able to secure this, it can't have been as business critical as he thought he was.
Frankly for subverting the process the way he did, the PM should have been fired. And there are plenty of industries where doing it that way could drop the company into a heap of regulatory doo-doo.
Admin
I knew a SysAdmin who named his servers after supermodels. That resulted in some interesting comments like "Heidi Klum went down on me last night."
Admin
This. You'll inevitably get production tickets against dev versions, people doing production work that gets saved in (ephemeral) dev databases... There's just no way to stay sane in that scenario.
Admin
"I am not beholden to your empty promises. BTW, the staging server is going to be taken down for 3 days for maintenance."
Gotta treat people who think you worse than a commodity with the respect they deserve. It's the only way they'll learn.
Admin
TRWTF is staging servers not getting wiped regularly.
"You deployed your app on a staging server? Yeah, well, tough luck, they get wiped every morning."
Also rebooting of staging servers should be automated and just a click away for any developer, no need to call IT.
"Your apps keep getting rebooted? On a staging server? Go figure."
And what is that preposterous idea of anyone "objecting" before patching/upgrading the staging servers? If stuff breaks, it breaks, that's what they are for.
Captcha: persto. Got problems with your staging server? rm -rf /. Presto.
Admin
The only 'Business Need' that would be served by subverting the process was that the PM would not have to explain to R&D why TTP did not go live on April 1st as promised, (and most likely would get his bonus for finishing on schedule).
There is no fool like an April Fool.
Admin
Admin
Staying sane is not usually one of the requirements that makes the final cut.
Admin
The satisfaction you glean from this will last nearly as long as it takes to clean out your desk.
Admin
Yet another TDWTF about someone not filing the proper paperwork. There's no useful information here, nothing to learn from, just more geeks laughing at management mistakes. I'm getting tired of this. This is not a real WTF.
Unless the WTF is Victor's boss backing him without question.
Admin
Me too...
Admin
Admin