- Feature Articles
- CodeSOD
- Error'd
- Forums
-
Other Articles
- Random Article
- Other Series
- Alex's Soapbox
- Announcements
- Best of…
- Best of Email
- Best of the Sidebar
- Bring Your Own Code
- Coded Smorgasbord
- Mandatory Fun Day
- Off Topic
- Representative Line
- News Roundup
- Editor's Soapbox
- Software on the Rocks
- Souvenir Potpourri
- Sponsor Post
- Tales from the Interview
- The Daily WTF: Live
- Virtudyne
Admin
Without knowing much about the environment, but having a background with MSCS, custom and NLB/WLBS clusters it sounds as if there is either:
Either can cause the small loss of packets they are experiencing. Yes the infinite loop is the software developer's fault, but if you over-rate cabling (if copper) your packet failure rate on a well-tuned network should be as low as 1 packet in 1 trillion (I've built such networks).
Also avoid home-made cabling (bad NEXT/FEXT usually) and cheap switches :)
Just a thought... doesn't sound like there's a fix in the software portion of the system but preventing the house of cards from falling might be a solution.
-S
Admin
Driving in, slowly, is the answer.
Perhaps you missed Massimo's comment on the setup being a black box? Other than feeding power, he has no control over the thing?
I'd drive in slow, making sure the system is down for a while every time it happens. SInce it's PPV, that's lost revenue. The only way management will call the vendor to fix their application.
Admin
Admin
Admin
reminds me of a story in a book called "the hacker crackdown": a new version of a program the phone company used had a glitch...when one server went down, it would send a message to other servers that it was stopping for a reboot, then another message when it recovered. BUT a misplaced Return statement made the servers crash the second time they received a "recovered from a crash" message! this meant terribly slow performance for the phones all day-no dropped calls, just very slow to connect. the phone company fixed the problem and issued an apology, BUT the politicians, completely misunderstanding the explanation, decided that Hackers were responsible...partly because it happened on a holiday. (MLK day)