- Feature Articles
- CodeSOD
-
Error'd
- Most Recent Articles
- Secret Horror
- Not Impossible
- Monkeys
- Killing Time
- Hypersensitive
- Infallabella
- Doubled Daniel
- It Figures
- Forums
-
Other Articles
- Random Article
- Other Series
- Alex's Soapbox
- Announcements
- Best of…
- Best of Email
- Best of the Sidebar
- Bring Your Own Code
- Coded Smorgasbord
- Mandatory Fun Day
- Off Topic
- Representative Line
- News Roundup
- Editor's Soapbox
- Software on the Rocks
- Souvenir Potpourri
- Sponsor Post
- Tales from the Interview
- The Daily WTF: Live
- Virtudyne
Admin
Optimization is the root of all evil. Jeff is going to hell.
Admin
I think there was a spilling mistak. It should have read read the frist line...
Admin
This developer is clearly paid by line.
Admin
But look at the VALUE that was created by leveraging the existing infrastructure (disk drive and network) in such a manner...
Admin
I've seen this before.. just with database table rows instead of files. I got that monstrosity down to 45 seconds from 45 minutes..
Admin
Even a minute runtime seems pretty large. Reading 30k lines and sending it across should take a second.
If it has to take 30k round-trips, then I see another place for an optimization.
Admin
Huzzah! Remy's back!
We missed you and your sneaky HTML comments!
well, i did at any rate.
Captcha1: tristique
Admin
Oh boy, there are so many WTFs in the first paragraph, that I didn't have to read the rest of submission.
BTW, it's "ukrainian", not "ukranian" (those who read HTML comments will understand)
Admin
I guess this is an interesting and efficient way to do things.
Admin
I guess this is an interesting and efficient way to do things. Like writing comments, for example.
Admin
I guess this is an interesting and efficient way to do things. Like writing comments, for example. Captcha: luctus
Admin
I guess this is an interesting and efficient way to do things. Like writing comments, for example. Captcha: luctus TRWTF is not writing this over multiple articles.
Admin
Yeah, but that extra i costs another 50 quid and puts it over budget.
Admin
Pemy Rorter
Admin
The suspicion is that the original mistake was to put the "open" command within the loop by accident. On finding that this caused there to be a large number of unclosed files (breaking the maximum allowed), he said, d'oh! I forgot to close it, and added the "close" to the loop.
Then it was: "But it's reading the first line over and over again!" Ah yes, so let's fix that by deleting that pesky first line after I've processed it, so as to get to the second line.
Oh but hang on, that's no good, I might want to keep the original file. Okay, let's, um, how do we do this? Er um (etc. etc.)
... and all for the want of being unable to identify what his first stupid mistake was.
Admin
No, I believe it's Ukrainian. Unless they are species, like dogs, humans or elves.
Admin
Admin
This is just defensive programming. should the process fail it will resume where it left of instead of having to process the whole file again ;-)
Admin
Indeed. VB6.
Admin
After some iterations it was down from 20h to less than a minute...
Admin
Admin
I recently worked with a Ukrainian offshore company that suddenly went silent, right around the time the Russians (we all know it was you) invaded. Haven't heard from them since. Shame really as I was getting a pretty good collection of WTFs ready to send in.
Admin
The classic O(n^2) algorithm for reading a file line-by-line... still doesn't beat the time I found it, in some similarly "enterprise" code, nested inside a crude bubblesort to make the whole thing O(n^4).
Admin
So what you're saying is, the Russian invasion had some good points to it after all?
Admin
heh
Admin
I guess there really are people who do things like this.
Back in the dark ages, I improved a numerical search by an average factor of 8000 or so -- turned a linear search through an unordered table into a binary search through a sorted one. Was never subsequently able to celebrate with colleagues who chipped a percent or two off running times . . .
Admin
These are the types of DailyWTFs I really enjoy. It's like a good horror movie - all the fun is in the anticipation. I read the two routine names - ProcessFile and DeleteLine - and my stomach lurched.
The satisfaction in reading the code and confirming the train wreck of an implementation is just icing on the cake.
Admin
And he got a mumbled "thanks" for his efforts in saving the company hundreds of hours over the course of a year.
Admin
Actually it's український. Arguing over a transliteration is pointless unless it fails to approximate the original pronunciation.
Admin
Something similar happened at my last job. We had an ad-hoc piece of software thrown together to parse a bunch of flatfiles, load them into a database, and create a specialized report. It worked fine originally, but didn't scale well; some of our larger clients (huge TV networks with 3 letters in their names) were just generating too much data, and the report could take upwards of 10 hours to run.
Profiling showed that the bulk of the time was being spent uploading the data to the database. (Apparently sending a zillion individual INSERT requests over the wire, while holding a transaction open all that time, while the database was also managing regular business processes, can take a long time!) So I recoded it to use SQL Server's Bulk Insert capabilities and sent the new tool off to the Product Management person who was our contact with this client.
About 45 minutes later I got a panicked call from the Product person. "I just checked on the report tool, and it's just sitting there. It doesn't look like it's doing anything at all!"
So I logged in to the test database to check, and everything looked fine to me. When I told the PM person that it looked like the report had run successfully, she was shocked. "You mean it's done already?!?"
Admin
That was pre-russian invasion. Now it has to be just укранський
Admin
I think the 30k number was an arbitrary amount given for the sake of example. It could easily have been hundreds of thousands and into the millions of lines, I imagine.
Admin
In the spirit of throwing good money after bad, this is all negligible on an SSD drive. Let me order a 500GB SSD for my workstation to test and, then once confirmed, we can get another for the server.
There are no software problem too big for hardware to solve!
Admin
Plus one. Sure, it's slow, and sure it does a lot of operations (hint -- isn't that what computers are for?) AND if you power off the computer in the middle of the job, it will just pick up where it left off.
Who's to say that this is a WTF?
Without knowing the actual requirements, who knows. For all we know, perhaps the requirements are that "it's bullet proof as far as crash/restart; and that performance is irrelevant."
Sure, there are other ways to do it -- for example, put each line in a DB, and mark the record after it's been processed; and when the job restarts, only process records that are not marked.
But suppose the task was assigned like this: As quickly as possible, write a processing loop that is guaranteed to send a line from the file to the processor exactly one time. Performance is not an issue, but I've allocated less than one hour of time for this task.
How else would you do it? Not everything needs to be "enterprise quality"; and not every task has the luxury of a week of design/programming time.
Admin
RinkWorks has a similar example
Admin
Every time it's some damn relative fkn it all up...
Admin
Perhaps this tool would be good as an SSD write cycle stress tester.
Admin
Bragging rights ~= improvement x (quality of original programmer).
Admin
This developer is clearly paid by the I/O operation
FIFY.
Admin
FTFBOY
Admin
This looks like VBA, which is abysmally slow. A minute runtime for a 10000 line file is actually pretty good in VBA if you can believe that.
Admin
Admin
Every tutorial I've ever seen about reading files emphasises "CLose the fucking thing", and I can imagine a lot of devs who read an example to read a line (open, read, close) and then whack it in a loop to make it do it multiple times....Unfortunately, a lot of dev's out there seem to think it's all about knowing the syntax of a language, not understanding how things work.
NB: I think modern porgramming languages abstracting a lot of the actual work away haven't helped any. I have nothing against these languages per se' (in fact I agree they can help whip up relatively complex stuff pretty quickly), but I think wanna be dev's (ie students, I suppose) should be forced to use low-level languages (preferably on low-spec machines) at least some of the time to get a bit of an understanding of what's going on and the impact that stupidity (and naiveity, and laziness, and....) can have on performance.
Admin
@DrPepper: someone failed their concurrency class :)
The implemented solution doesn't actually solve that problem. Say it crashed inside ProcessFile(), how would you know if it had already sent the line or not?
There's a reason why multi-phase commit protocols have been invented (and even those don't guarantee it, since a perfect solution is probably impossible).
Admin
Admin
This is BRILLIANT!
He built an Service Buss's queue functionality on top of a text file!
I've never been this drunk.
Admin
At first I wondered why the explanation of the WTF was neccessary. After all, we don't get explanations for Java, C# or PHP WTFs. Then I realized that the people who come here probably can't even tell the difference between VB6 and VBscript.
Admin
Been there, done that. Colleague shows me a slow performing program that had a linear search in a largish, already correctly sorted, array for each input record read.
Me: well, duh, use a binary search. Colleague: Use a what? Me: (sigh) Here, give me that keyboard.
Admin
Or simply record where you are up to in a seperate file...
Admin