Testing Done Right

Coyne · 2011-03-22

I agree testing designed to reach some level of quality that will never be 100%.

Where I often see a lack in software is what I would call recoverability: The ability to correct a problem after it has occurred.

I'm generally a cautious individual. Experience has demonstrated that I should always have a fallback plan, which is really a chain of defenses thing.

If the application has no recoverability, then there is no fallback: The only thing between "everything is good" and "absolute disaster" is a fence called "everything works perfectly". When designing applications, one of the things that should always be done is to consider, "What if this doesn't work? What would be our fallback plan?"

Because, if you don't think about that and plan for that, then one day something doesn't work perfectly and you find yourself in absolute disaster land because you have no other line of defense. That is actually the source of some really good stories (in here as well as in other places). I'll relate one:

(At a previous life.) We bought a 3rd-party product for donation management. The builders of that product had a really interesting way of handling errors: They ignored all errors.
One of the processes was the daily account apply. You entered incoming donations in a batch; the apply process would then read the batch, update the accounts and delete the batch.

On the disaster day in question, the accounts table reached size limit part way (early on) through the processing of the batch and, since the developers ignored such mundane messages from the database as "the table is full and I can't insert this now", the process blythely continued on.

Then it deleted the batch.

No fallback. No way to recover the batch and so an entire day of entry by the user was lost.

Okay, so now let's create a fallback. That's hard, right? No, in this case actually it isn't: The solution is to back up the entire database before running the apply process. Every single time a batch is to be applied! That way, if something goes wrong, you fix the problem, restore, rerun and everything is cool.

...and usually, fallback is just like that. It mostly consists of one single element that I often see omitted: Keep the input state so that rerun is possible. There are "bazillions" of ways to do that; take your own pick.

But some people like to live on the edge and depend on the application doing everything right, and when it doesn't, well, glad I'm not them.

Matt Westwood · 2011-03-23

Alex Papadimoulis:
Mr Frost:
I don't understand how testing would have had any impact in identifying a bug that was created post-deployment (the fridge was scratched in transit after it had been built).

It's a stretched analogy to begin with, but the idea here is more that the supply chain has a defect. In theory, LG could have invested more to fix this defect, but they instead accepted the loss of profit.

They might have done both. "Gahd dammit, Wolverine, that's your last screw-up! Go and get a job as a barber or something! Hmm ... reckon someone would buy this thing cheap? It's only scratched ..."

2011-03-23

Interesting read... You should really check ISTQB or ASTQB

http://www.istqb.org (or) http://www.astqb.org/

Particulary the foundation glossary. It has some of the concepts you mention here completely chewed out for you + tons more. (Especially the test levels and the risk calculation).

Regards, Niels (ISTQB CTAL TM)

HonoredMule · 2011-03-23

I'd just like to point out: it is still theoretically possible to get hit by a bus even if you never leave your house.

99.999%...

2011-03-24

Or, rather recently, from around where I live:

[image]

BTW: This is not funny. Two people died in this accident.

Testing Done Right

Types of Testing

An Inherent Risk

The Weakest Link

Defects Are Not Necessarily Problems

Quality or Quantity

Testing Done Right

Featured Comments