Reliability Stress Testing Costs Are Worth it Compared to the Cost of Failure

The design is finished, the manufacturing transition isaddressed.
complete, pre-production units have successfullyTest Limits
passed acceptance testing, the product is going outIf the equipment is being manufactured to a customer's
the door, and the design team is all ready for the nextspecification, then those specifications become the
project, right?minimum levels of stress. If the application is critical,
Not quite.then testing above these limits by 10-15 percent is
In this age of shrinking product and component size,recommended. If the equipment is being built to your
faster speeds and immense complexities, electronicown product specifications, then those specifications
products are pushing the envelope of reliablebecome the test limits. In more complex systems,
performance due to, among other things, self-heatingdifferent circuits have different failure rates. These can
effects and nanofabrication tolerances. In severebe estimated by using piece-part reliability ratings from
operating environments, reliable performance marginssuch references as MIL-HDBK-217. Those areas with
can easily be exceeded, resulting in catastrophicthe highest failure rates should be pinpointed for
system failures. Therefore, it is important that productsmaximum stress.
are tested over and above anticipated ambientTime & Cost
conditions to ensure specified performance in real-lifeThis can vary widely, depending on the extent of
environments.testing. Once the test limits have been defined, the
Unfortunately, extra reliability testing, because it occursprogram has to be implemented through hardware,
towards the end of a project when little or no fundssoftware or a combination of both. A typical stress
remain to perform adequate stress testing, sometimestest team would consist of a reliability engineer, a
gets orphaned. But ignoring or under-funding reliabilityhardware design engineer and a computer
stress testing can end up being one of the largest costprogrammer, if required. Once the team has defined
factors in a product's life cycle.the scope of the stress testing, then the time and
How Much Testing Is Enough?material estimates will determine the cost of the
So the question becomes, how much testing is neededprogram.
to confidently project long-term reliability? The scopeA note of caution: The design engineer who conceived
of reliability stress testing varies widely from product tothe circuit or system to be tested is NOT the engineer
product, depending mostly on the complexity andwho should design the test program. All too often, the
criticality of the system. In mission critical applications,system designer does not want his or her system to
such as tactical military situations or life supportfail, so the tendency is to design a test that the system
equipment requirements, extensive stress testingwill pass. "Design-to-pass" is poor criteria for devising a
should be performed to ensure maximum reliabilityreliability test and will undoubtedly end up giving the
over the life of the product. Less testing is adequateuser a false sense of security. It is much better to
for less critical applications.follow a "design-to-fail" principle. This will end up yielding
Testing microprocessor-based equipment presents itsmore realistic results. Unfortunately, design engineers
own set of challenges. Some believe that it takes afind it difficult to design a test circuit that may perhaps
smart test program to test a smart system. Notturn up deficiencies in their designs, so the test
necessarily. For example, a systems engineer onceprogram should be left to independent, impartial test
proudly described to me the test program he hadengineers.
designed for a sophisticated processor-based system.Capital Equipment
He claimed that he had included a series of neatlyIf the plan calls for in-house stress testing, then the
partitioned sub-routines that checked out all of themanufacturer must purchase or rent the required test
system's functions using test vectors, flashingequipment needed to simulate the stress environment.
indicators, plus a computer printout of the pass/failAgain, the actual equipment will be determined by the
condition of practically every circuit in the system. Herequired environmental limits (e.g., a temperature
challenged me to induce a fault in the system that hischamber for thermal stresses, a vibration table for
eloquent test program could not locate. I disabled theshock and vibration performance, an electromagnetic
main processor clock oscillator, which defeated hisradiation source for EMI/RFI susceptibility). In almost all
entire program. The point is that elaborate testing is notcases, subjecting the unit under test to temperature
required. The most effective tests are usually thevariations is a basic requirement. Other environmental
simplest, straight-forward ones.test functions are based on the anticipated operating
From the Simple to the Complexconditions of the equipment in its final environment.
Stress testing can vary from simply placingOutside Agencies Can Help
components in a cookie tin and subjecting them to aIf buying or leasing capital equipment is not practical,
few hot-cold temperature cycles to fully exercising aenvironmental testing labs may be an option. Many
system in its intended environment over maximumhave fully staffed reliability design engineers willing to
specified limits. No matter how simple or howassist in designing the overall stress test program. Third
complicated the equipment or device is, stress testingparty advisors such as Nerac, whose analysts include
at some level is a basic requirement. The only questionengineers experienced in the field of reliability stress
is how much or how little is required.testing, can assist with assessing and implementing an
A comprehensive stress testing program weighs theefficient and economical stress test program and can
risk factors anticipated at maximum rated operatinghelp find qualified labs to conduct the tests.
conditions and assigns a probability of failure with each.Designing, implementing and conducting efficient stress
The factors with the highest probability of failure shouldtesting can sometimes be as daunting as the design of
be thoroughly tested early in the stress test program.the system it is testing. It is complex, and many
Less stringent testing can be performed in the areasmanufacturers often underestimate or overlook the
of low probability of failure.importance of such a program. This can cost dearly in
How does one go about configuring an effectivefixing latent defects or funding product or system
stress test program? Three questions should berecall expenses down the road.