If a test is judged necessary, what should be the criteria for success or failure?

Should the new multipack carrier be tested?

If a test is judged necessary, what should be the criteria for success or failure?

How useful is the proposed test in addressing the management problem? What changes, if any, would you recommend?