Designing and comparing automated test oracles for GUI-based software applications
Title | Designing and comparing automated test oracles for GUI-based software applications |
Publication Type | Journal Articles |
Year of Publication | 2007 |
Authors | Xie Q, Memon AM |
Journal | ACM Trans. Softw. Eng. Methodol. |
Volume | 16 |
Issue | 1 |
Date Published | 2007/02// |
ISBN Number | 1049-331X |
Keywords | Graphical user interfaces, GUI state, GUI testing, test oracles, User interfaces, widgets |
Abstract | Test designers widely believe that the overall effectiveness and cost of software testing depends largely on the type and number of test cases executed on the software. This article shows that the test oracle, a mechanism that determines whether a software is executed correctly for a test case, also significantly impacts the fault detection effectiveness and cost of a test case. Graphical user interfaces (GUIs), which have become ubiquitous for interacting with today's software, have created new challenges for test oracle development. Test designers manually “assert” the expected values of specific properties of certain GUI widgets in each test case; during test execution, these assertions are used as test oracles to determine whether the GUI executed correctly. Since a test case for a GUI is a sequence of events, a test designer must decide: (1) what to assert; and (2) how frequently to check an assertion, for example, after each event in the test case or after the entire test case has completed execution. Variations of these two factors significantly impact the fault-detection ability and cost of a GUI test case. A technique to declaratively specify different types of automated GUI test oracles is described. Six instances of test oracles are developed and compared in an experiment on four software systems. The results show that test oracles do affect the fault detection ability of test cases in different and interesting ways: (1) Test cases significantly lose their fault detection ability when using “weak” test oracles; (2) in many cases, invoking a “thorough” oracle at the end of test case execution yields the best cost-benefit ratio; (3) certain test cases detect faults only if the oracle is invoked during a small “window of opportunity” during test execution; and (4) using thorough and frequently-executing test oracles can compensate for not having long test cases. |
URL | http://doi.acm.org/10.1145/1189748.1189752 |
DOI | 10.1145/1189748.1189752 |