mutation-based testing
DESCRIPTION
Mutation-Based Testing. Kyle Mundt February 3, 2010. What is Mutation-Based Testing?. Richard Lipton, 1971 A way of testing your tests Alter your code in various ways Check to see if tests fail on altered code Becoming popular again due to increased computing power - PowerPoint PPT PresentationTRANSCRIPT
Mutation-Based Testing
Kyle MundtFebruary 3, 2010
Richard Lipton, 1971 A way of testing your tests Alter your code in various ways Check to see if tests fail on altered code Becoming popular again due to increased
computing power Strongest use seems to be with Fortran and
C (procedural languages)
What is Mutation-Based Testing?
Mutant: code resulting from applying a mutation operator
Mutation (or Fault) Operator: rule used to create a mutant
Killing a Mutant: when tests fail on a mutant Mutation-Adequate: tests kill all mutants Equivalent mutant: produces same output
as original (can’t be killed) Mutation score: killed / (total – equivalent)
Terms
Original
if (a == 10)b = 3;
elseb = 5;
Example
Mutant
if (a != 10) b = 3;
else b = 5;
Apply mutation operators to code (only one change in each mutant)
Run mutants through test cases Check mutants that do not fail
◦ Equivalent mutants Change test cases to catch mutant Repeat
The Process
Don’t begin too early Code should already be written Tests should be reasonably thorough Begin with good set of test cases and code
that passes tests Iterative process
◦ Create mutants, run tests, fix, repeat
When to Perform?
Traditional Mutation Operators◦ Statement deletion◦ Invert logical operators◦ Replace arithmetic operators◦ Replace variable with another in same scope
Class-Level Mutation Operators◦ Object oriented◦ Change containers◦ Concurrency
Examples of Mutation Operators
If test results are the same◦ There is “dead code”◦ Mutant is equivalent to original◦ Test cases not complete
Code coverage not good enough Functional testing not enough Need way to analyze test effectiveness Useful for hardware verification Based on coupling effect: “ test data set that
detects all simple faults in a program is so sensitive that it also detects more complex faults”
Ties into automatic test generation
Why is it Important?
Hitting every line doesn’t guarantee good tests
Doesn’t ensure defects detected if they occur
Not good measure of verification effectiveness
Code Coverage
Testing functionality of program Deals with how program should work Mostly ignores how it shouldn’t Functional tests are subjective Also bad measure of verification quality
Functional Testing
Looking at, improving tests Convinces us tests will detect defects if they
occur Similar to techniques done manually on
hardware Need way to automate it to be practical
Analyzing Tests
Verify testbed Gives data to be used in improving chip
testing Hardware defects can cause unexpected
behavior Must ensure this behavior is caught Attempt to show defects found by tests
Hardware verification
Adequate automation Huge (maybe infinite) possible number of
mutants Fault classification and prioritization Which operators to use How to apply to OOP, concurrency, etc.
Problems
Mothra – 22 operators 6 operators account for 40-60% of mutants Redundant mutants (can be killed by same
test) Research shows 5 operators gives good
data◦ Gives 99% total mutant score and reduces
number of mutants by 77%
Selective Mutation
ABS – make value of each arithmetic expression be 0, positive, and negative
AOR – replace arithmetic operator with valid operators
LCR – replace each logical operator ROR – replace relational operators UOI – insert unary operators in front of
expressions Note this is used on Fortran, a procedural
language
Five Operators
Running whole program takes time Don’t really care about what happens after mutant
line hit So, don’t execute whole mutant Compare states of original and mutant after
mutated line executed Regular mutation requires three things
◦ Mutant be reached◦ Mutant creates incorrect state in program◦ Program state reflected as difference in test output
Weak Mutation only requires the first two Related to automatic test generation
Weak Mutation
Put all mutants into “metaprogram” One large program to compile and link Mothra turns program into intermediate
form Modifies intermediate form to create
mutants Interprets intermediate forms (interpreting
is slow) Schema-based is much faster than
interpretive systems
Schema-Based Mutation
Can reduce number of test cases executed◦ Once a test case is killed, remove it from further
testing◦ No reason to kill it twice
Random selection of mutants appropriate in some cases
Other Optimizations
Easy to come up with operators for simple types
User types much harder Operators have been developed for things
like◦ Inheritance◦ Polymorphism◦ Method visibility
Harder to mutate based on meaning of object
Some research applied to Java OO mutation
OO Mutation
Methods proposed to alter at code level◦ Allows changes to things like attributes
Mutants made this way may not compile Could cause integration errors Still doesn’t look at meanings Can write operators specifically for classes Probably not practical in almost all
situations How to actually change object state?
Changing Object State
Mutate commonly used Java libraries Again operators picked to try and simulate
common mistakes One solution found was to mutate
Containers, Iterators, and InputStream Java reflection can also be used to examine
object fields at runtime
One Approach
Collection Interface (most apply to List or Vector as well)◦ Make collection empty◦ Remove some element (first, last, random)◦ Reorder elements◦ Mutate element type
Iterators:◦ Skip element
InputStream◦ Skip bytes of data
Some Suggested Operators
First described for C programs Designed for integration testing, mutate
interface between modules Designed to scale to larger systems Actions taken to control number of mutants
◦ Only look at integration errors◦ Tests only connections between pairs of
subsystems◦ Mutates only module interfaces (eg. Function
calls, return values)
Interface Mutation
MuClipse◦ Open source mutation testing plug-in for
Eclipse µJava (muJava) Heckle (Ruby) Insure++ (C++) Nester (C#)
Implementations
C# programs for Visual Studios 2005 Only supports NUnit Framework Highlights code
◦ Killed mutations in green◦ Surviving mutations in red◦ Code not covered by tests in blue
Allows use of XML-based grammar to define own transformation rules
Nester
Different approach Used to find bugs in source code Creates functionally equivalent mutants
◦ Lines changed without changing expected results Tests should all still pass If a test fails, something wrong with code Insure++ reports faults and lines
responsible
Insure++
More complicated programs equals greater need for quality tests
Need way to ensure/measure test effectiveness
Mutation-Based testing can fill this role Still needs further research Not a lot of implementations or examples of
use on large scale Getting attention in hardware verification More work needs to be done to apply to OO
Conclusion
[1] Offutt, J. A. & Untch, R. H. (October 2000). Mutation 2000: Uniting the Orthogonal. Retrieved January 18, 2011 from http://cs.gmu.edu/~offutt/rsrch/papers/mut00.pdf
[2] Bakewell, G. (2010). Mutation-Based Testing Technologies Close the “Quality Gap” in Functional Verification for Complex Chip Designs. Retrieved January 18, 2011, from http://electronicdesign.com/article/eda/Mutation-Based-Testing-Technologies-Close-the-Quality-Gap-in-Functional-Verification-for-Complex-Chip-Designs/4.aspx
[3] Offut, J. A. (June 1995). A Practical System for Mutation Testing: Help for the Common Programmer . Retrieved January 18, 2011 from http://cs.gmu.edu/~offutt/rsrch/papers/practical.pdf
[4] Usaola, M. & Mateo, P. (2010). Mutation Testing Cost Reduction Techniques: A Survey. IEEE Software, 27(3), 80-86. Retrieved January 23, 2011, from ABI/INFORM Global. (Document ID: 2012103051).
[5] Kolawa, Adam (1999). Mutation Testing: A New Approach to Automatic Error-Detection. Retrieved January 18, 2011, from http://www.parasoft.com/jsp/products/article.jsp?articleId=291
[6] Alexander, R. T.; Bieman, J. M.; Ghosh, S.; & Ji, B (2002). Mutation of Java Objects. Retrieved January 18, 2011 from http://www.cs.colostate.edu/~bieman/Pubs/AlexanderBiemanGhoshJiISSRE02.pdf
[7] Nester - Free Software that Helps to do Effective Unit Testing in C#. http://nester.sourceforge.net/
Sources