on test design - diva portal › smash › get › diva2:442409 › ...on test design sigrid eldh...

Mälardalen University Press DissertationsNo. 105

ON TEST DESIGN

Sigrid Eldh

2011

School of Innovation, Design and Engineering

ON TEST DESIGN

Sigrid Eldh

Akademisk avhandling

som för avläggande av teknologie doktorsexamen i datavetenskap vidAkademin för innovation, design och teknik kommer att offentligen försvaras

fredagen den 21 oktober 2011, 13.00 i Delta, Högskoleplan 1, Västerås.

Fakultetsopponent: dr Eleine Weyuker, AT&T Labs Research

Akademin för innovation, design och teknik

ON TEST DESIGN

Sigrid Eldh

Akademisk avhandling

som för avläggande av teknologie doktorsexamen i datavetenskap vidAkademin för innovation, design och teknik kommer att offentligen försvaras

fredagen den 21 oktober 2011, 13.00 i Delta, Högskoleplan 1, Västerås.

Fakultetsopponent: dr Eleine Weyuker, AT&T Labs Research

Akademin för innovation, design och teknik

AbstractTesting is the dominating method for quality assurance of industrial software. Despite its importanceand the vast amount of resources invested, there are surprisingly limited efforts spent on testingresearch, and the few industrially applicable results that emerge are rarely adopted by industry. At thesame time, the software industry is in dire need of better support for testing its software within thelimited time available.

Our aim is to provide a better understanding of how test cases are created and applied, and what factorsreally impact the quality of the actual test. The plethora of test design techniques (TDTs) available makesdecisions on how to test a difficult choice. Which techniques should be chosen and where in the softwareshould they be applied? Are there any particular benefits of using a specific TDT? Which techniques areeffective? Which can you automate? What is the most beneficial way to do a systematic test of a system?

This thesis attempts to answer some of these questions by providing a set of guidelines for test design,including concrete suggestions for how to improve testing of industrial software systems, therebycontributing to an improved overall system quality. The guidelines are based on ten studies on theunderstanding and use of TDTs. The studies have been performed in a variety of system domainsand consider several different aspects of software test. For example, we have investigated some ofthe common mistakes in creating test cases that can lead to poor and costly testing. We have alsocompared the effectiveness of different TDTs for different types of systems. One of the key factors forthese comparisons is a profound understanding of faults and their propagation in different systems.Furthermore, we introduce a taxonomy for TDTs based on their effectiveness (fault finding ability),efficiency (fault finding rate), and applicability. Our goal is to provide an improved basis for makingwell-founded decisions regarding software testing, together with a better understanding of the complexprocess of test design and test case writing. Our guidelines are expected to lead to improvements intesting of complex industrial software, as well as to higher product quality and shorter time to market.

ISBN 978-91-7485-037-6ISSN 1651-4238

ii

iii

Om Test Design

Mer än hälften av svensk export utgörs av produkter som styrs av datorsystem. En stor del av kostnaderna för utveckling av dessa system spenderas på att säkerställa att programvaran fungerar tillfredställande. För många produkter rör det sig om miljardbelopp och motsvarar typiskt 30-70% av den totala utvecklings- och underhållskostnaden.

Testning är den dominerande tekniken för kvalitetssäkring av industriell programvara. Testning innebär att systemet testas i en kontrollerad omgivning för att upptäcka fel och avvikelser från det förväntade beteendet. Kärnan i testning ligger i hur man konstruerar testfallen. Trots att industrin lägger stora resurser på testning och har stort behov av bättre testmetoder är området förvånansvärt outvecklat.

I denna doktorsavhandling försöker vi öka förståelsen för hur man konstruerar och använder testfall och vilka faktorer som påverkar testresultatet. Den stora mängden av testtekniker gör det svårt för den enskilda testaren att skapa bra testfall. Vilka tekniker hittar flest fel? I vilken ordning skall man använda dem? Vilka för- och nackdelar har de olika teknikerna?

Avhandlingen bygger på tio studier och presenterar bland annat en metod för att utvärdera olika testtekniker. Metoden har lett till ökad förståelse av hur felorsaker bör klassificeras för att generellt kunna jämföra olika tekniker mellan olika system. Vi introducerar även en klassificering av testtekniker som utgår från teknikernas effektivitet, dvs. förmågan att hitta fel och täcka in systemet, snabbhet att förstå och skapa testfall, samt användbarhet, dvs. för vilka system och i vilka situationer tekniken är användbar. Vårt mål är att underlätta testningen och därmed erbjuda bättre stöd och en bättre förståelse för den komplicerade process som testfallskonstruktion och testdesign är. Våra slutsatser sammanfattas i konkreta riktlinjer för testning av industriella system. Genom att följa dessa riktlinjer ges möjligheter till avsevärda förbättringar vid testning av komplicerade system, vilket förbättrar förutsättningarna för ökad produktkvalitet och förkortad utvecklingstid.

ii

iii

Om Test Design

Mer än hälften av svensk export utgörs av produkter som styrs av datorsystem. En stor del av kostnaderna för utveckling av dessa system spenderas på att säkerställa att programvaran fungerar tillfredställande. För många produkter rör det sig om miljardbelopp och motsvarar typiskt 30-70% av den totala utvecklings- och underhållskostnaden.

Testning är den dominerande tekniken för kvalitetssäkring av industriell programvara. Testning innebär att systemet testas i en kontrollerad omgivning för att upptäcka fel och avvikelser från det förväntade beteendet. Kärnan i testning ligger i hur man konstruerar testfallen. Trots att industrin lägger stora resurser på testning och har stort behov av bättre testmetoder är området förvånansvärt outvecklat.

I denna doktorsavhandling försöker vi öka förståelsen för hur man konstruerar och använder testfall och vilka faktorer som påverkar testresultatet. Den stora mängden av testtekniker gör det svårt för den enskilda testaren att skapa bra testfall. Vilka tekniker hittar flest fel? I vilken ordning skall man använda dem? Vilka för- och nackdelar har de olika teknikerna?

Avhandlingen bygger på tio studier och presenterar bland annat en metod för att utvärdera olika testtekniker. Metoden har lett till ökad förståelse av hur felorsaker bör klassificeras för att generellt kunna jämföra olika tekniker mellan olika system. Vi introducerar även en klassificering av testtekniker som utgår från teknikernas effektivitet, dvs. förmågan att hitta fel och täcka in systemet, snabbhet att förstå och skapa testfall, samt användbarhet, dvs. för vilka system och i vilka situationer tekniken är användbar. Vårt mål är att underlätta testningen och därmed erbjuda bättre stöd och en bättre förståelse för den komplicerade process som testfallskonstruktion och testdesign är. Våra slutsatser sammanfattas i konkreta riktlinjer för testning av industriella system. Genom att följa dessa riktlinjer ges möjligheter till avsevärda förbättringar vid testning av komplicerade system, vilket förbättrar förutsättningarna för ökad produktkvalitet och förkortad utvecklingstid.

iv

v

Acknowledgements

At the conclusion of this PhD I will have climbed my own Everest, a journey that made me stop and enjoy the view, over and over. I often thought I had reached the top, only to understand I reached a local maximum. With me, on this fantastic intellectual and developing journey, was my very experienced tour guide, Hans Hansson, a superb supervisor and experienced research leader, who has not only been a strong motivator, but has always been available to patiently listen to my self-doubt and insights, and thoroughly reading any word I have written. Hans, you are so generous and insightful. Together on this journey has been my other outstanding supervisor and spiritual mentor, Sasikumar Punnekkat. Thank you for broadening my views. Along the way young fellow climbers like Daniel Sundmark appeared and gave important advice at crucial times. Without all of you – this mountain would have been insurmountable. Thank you from the bottom of my heart and the depth of my mind. Thank you goes also to Joachim Wegener, the opponent at my Licentiate Thesis defense, who asked just the right questions at the right time. In addition, I must sincerely thank my “opponent” to this thesis, Elaine Weyuker, for all her hard work and her lifetime of great papers in the field. Many thanks go to my examination committee: Tom Ostrand, Ina Schieferdecker and Robert Feldt. To my dear colleagues at the SAVE-IT program and to all personnel at IDT at Mälardalen University who have supported me, I give many thanks for all our good times together. These thanks also extend to my wonderful supportive colleagues at Ericsson AB; it is always exiting to go to work. In particular, I give my thanks to Lars-Olof Gustafsson and Michael Williams, who have served as my industrial mentors. For funding my research in collaboration with Ericsson AB and The Knowledge Foundation SAVE-IT program, I give my thanks. And thank you, my Master Thesis Students. In fact, teaching and being with you has been an encouragement. I must particularly mention the following students that have been instrumental to this thesis: Mats Larsson, Peter Jönsson, Hans Bokvist, Jörgen Stenmark, Adithya Gollapudi, Arvind Ojha, Guido di Campli and Savino Ordine. This thesis would not be complete without giving my creative and caring

iv

v

Acknowledgements

At the conclusion of this PhD I will have climbed my own Everest, a journey that made me stop and enjoy the view, over and over. I often thought I had reached the top, only to understand I reached a local maximum. With me, on this fantastic intellectual and developing journey, was my very experienced tour guide, Hans Hansson, a superb supervisor and experienced research leader, who has not only been a strong motivator, but has always been available to patiently listen to my self-doubt and insights, and thoroughly reading any word I have written. Hans, you are so generous and insightful. Together on this journey has been my other outstanding supervisor and spiritual mentor, Sasikumar Punnekkat. Thank you for broadening my views. Along the way young fellow climbers like Daniel Sundmark appeared and gave important advice at crucial times. Without all of you – this mountain would have been insurmountable. Thank you from the bottom of my heart and the depth of my mind. Thank you goes also to Joachim Wegener, the opponent at my Licentiate Thesis defense, who asked just the right questions at the right time. In addition, I must sincerely thank my “opponent” to this thesis, Elaine Weyuker, for all her hard work and her lifetime of great papers in the field. Many thanks go to my examination committee: Tom Ostrand, Ina Schieferdecker and Robert Feldt. To my dear colleagues at the SAVE-IT program and to all personnel at IDT at Mälardalen University who have supported me, I give many thanks for all our good times together. These thanks also extend to my wonderful supportive colleagues at Ericsson AB; it is always exiting to go to work. In particular, I give my thanks to Lars-Olof Gustafsson and Michael Williams, who have served as my industrial mentors. For funding my research in collaboration with Ericsson AB and The Knowledge Foundation SAVE-IT program, I give my thanks. And thank you, my Master Thesis Students. In fact, teaching and being with you has been an encouragement. I must particularly mention the following students that have been instrumental to this thesis: Mats Larsson, Peter Jönsson, Hans Bokvist, Jörgen Stenmark, Adithya Gollapudi, Arvind Ojha, Guido di Campli and Savino Ordine. This thesis would not be complete without giving my creative and caring

vi

family a special mention for all their support; my dear mother Gudrun, brother and artist Johan, aunt Viola and cousin Sebastian come to mind. Finally, my love and gratitude goes to you, my wonderful husband Per. My dearest, I am so blessed to share my life with you and every moment we have together.

Älvsjö, September 2011

Sigrid Eldh

vii

Preface

The format and requirement of an academic thesis differs from that of e.g. a book or a course. Regardless, I have put extra effort in trying to view my main reader as a person from industry involved in creating software and eager to learn more about testing. It would be bold to claim that my findings are valid for all software systems, since there are always exceptions. Nevertheless, my goal has been that the research should target all, and not specifically telecommunication or middleware software. In fact, from a testing viewpoint, test design possesses similar problems, even if terminology and type of system varies. Test design is more important the more complex or quality demanding the software system is.

To guide the reader of this thesis the following may be helpful:

The first chapter should give a clear overview of the thesis for any reader.

A Tester or Developer should find particularly chapters 2, 13 and 14 most useful and straight forward, but there are a lot of tricks and hints on common problems relevant for developers and tester in most of the studies. The “Mistakes” study in Chapter 11 should be a “must read” for anyone writing a test case.

A Manager would probably enjoy the chapters mentioned above, where Chapter 2 is introductory, Chapter 13 is about test design techniques (and reasoning), and Chapter 14 is the plain guideline. In addition, we present a useful improvement method in our first study (Chapter 3) that could give some new insights.

For the academic reader, I have particularly attempted to keep the studies a similar look and feel, by providing initial summaries to each of the studies. Part III, the synthesis of my thesis, is more a proposal targeted to the industrial audience than a complete and fully validated academic effort. Rather, it may be viewed as a basis and inspiration for further studies. Despite its academic shortcomings, I did find it necessary to conclude and attempt a guideline for the industry that craves such information.

If you feel I have unjustly forgotten to quote any of your important work, I must beg your forgiveness in advance. Having read about testing for many decades, I do not possess a memory that traces all the

vi

family a special mention for all their support; my dear mother Gudrun, brother and artist Johan, aunt Viola and cousin Sebastian come to mind. Finally, my love and gratitude goes to you, my wonderful husband Per. My dearest, I am so blessed to share my life with you and every moment we have together.

Sigrid Eldh

vii

Preface

The format and requirement of an academic thesis differs from that of e.g. a book or a course. Regardless, I have put extra effort in trying to view my main reader as a person from industry involved in creating software and eager to learn more about testing. It would be bold to claim that my findings are valid for all software systems, since there are always exceptions. Nevertheless, my goal has been that the research should target all, and not specifically telecommunication or middleware software. In fact, from a testing viewpoint, test design possesses similar problems, even if terminology and type of system varies. Test design is more important the more complex or quality demanding the software system is.

To guide the reader of this thesis the following may be helpful:

The first chapter should give a clear overview of the thesis for any reader.

A Tester or Developer should find particularly chapters 2, 13 and 14 most useful and straight forward, but there are a lot of tricks and hints on common problems relevant for developers and tester in most of the studies. The “Mistakes” study in Chapter 11 should be a “must read” for anyone writing a test case.

A Manager would probably enjoy the chapters mentioned above, where Chapter 2 is introductory, Chapter 13 is about test design techniques (and reasoning), and Chapter 14 is the plain guideline. In addition, we present a useful improvement method in our first study (Chapter 3) that could give some new insights.

For the academic reader, I have particularly attempted to keep the studies a similar look and feel, by providing initial summaries to each of the studies. Part III, the synthesis of my thesis, is more a proposal targeted to the industrial audience than a complete and fully validated academic effort. Rather, it may be viewed as a basis and inspiration for further studies. Despite its academic shortcomings, I did find it necessary to conclude and attempt a guideline for the industry that craves such information.

If you feel I have unjustly forgotten to quote any of your important work, I must beg your forgiveness in advance. Having read about testing for many decades, I do not possess a memory that traces all the

viii

origins to my insights, some I cannot tell if they are my own or someone else’s. Instead, be happy that you have the same conclusions or that you might have influenced me.

All faults herein are mine and my supervisors should not be blamed – in fact, they have been helpful in so many ways, far beyond the scope of this thesis.

With these final words, I hope you enjoy my thesis, as much as I have enjoyed making it.

Sigrid Eldh

ix

Table of Contents

Part I Introduction ............................................................................ 1

Chapter 1. Overview of Research .............................................. 3

1.1 Why Software Testing is Important ...................................... 3 1.2 Brief Background .................................................................. 3 1.3 Research Objective ................................................................ 5 1.4 Overview of Research Methodology ..................................... 6 1.5 Research Scope ..................................................................... 8 1.6 Overview of Thesis ............................................................. 11 1.7 Contributions ....................................................................... 17

Chapter 2. Introduction to Test Design ................................... 23

2.1 Expectations on Testing ...................................................... 23 2.2 Test Process Introduction .................................................... 33 2.3 The Basic Process V-model ................................................ 33 2.4 W-model .............................................................................. 39 2.5 The Plethora of Publications in Software Test and Test Design ............................................................................................ 44 2.6 Historic Classifications ....................................................... 45 2.7 Overview of Test Design Techniques based on Groups ..... 48

Part II Empirical Studies ............................................................... 53

Chapter 3. Component Test Improvement through Software Quality Rank .................................................................................. 55

3.1 Summary.............................................................................. 55 3.2 Software Quality Rank ........................................................ 58 3.3 The Case Study .................................................................... 67 3.4 Conclusions ......................................................................... 74 3.5 Discussion ........................................................................... 76 3.6 Lessons Learned .................................................................. 78

Chapter 4. The Test Design Technique Comparison Framework .................................................................................. 79

4.1 Summary.............................................................................. 79 4.2 Introduction to the Test Framework .................................... 83