monday, august 30, 2004info4990 information technology research methods (july, 2004) 1...

36
Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Post on 19-Dec-2015

218 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

1

Experimentation

INFO4990 – Week 6

Page 2: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

2

Agenda

Experimentation in Computer Science and information systems research

Basic experimentation concepts Some widely used experimental design in CS

and IS field Analyze data from experiment study

Page 3: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

3

History

Experiment in natural science systematic acquisition of new knowledge, testing

theory about nature Agriculture Chemistry …

Experimentation in social, psychology and economic studies Study people’s behavior E.g., fairness study

Page 4: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

4

Experiment in computer science research

Derived from natural science experimentation Computer systems performance analysis

Hardware Software Algorithm Network

Page 5: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

5

Experimentation in Information System research

Derived from social and economic experimentation

Subject under study is usually human Human behavior with regard to information

system Hyperlink transferred trustiness Which subject is most suitable for distance

learning

Page 6: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

6

Purpose of experiment

Discover and confirm causal relationship Examine the possible influences that one

factor or condition may have on another factor or condition

Page 7: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

7

Basic experimentation concepts

Independent variable Cause Research “measure” (manipulate) independent variable by

creating a condition or situation Manipulation of independent variable create different

treatments. Event manipulation

Affecting the independent variable by altering the events that subjects experience

Presence versus absence Instructional manipulation

Varying the independent variable by giving different sets of instructions to the subjects

Page 8: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

8

Basic experimentation concepts (cont)

Effect (outcome) Physical conditions, behaviors, attitudes, feelings,

or beliefs of subjects that change in response to a treatment.

How to measure IS research: various data collection methods

Questionnaire, interviews, observation, test CS research: Metrics in the field

Performance time, rate, error rate, time to failure and duration

Page 9: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

9

The importance of control

Internal validity -- The extent to which we can accurately state that the independent variable produced the observed effect

Page 10: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

10

Experiment cases A marketing researcher wants to study how humor in television

commercials affects sales. To do so, the researcher studies the effectiveness of two commercials that have been

developed for a new soft drink called Zowie. One commercial, in which a well-known but serious television actor describes

how Zowie has a zingy and a refreshing taste, airs during the months of March, April and May. The other commercial, a

humorous scenario in which several teenagers throw Zowie at on another on a hot summer day, airs during the months of

June, July, and the August. The researcher finds that in June through August, Zowie sales are almost double what they

were in the preceding three months. “Humor boost sales,” the research concludes.

Many alternative explanations

Page 11: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

11

Strategies to achieve control Keep some things constant

What are variables that need to be held constant in most experiments?

Include a control group Treatment group (experimental group) Between-subjects design

Randomly assign people to groups Use matched pairs

Matched-subject design

Page 12: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

12

Between and matched-subjects design

18

3

2

67

10

49 5

Random assignment

1 10

5

7

6 3

8 42

9

treatment control

DV DV

23

7

5

92

8

1

10

64

3 8

1

5

4

7 2

10

9

6

Randomly assign one member of

each pair to each group

Page 13: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

13

Steps in conducting an experiment Identify the relevant variables State hypotheses Decide on an experimental design Decide the way to manipulate independent variables Develop a valid and reliable measure for dependent variable Pilot testing the treatment and dependent variable measures Recruit subjects (or locate cases) Assign subject to groups Introduce treatment to treatment groups Gather data for measure of the dependent variables Hypotheses testing

Page 14: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

14

Experimental design

One shot case study True experimental design Factorial design Block design

Page 15: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

15

Classic true experimental design

pretest-posttest

Treatment Versus control group

Randomized Experimental

design

http://trochim.human.cornell.edu/kb/desintro.htm

Vertical alignment shows twoPretests are measured at

same time

Page 16: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

16

Factorial design

Two or more independent variables are manipulated in a single experiment

They are referred to as factors The major purpose of the research is to

explore their effects jointly Factorial design produce efficient

experiments, each observation supplies information about all of the factors

Page 17: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

17

A simple example Investigate an education

program with a variety of variations to find out the best combination Amount of time receiving

instruction 1 hour per week vs. 4 hour per

week Settings

In-class vs. pull out 2 X 2 factorial design

Number of numbers tells how many factors

Number values tell how many levels

The result of multiplying tells how many treatment groups that we have in a factorial design

Page 18: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

18

Factorial designs in computer system performance analysis

Personal workstation design Processor: 68000, Z80, 8086 Memory size: 512K 2M or 8M bytes Number of disks: one, two or three Workload: Secretarial, managerial or scientific User education: high school, college, post-

graduate level Dependent variable

Throughput, response time

Page 19: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

19

22 factorial design

Two factors, each at two levels

Example: workstation design Factor 1: memory size Factor 2: cache size DV: performance in

MIPS

0

20

40

60

80

4M 8M

Memory size

Perf

orm

ance in M

IPS

1K

2K

Cache size

Memory size

4M byte 8M byte

1K 15 45

2K 25 75

Page 20: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

20

2K factorial design

K factors, each at two level

2K experiments 23 design example

In designing a personal workstation, the three factors needed to be studied are: cache size, memory size and number of processors

Factor Level -1 Level 1

Memory size 4Mbytes 16Mbytes

Catch size 1Kbytes 2Kbytes

Number of processors

1 2

Cache size (Kbytes)

4 Mbytes 16 Mbytes

1 proc 2 proc 1 proc 2 proc

1 14 46 22 58

2 10 50 34 86

Page 21: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

21

Full and fractional factorial design

Full factorial design Study all combinations Can find effect of all factors

Fractional (incomplete) factorial design Leave some treatment groups empty Less information May not get all interactions No problem if interaction is negligible

Page 22: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

22

2 factors full factorial design

Used where there are two factors that are carefully controlled

Examples in computer system performance analysis To compare several processors using several

workload To determine two configuration parameters such

as cache and memory size

Page 23: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

23

2 factors full factorial design (cont)

Example: cache comparison

workload Two caches One caches No caches

ASM 54.0 55.0 106.0

TECO 60.0 60.0 123.0

SIEVE 43.0 43.0 120.0

DHRYSTONE 49.0 52.0 111.0

SORT 49.0 50.0 108.0

Page 24: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

24

Field and controlled laboratory experiment

Field experiment Experiments conducted in real-life or field settings Researcher has less control over the experimental

condition Greater external validity but lower internal validity

Controlled laboratory experiment Conducted under controlled conditions of a laboratory Greater internal validity but lower external validity Practical consideration

Planning and pilot testing Instruction to subjects Post experiment interview

Page 25: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

25

Example of field and controlled laboratory experiments

Field experiment The case in slide 10

A controlled laboratory version Ask two group of subject (students) to view the

tape of two different Ads (event manipulation). Use questionnaire to collect their intentions to buy

the product. Compare the response from the two groups

Page 26: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

26

Analyzing data from between subject design

Problem You want to measure the

acquisition of mathematical skills by distance learning and traditional classroom learning. The study involves the comparison of 20 students, ten taught in classroom and ten taught by distance learning program. The final test scores were collected as dependent variable.

DL CL

94 90

89 91

76 83

85 81

88 74

65 60

70 69

72 63

68 62

64 63

77.1 73.6

Page 27: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

27

Why can’t we just compare the means

The difference between the means is the same in all three.

They tell very different stories

When we are looking at the differences between scores for two groups, we have to judge the difference between their means relative to the spread of variability of their scores

Page 28: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

28

T-test

t-test Assesses whether the means of two groups are

statistically different from each other Sample size is small Approximately normal distribution of the measure

in the two groups is assumed

Page 29: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

29

Perform t-test

Page 30: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

30

Interpret result

Set a significance level

Degree of freedom N1+N2 - 2

Compare t-value with critical value from t-distribution to see if it is larger enough to be significant

t-Test: Two-Sample Assuming Equal Variances

DL CLMean 77.1 73.6Variance 120.7666667 142.2666667Observations 10 10Pooled Variance 131.5166667Hypothesized Mean Difference 0df 18t Stat 0.682437133P(T<=t) one-tail 0.251825559t Critical one-tail 1.734063592P(T<=t) two-tail 0.503651117t Critical two-tail 2.100922037

Page 31: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

31

Analyzing data from matched subject design

Problem You want to compare the

hit rate of a two cache algorithms. The simulated cache algorithms are running on 5 benchmarks and the hit rate were recorded

Cache 1 Cache 2

0.91 0.95

0.67 0.65

0.85 0.90

0.73 0.80

0.93 0.97

0.818 0.854

Page 32: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

32

Suitable test: Paired t-test

Calculation of t-value

Degree of freedom N-1

t-Test: Paired Two Sample for Means

Cache 1 Cache 2Mean 0.818 0.854Variance 0.01292 0.01733Observations 5 5Pearson Correlation 0.973040321Hypothesized Mean Difference 0df 4t Stat -2.394684379P(T<=t) one-tail 0.037393209t Critical one-tail 2.131846782P(T<=t) two-tail 0.074786418t Critical two-tail 2.776445105

Cache 1 Cache 2 Difference D2

B1 0.91 0.95 -0.04 0.0016

B2 0.67 0.65 0.02 0.0044

B3 0.85 0.90 -0.05 0.0025

B4 0.73 0.80 -0.07 0.0049

B5 0.93 0.97 -0.04 0.0016

Total -0.18 0.011

Avg -0.036

)1(

)( 22

NNN

DD

Dt

Page 33: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

33

Analyzing data from factorial design

Problem The memory-cache

experiments were repeated three times each. The result is shown right

What we want to find out Which factor contribute

most to the performance What’s the joint effect of

the two factors

Cache size Memory size

4M 8M

1 K 15

18

12

(15)

45

48

51

(48)

2K 25

28

19

(24)

75

75

81

(77)

Page 34: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

34

Suitable test: ANOVA

2 way ANOVA (Analysis of Variance)

F-value Between-sample

variation/within-sample variation

ANOVASource of Variation SS df MS F P-value F crit

Sample 1083 1 1083 84.94118 1.56E-05 5.317655Columns 5547 1 5547 435.0588 2.93E-08 5.317655Interaction 300 1 300 23.52941 0.001271 5.317655Within 102 8 12.75

Total 7032 11

Distribution of Variance

100% 0.788823 0.15401 0.042662 0.014505

Totalvariance

Memorysize

Cachesize

Interaction Errors

Page 35: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

35

Statistical package

Excel SPSS SAS

Page 36: Monday, August 30, 2004INFO4990 Information Technology Research Methods (July, 2004) 1 Experimentation INFO4990 – Week 6

Monday, August 30, 2004 INFO4990 Information Technology Research Methods (July, 2004)

36

References

Paul D. Leedy and Jeanne Ellis Ormrod << Practical Research: Planning and Design >> 7th edition

Robert.B.Burns <<Introduction to Research Methods>> 4th edition

Raj Jain <<The art of computer system performance analysis by >>

www.socialresearchmethods.net http://www.statsoft.com/textbook/stathome.html