reasoning about shape as a pattern in variability arthur ... · 3 need to develop in order to...

1

Reasoning About Shape as a Pattern in Variability

Arthur Bakker

Freudenthal Institute

Utrecht University

The Netherlands

[email protected]

SRTL 3, July 23-28, Lincoln, Nebraska

2

Reasoning about shape as a pattern in variability

Background of the research

The first time I visited an American classroom I attended a statistics lesson in grade 5. When the

teacher asked a question that sounded statistical but did not require a measure of center, a boy

thoughtlessly muttered “meanmedianmode,” as if it were one word. My impression was that

these students had been drilled to calculate mean, median, and mode, and to draw bar graphs, but

did not use their common sense in answering questions. This small incident exemplifies what a

litany of research in statistics education reports on: too often, students learn statistics as a set of

techniques that they do not apply sensibly. Even if they have learned to calculate mean, median,

mode, and to draw histograms and box plots, they mostly do not understand that they can use a

mean as a group descriptor when comparing two data sets—to give just one example that is well

documented (Konold & Higgins, 2003; McGatha, Cobb, & McClain, 2002; Mokros & Russell,

1995). The problem that students do not know well how to use statistical techniques in data

analysis is not typically American; it also applies to the Dutch context, but to a lesser extent. The

reason for this is probably that Dutch students learn most statistical concepts and graphs such as

median, mode, histogram, and box plot about three years later than in the USA.

Despite differences between the curricula in different countries, the underlying problem remains

the same: students generally lack the necessary conceptual understanding for analyzing data with

the statistical techniques they have learned. Konold and Pollatsek (2002) argue that students need

to develop a conceptual understanding of signal and noise in order to understand what an average

value or a distribution is about in relation to the variation around that value or distribution. In this

context, it is useful to distinguish two types of patterns in variability, or signals in noisy

processes. First, the signal can be a true value with error as noise around it. Such signals are

apparent in repeated measurements of one item (Petrosino, Lehrer, & Schauble, 2003). The

“center clump” is then an indication of where the true value probably is. Second, the signal might

be a distribution, such as the smooth bell curve of the normal distribution, with which we model

data. The noise in that case is the variation around that smooth curve. In either type of pattern, it

is evident that students need good conceptual understanding before they can recognize signals in

noisy processes. The focus of recent research seems to shift to the key concepts that students

3

need to develop in order to analyze data in a meaningful way. This paper focuses on the key

concepts of variability, sampling, data, and distribution. The main question of the research

presented here was, how do we promote coherent reasoning about variability, sampling, data,

and distribution in a way that is meaningful for students with little statistical background?

As a partial answer to this question, two instructional activities that support such reasoning are

presented in this paper. The first, growing a sample, is an elaboration of what was presented at

SRTL 2 (Bakker, 2002). It is similar to what Konold and Pollatsek (2002) suggest as a way to

give students the opportunity to experience stable features in variable processes. The second

activity, reasoning about shape, is a sequel to the growing samples activity in the sense that

students came to reason with the shapes they predicted during the growing samples activity for

very large samples of weight data (pyramid, semicircle, bell shape). Two skewed shapes were

added to direct the discussion toward skewness.

Key concepts of statistics at middle school level

There is no unique list of the key concepts of statistics. The list for the middle school level

presented in this section is inspired by a list of Garfield and Ben-Zvi (in press) for statistics

education in general. No particular order is suggested, because these key concepts are only

meaningful in relation to each other (cf. Wilensky, 1997).

• Variability

• Sampling

• Data

• Distribution

• Covariation1

Variability, the phenomenon that things are variable, is at the core of all statistical investigation

(Wild & Pfannkuch, 1999). If students do not expect any variability in a particular context, such

as the life span of batteries or other industrial contexts, they neither have an intuition of why one

would take a sample or look at a distribution. The terms “variability” and “variation” are

1 Covariation is not discussed further in this paper because it is not addressed in the Dutch mathematics curriculum for secondary education. See for instance Cobb, McClain, & Gravemeijer (2003).

4

sometimes used interchangeably, but here “variability” is used for the general phenomenon of

being variable, and “variation” for described or even quantified variability (Reading &

Shaughnessy, in press). In this paper, “variation” is taken as a characteristic of the distribution of

a data set, and “spread” is used as an informal student term for “variation.”

Sampling. To describe or predict a particular variable phenomenon, we need data and data are

mostly created by taking a sample. Students’ notions of sampling have been studied at the middle

school level, but not extensively. Though students seem to have useful intuitions about sampling,

it turns out to be a complex notion for students to learn (Jacobs, 1999; Konold & Higgins, 2003;

Rubin, Bruce, & Tenney, 1990; Schwartz, Goldman, Vye, Barron, & Vanderbilt, 1998; Watson,

2002; Watson & Moritz, 2000).

Data. Sampling and measurement lead to data, that is numbers with a context (Moore, 1997). As

indicated by Latour (1990) and Roth (1996), data have a history and a meaning in a context.

Understanding the key concept of data includes insight into why data are needed and how they

are created. This implies that the idea of data relies on knowledge about measurement. Too often,

however, data are detached from the process of creating them (Wilensky, 1997), so many

researchers and teachers advocate that students collect their own data (National Council of

Teachers of Mathematics, 2000; Shaughnessy, Garfield, & Greer, 1996). This is time consuming,

so “talking through the process of data creation,” as Cobb (1999) calls it, can sometimes

substitute for the real creation of data. A class discussion on why and how data are created is

necessary for students to experience a data set as meaningful. In fact, such a process of talking

through the data creation process is a way to address the sampling issue. For analyzing the data

we can manipulate data icons in a graph to find relations and characteristics that are not visible

from the table of numbers (Lehrer & Romberg, 1996).

Distribution. The problem with finding such relations and characteristics, however, is that many

students tend to perceive data just as a series of individual cases, and not as a whole that has

characteristics that are not visible in any of the individual cases. Hancock et al. (1992) note that

students need to mentally construct such an aggregate before they can perceive a data set as a

whole. Many researchers encountered the same problem and experienced the persistency of it

5

(Ben-Zvi & Arcavi, 2001; Wilensky, 1997). To move from a case-oriented view to an aggregate

view on data, students need to develop a conceptual structure with which they can conceive data

sets as aggregates. The concept of distribution is such a structure (Cobb, 1999; Gravemeijer,

1999). In the words of Petrosino et al. (2003, p. 132): “Distribution could afford an organizing

conceptual structure for thinking about variability located within a more general context of data

modeling.” Of course, distribution is a very advanced concept (Wilensky, 1997) that, in its full

complexity, is far beyond the scope of middle school students. Nevertheless, it is possible to

address the issue of how data are distributed from an informal situational level onwards by

focusing on shape (Bakker & Gravemeijer, in press; Cobb, 1999; Russell & Corwin, 1989). In

the research of Cobb, McClain, and Gravemeijer (2003), students came to reason with hills to

indicate majorities of data sets. Bakker and Gravemeijer (in press) report on how students

reasoned how a “bump” changed if older students would be measured, and what would happen

with the bump if the sample would grow. Students in this Dutch research tended to structure a

real or hypothetical data set into three categories of low, “average,” and high values. In other

words, they soon came to see a pattern in the variability of different phenomena such as weight,

height, and wingspan of birds. These “average” groups and “majorities” correspond to what

students in other studies called “clumps” or “clusters.” In fact, students use such “modal clumps”

(Konold et al., 2002) to indicate the center of a distribution and they can use the range of these

clumps to convey something about the spread of the data. This brings us to two core aspects of

distribution, center and spread.

Center. Traditionally, measures of center such as mean and median are taught before students

have developed a notion of center. Zawojewski and Shaughnessy (2000) reason that students can

find mean and median difficult because they have not had sufficient opportunities to make

connections between center and spread; that is, they have not made the link between measures of

central tendency and the distribution of the data sets (p. 440). It is only possible to choose

sensibly between mean and median if one takes the context and the distribution of the data into

account. If the distribution is skewed and there are outliers, one might choose the median as a

measure of center, but there can also be reasons to use the mean—depending on the context. It is

therefore important to give students opportunities to learn ways to describe how data are

distributed, perhaps even before teaching more formal measures of center.

6

Spread or variation. Variation did not receive much attention in statistics education research

until about 1997. The reason for neglecting variation in education has probably been the focus on

the measures of center (Reading & Shaughnessy, 2000; Shaughnessy, Watson, Moritz, &

Reading, 1999). Noss, Pozzi, and Hoyles (1999) report on nurses who view variation as crucial

information when investigating blood pressure, and Lehrer and Schauble (2001) report on

variation in a physical and biological contexts. For an overview of the research literature on

variation see, for example, Meletiou (2002).

The rationale of the learning trajectory used in the research presented in this paper was that

students would be engaged in coherent reasoning about variability, data, sampling, and shape

from the outset. The learning process aimed at was characterized as “guided reinvention”

(Freudenthal, 1991). Students were stimulated to contribute their own ideas, strategies, and

language in solving statistical problems (reinvention), but they were also provided with

increasingly sophisticated ways to describe how data were distributed and to characterize data

sets (planned guidance). The software tools used were two so-called “minitools,” which Bakker

and Gravemeijer (in press) also used in teaching experiments in grade 7. In grade 7, it turned out

to be difficult for students to reason with shape, except for the high achievers who reasoned with

bumps. The next teaching experiment was therefore carried out in a higher grade. The next

section provides information on the methodology used and the subjects involved in this eighth-

grade teaching experiment. After the analysis of the two instructional activities, implications for

research, teaching, and assessment are discussed.

Methodology and subjects

For answering the main question of how coherent reasoning about variability, sampling, data,

and distribution could be promoted, a design research study was carried out. In line with the

principles of Realistic Mathematics Education (Freudenthal, 1991; Gravemeijer, 1994) and the

NCTM Standards (2000), we looked for ways to guide students in being active learners, in this

case dealing with increasingly sophisticated computer minitools and students’ own

representations. Design research typically involves a preparation and a design phase (of

instructional materials for example), teaching experiments, and retrospective analyses (Cobb,

Confrey, diSessa, Lehrer, & Schauble, 2003; Gravemeijer, 1994).

7

Preparation and design phase. In the research presented here, the preparation phase consisted of

the reformulation of a hypothetical learning trajectory (Simon, 1995) that was developed during

design experiments in grade 7 (Bakker & Gravemeijer, in press). In the last seventh-grade

experiment, the activity of growing a sample turned out to be promising as a way to give students

the opportunities to reason about shapes of distributions (in particular “bumps”). This activity

was again used in grade 8 with the expectation that a more advanced type of reasoning about

shape could be fostered than in grade 7. In the seventh-grade study, it turned out that one of the

solutions to assist students overcome a purely case-oriented view of data, and develop an

aggregate view, was to let them predict hypothetical situations without any data. Examples

questions were, “What would the graph look like if a larger sample would be taken? What would

a weight graph of eighth graders look like compared to one of seventh graders?” To stimulate an

aggregate view we so to speak asked about forests instead of the trees and often stayed away

from data. Letting students invent and compare their own graphs had also appeared to be useful

in engaging them in meaningful mathematical activity. Because we conjectured that this was

easier for students to reason about data in well-known contexts, we again chose weight as the

context for the two activities described here.

Teaching experiment and data collection. The collected data include audio and video recordings

of class activities, student work, field notes, and interviews after every second lesson. An

essential part of the data corpus was the set of mini-interviews held during the lessons. Mini-

interviews varied from about twenty seconds to four minutes, and were meant to find out what

concepts and graphs meant for students. This influenced their learning, because the mini-

interviews often stimulated reflection. However, the validity of the research was not in danger,

since the aim was to find out how students learned to reason with shape or distribution, not

whether teaching the sequence in other eighth-grade classes would lead to the same results in the

same number of lessons. The teaching experiment was carried out in an eighth-grade class with

30 students in a public school in the center of the Dutch city of Utrecht in the fall of 2001. It

lasted for ten lessons of 50 minutes each, half of which were carried out in a computer lab. The

students in this study were being prepared for pre-university (VWO) or higher vocational

education (HAVO)—types of education that the top 35-40% of Dutch students attend. The

remaining 60-65% of Dutch students are prepared for other types of vocational education

(VMBO). In the practice of Dutch mathematics education, the school textbooks play a central role.

8

Students are expected to be able to work through the tasks by themselves, with the teacher

available to help them if necessary. As a consequence, tasks are broken down into very small

steps and real problem solving is rare. Students’ answers tend to be superficial because they have

to deal with about eight contexts per lesson (van den Boer, 2003). The students in the class that is

reported on here were not used to whole-class discussions, but rather to be “taken by the hand”

as the teacher called it. The eighth-grade students had had no prior instruction in statistics; they

were acquainted with bar and line graphs, but not with dot plots, histograms, or box plots. In the

research reported here, students already knew the mean from calculating their report scores

(grades), but mode and median were not introduced until the second half of the instructional

sequence after variability, data, sampling, and shape had been topics of discussion.

Retrospective analysis. For the retrospective analysis, the transcripts were read, the videotapes

watched, and conjectures formulated on students’ learning on the basis of the read and watched

episodes. The generated conjectures were tested against the other episodes and the rest of the

collected data (student work, field observations, tests) in the next round of analysis

(triangulation). Then the whole generating and testing process was repeated. This method

resembles Glaser and Strauss’s constant comparative method (Cobb & Whitenack, 1996; Glaser

& Strauss, 1967). About one quarter of the episodes (including those discussed in this paper) and

the conjectures belonging to these episodes were judged by three assistants who attended the

teaching experiment. Only the conjectures that all of us agreed upon were kept, and this was

more than 95%. The interrater reliability was therefore high. An example of a conjecture that

was confirmed was that students tended to group data sets (real or imagined) into three groups of

low, “average,” and high values.

Growing a sample

The overall goal of the growing samples activity as formulated in the hypothetical learning

trajectory was to let students reason about shape in relation to sampling, center, and spread. The

activity of growing a sample consisted of cycles of making sketches of a hypothetical situation

and comparing those sketches with graph displaying real data sets. In the fourth lesson of the

experiment, three such cycles took place as described below.

9

Figure 0. Minitools 1 and 2: value-bar graphs and dot plots with the same data set of battery life

spans.

First cycle of growing a sample

The text of the activity sheet started as follows:

Last week you all predicted graphs for a balloon driver. During this lesson you will get to see

real weight data of students from another school. We are going to investigate the influence of the

sample size to the shape of the graph.

a. Predict a graph of ten data values, for example with the dots of minitool 2.

The sample size of ten was chosen because the students found that size reasonable after the first

lesson in the context of testing the life span of batteries. Figure 1 shows different diagrams

students made to show their predictions: there were three value bar graphs (such as in minitool 1,

Figure 0), eight with only the endpoints (such as with the option to “hide bars”) and remaining

nineteen plots were dot plots (minitool 2).

Rikkert

Christel

10

Figure 1a. Student predictions (Rikkert, Christel, and Susan) for ten data points (weight in kg).

Figure 1b.Three real samples in minitool 2.

To stimulate the reflection on the graphs, the teacher showed three samples of ten data points on

the blackboard (the overhead projector had just broken down) and students had to compare their

own graphs (Figure 1a) with the graphs of the real samples (Figure 1b).

b. You get to see three different samples of size 10. Are they different than your own prediction?

Describe the differences.

The reason for showing three small samples was to show the variation among these samples.

There are no clear indications, though, that students conceived this variation as a sign that the

sample size was too small for drawing conclusions, but they generally agreed that larger samples

were more reliable. There was just a short class discussion on the graphs with real data before

students worked for themselves again. Please note that a grammatical translation into English of

ungrammatical spoken Dutch does not always sound very authentic.

Teacher: We’re going to look at these three different ones [samples in Figure 1b].

Can anyone say something yet? Give it a try.

S1: In the middle one, there are more together.

Teacher: Here there are many more together, clumped or something like that.

Susan

11

[pointing to the middle graph of Figure 1b] Who can mention other

differences?

S2: Well, uh, the lowest, I think it’s all the furthest apart.

Teacher: Those are all the furthest apart. Here they are in one clump. Are there

any other things you notice, Ricardo?

S3: Yes, the middle one has just one at 70. [This is a case-oriented view.]

Teacher: There’s only one at 70 and the rest are at 60 or lower? Yes?

Teacher: Can you say something about the mean perhaps?

S4: The mean is usually somewhere around 50.

For the remainder of this section, the figures and written explanations of three students are

demonstrated, because their work gives a representative impression of the variation of the whole

class: their diagrams represent all types of diagrams made in this class and the learning abilities

of these students varied considerably. Rikkert and Christel’s report scores were in the bottom

third of the class and Susan had the best report score of the class on the total of all subjects.

Rikkert: Mine looks very much like what is on the blackboard.

Christel: The middle-most [diagram on the blackboard] best resembles mine because

the weights are close together and that is also the case in my graph. It lies

between 35 and 75 [kg].

Susan: The other [real data] are more weights together and mine are further apart.

Rikkert’s answer is not very specific, like most of the written answers in the first cycle of

growing samples. Christel used the predicate “close together” and added numbers to indicate the

range, probably as an indication of spread. Susan used such terms as “together” and “further

apart,” which address spread. The students in the class used common predicates such as

“together,” “spread out” and “further apart” to describe features of the data set or the graph. Van

Oers (2000) calls the process of attributing predicates to features “predication.” “Predication is

the process of attaching extra quality to an object of common attention (such as a situation, topic

or theme) and, by doing so, making it distinct from others” (p. 150). For the analysis it is

important to note that the students used predicates (together, apart) and no nouns (spread,

average) in this first cycle of growing samples. Spread can only become an object-like concept,

something that can be talked about and reasoned with, if it is a noun. In the semiotics of Peirce

12

(1976), such transitions from the predicate “the dots are spread out” to “the spread is large” are

important steps in the formation of concepts.

Second cycle of growing a sample

With the feedback of the samples of ten data points in dot plots, students had to predict the

weight graph of a whole class of 27 students and of three classes with 67 students (27 and 67

were the sample sizes of the real data sets of eighth graders of another school.

c. We will now have a look how the graph changes with larger samples. Predict a sample of 27

students (one class) and of 67 students (three classes).

d. You now get to see real samples of those sizes. Describe the differences. You can use words

such as majority, outliers, spread, average.

During this second cycle, all of the students made dot plots, probably because the teacher had

shown dot plots on the blackboard, and because dot plots are less laborious to draw than value

bars (only one student started with a value-bar graph in the sample of 27 switched to a dot plot

for the sample of 67). The hint on statistical terms was added to make sure that students’ answers

would not be too superficial (as often happened before) and to stimulate them to use such notions

in their reasoning. It was also important for the research to know what these terms meant for

them (see transcripts of video fragment 4.1 at the end of the paper). When the teacher showed the

two graphs with real data, there was once again a short class discussion in which the teacher

capitalized on the question of why most student prediction now looked pretty much like what

was on the blackboard, whereas with the earlier predictions there was much more variation

(transcripts of video fragment 4.2). No student had a reasonable explanation, which indicates that

this was an advanced question. The written answers from the same three students were:

Rikkert: My spread is different.

Christel: Mine resembles the sample, but I have more people around a certain weight

and I do not really have outliers, because I have 10 about the 70 and 80 and

the real sample has only 6 around the 70 and 80.

Susan: With the 27 there are outliers and there is spread; with the 67 there are more

together and more around the average.

13

Figure 2a. Predicted graphs for one and for three classes by Rikkert, Christel, and Susan.

Figure 2b. Real data sets of size 27 and 67of students in another school.

Rikkert addressed the issue of spread. Christel was more explicit about a particular area in her

graph, namely the high values. She also correctly used the term “sample,” which was newly

introduced in the second lesson. Susan used the term “outliers” in this stage, by which students

meant “extreme values”. She also seemed to locate the average somewhere and to understand

that many students are about average. These examples illustrate that students used statistical

notions for describing properties of the data and diagrams. From a statistical point of view, these

Susan

Rikkert Christel

14

terms were not very precise. With “mean” students generally meant “about average” or “the

middle typical group”; with “spread” they meant “how far the data lie apart.” And with “sample”

they seemed to mean just a bunch of people, not necessarily the data as being representative for a

population (cf. Schwartz et al., 1998).

In contrast to the first cycle, students used nouns instead of just predicates for comparing the

diagrams. Rikkert (like others) used the noun “spread,” whereas students earlier used only

predicates such as “spread out.” Of course, this does not always imply that if students use these

nouns that they are thinking of the right concept; nor is it meant as a linguistic trick. Statistically,

however, it makes a difference whether we say, “the dots are spread out” or “the spread is large.”

In the latter case, spread is something that can have particular aggregate characteristics that can

be measured (for instance by the range, the interquartile range, or the standard deviation). Other

notions, outliers, sample, and average, are now used as nouns, that is as conceptual objects that

can be talked about and reasoned with.

Third cycle of growing a sample

In contrast to what was intended in the hypothetical learning trajectory, no student made a

continuous shape or talked about one yet. This was to change in this last cycle of growing the

sample, when the task was to make a graph showing data of all students in the city, not

necessarily with dots. The intention of asking this was to stimulate students to use continuous

shapes and dynamically relate samples to populations without making this distinction between

sample and population explicit yet. The conjecture was that this transition from a discrete

plurality of data values to a continuous entity of a distribution is important to foster a notion of

distribution as an object with which students could model data and describe aggregate properties

of data sets. During teaching experiments in the seventh-grade experiments (Bakker &

Gravemeijer, in press), in two American sixth-grade classes, and a visit to an American group of

ninth graders, reasoning with continuous shapes turned out difficult to accomplish, even if it was

asked for. It often seemed impossible to nudge students toward drawing the general, continuous

shape of data sets represented in dot plots. At best, students drew spiky lines just above the dots.

This underlines that students have to construct something new (a notion of signal, shape, or

distribution) with which they can look differently at the data or the variable phenomenon. The

task proceeded as follows:

15

e. Make a weight graph of a sample of all eighth graders in Utrecht. You need not draw dots. It

is the shape of the graph that is important.

f. Describe the shape of your graph and explain why you have drawn that shape.

Figure 3. Predicted graphs for all students in the city by Rikkert, Christel, and Susan.

The written explanations of the same three students were:

Rikkert: Because the average [values are] roughly between 50 and 60 kg.

Christel: I think it is a pyramid shape. I have drawn my graph like that because I found

it easy to make and easy to read.

Susan: Because most are around the average and there are outliers at 30 and 80 [kg].

Rikkert’s answer resembles that of students in seventh grade who indicated a range of the

average values or the majority. His answer focuses on the average group, or “modal clump” as

Konold and colleagues call such groups in the center (Konold et al., 2002). During an interview

after the fourth lesson, Rikkert literally called his graph a “bell shape,” though he had probably

not encountered that term in a school situation before (three other students also described their

graphs as bell shapes). This is probably a case of reinvention. Christel’s graph was probably

inspired by line graphs that the students made during mathematics lessons. She introduced the

vertical axis with frequency, though such graphs had not been used before in the statistics course.

Susan

Christel

Rikkert

16

Susan’s graph shows both dots and a continuous shape. It could well be that she started with the

dots and then drew the continuous shape.

In this third cycle of growing samples we see a first remark about shape as was asked for (an

example of guidance). The shapes that students proposed were pyramid (three students),

semicircle (one), and bell shape (four); 23 students drew a bump shape. Based on prior

experience that students were not inclined to make continuous sketches of distributions, it was a

pleasant surprise that students in this eighth-grade class drew continuous sketches during this

growing samples activity. This could support the sense of doing such an activity, though there

may be another reason why students in this class made continuous sketches. They had a better

mathematical background than the students in other experiments, for instance knowledge about

line graphs. If students draw continuous shapes, we do not exactly know what these shapes mean

for them. Therefore, in the next section, students’ reasoning with such shapes is analyzed.

Reasoning about shapes

In the fourth lesson, almost all student graphs looked roughly symmetrical, which is not

surprising when the history of distribution is taken into account (Steinbring, 1980). In real life,

however, the phenomenon of weight shows distributions that are skewed to the right because of a

“left wall effect” (two students had in fact drawn a left wall in the fourth lesson). By a left wall I

mean that the lower limit (say about 30 kg) is relatively close to the average (53 kg) and the

upper limit (sumo wrestlers are sometimes 350 kg) is relatively far away from the average. The

lower limit of 35 kg serves as a left wall, because adults can hardly live if they are lighter than 30

kg. This left wall in combination with no clear right wall causes the distribution to be skewed to

the right. Skewness is another important characteristic of a distribution and once there are

different shapes to talk about, for example symmetrical or skewed, students can characterize

shapes with different predicates. According to the hypothetical learning trajectory, skewness

therefore had to become a topic of discussion as well.

In collaboration with the teacher, the following activity was designed. To focus the students’

attention on shape and skewness, the three student shapes were drawn on the blackboard together

with two skewed shapes, which resulted in a pyramid, a semicircle, a bell shape, a unimodal

17

distribution that was skewed to the right, and one that was skewed to the left. Students had to

explain which shapes could not match the context of weight. The teacher expected that it would

be easier for students to engage in the discussion if they could argue which shapes were not

correct, instead of defending the shape they had chosen.

Figure 4. Five shapes as on the blackboard (1) semicircle, (2) pyramid, (3) normal distribution,

(4) distribution skewed to the right, (5) distribution skewed to the left.

The teacher chose students from the groups who thought that a particular shape on the

blackboard could not be right. For all shapes except the normal shape, many students raised their

hands. Apparently, most students expected a “normal” shape.

1. First, Ricardo explained why the semicircle (1) could not be right (video fragment 6.1).

Ricardo: Well, I thought that it was a strange shape (...) For example, I thought that the

average was about here [a little to the right of the middle] and I thought this

one [top of the hill] was a little too high. It has to be lower. And I thought that

here, that it was about 80, 90 [kg], and I don’t think that so many people

weigh that much or something [points at the height of the graph at the part of

the graph with higher values].

Teacher: (...) Does everybody agree with what Ricardo says?

Tobias: Yes, but I also had something else. That there are no outliers. That it is

straight and not that [he makes a gesture with two hands that looks like the

tails of a normal distribution]. I would expect that it would be more sloping if

it goes to the outside more [makes the same gesture in the air].

These students used statistical notions such as “outliers” and height to explain shape issues

(especially frequency). They clearly used their knowledge of the context to reason about shape.

18

2. Because all of the students seemed to agree that the semicircle was not the right shape, the

teacher wiped it off the blackboard and turned to the pyramid shape (2). This discussion involved

outliers and the mean in relation to shape (video fragment 6.2).

Maarten: Well, I thought this one not, because yes, I think that a graph cannot be so

square, or so rectangular.

Teacher: The graph is not so rectangular? [inviting him to say more]

Maarten: No, there are no outliers or stuff.

Sander: It does have outliers; right at the end of both it does have outliers.

Student2: That is just the bottom [of the graph].

Sander: At the end of the slanting line, there is an outlier, isn’t it? (...)

Annemarie: But the middle is the mean and everything else is outlier. [Other students

say they do not agree, e.g. Iris:]

Iris: Who says that the middle is the mean?

Annemarie: Yes, yes, roughly then.

Teacher: Tobias, you want to react.

Tobias: Look, if you have an outlier, then it has to go straight a bit [makes a

horizontal movement with his hands]; otherwise it would not be an outlier

(...) but that is not what I wanted to say. I wanted to react, that it [this

graph] could not be the right one, because the peak is too sharp and then

the mean would be too many of exactly the same.

Michiel: He just means that of one weight exactly all these kids have the same

weight, so if the tip is at, I don't know how many kilos, maybe 60 kilos,

that all these kids are exactly 60 kilos.

This transcript shows that students started to react to each other. Before this lesson they mainly

reacted to questions from the teacher—a type of interaction that is very common unfortunately

(van den Boer, 2003). In other words, the activity stimulated students to participate and their

passive attitude started to change. Because the students agreed that the pyramid was not the right

shape, the teacher wiped this shape off the blackboard also.

2 If we could not find out from the video who said this, we just put “student” as the speaker.

19

3. Next, Mariëlle was to explain why the bell shape (3) could not be the right shape (video

fragment 6.3). Before the discussion, almost all of the students thought this was the right shape

(one girl did not know).

Mariëlle: I had it that this was not the one, because there are also kids who are

overweight. Therefore, I thought that it should go a bit like this [draws the

right part a little more to the right, thus indicating a distribution skewed to the

right, like figure 4.4]. (...)

Rogier: That means that there are more kids much heavier, but there are also kids

much less, so the other side should also go like that [this would imply a

symmetrical graph].

Tobias: Guys, this is the right graph!

Because there was no agreement, the teacher did not wipe the graph off the board.

4. Next, Michiel had to explain why he thought that the fourth, skewed graph could not be right

(video fragment 6.4).

I thought that this was not it because... if the average is perhaps, if this it the highest point,

then this [part on the left] would be a little longer; then it would have a curve like there

[left half of the third graph]. I think that this cannot be right at all, and I also find it

strange that there are so many high outliers. Then you would maybe come to 120 kilos or

so. [Note that there were no numbers in the graphs.]

5. Last, Elaine about the fifth graph, which was skewed to the left (video fragment 6.5):

Well, I think this one is also wrong because there are more heavy people than light

people. And I think that eighth graders are more around 50 kilos. That’s it.

Tobias then objected, “it says 50 nowhere,” and a lively discussion between the two evolved.

Thus, as intended, skewness became a topic of discussion, even in relation to center and

“outliers.” Some students argued that the mean need not be the value in the middle. Still students

20

seemed to make no clear distinctions between midrange, mean, and mode. Because the mode is

not a measure that is often used in statistics, it was not the intention to address the mode unless

students were already reasoning with it. But since students at this point argued about the mean

versus the value that occurred the most, it was decided to introduce a name for the mode, which

these students had not learned before.

Researcher: The value that occurs the most often has a name also; it is called the mode

[pointing at the value where the distribution has its peak]. (...) Who can

explain in this graph [skewed to the right] whether the mean is higher or lower

than the mode? (...)

Rogier: There are just more heavy people than light people, and therefore the mean is

higher.

In this way, there were opportunities to introduce statistical terms and relate them to each other,

because students already talked about the corresponding concepts or informal precursors to them.

Traditionally, the mode is just introduced as the value that occurs the most, but here it was

introduced as a characteristic of a distribution, be it informally. The median was introduced in

the ninth lesson as the value that yields two equal groups. In fact, this introduction in relation to

continuous distributions was inspired by the history of the mode and median (Bakker, 2003;

Walker, 1931), because these notions were first defined in relation to distributions, not in relation

to data sets. Moreover, in my view the mode only makes sense in relation to continuous

distributions or for categorical variables.

The purpose of this activity of reasoning about shapes was that students would come to reason

about skewed distributions, and they did. They were even more engaged than expected. The

satisfactory thing about this activity was that they came to reason with notions in a way they had

not demonstrated before, and that they better engaged in the discussion than ever before, even

the students with low scores for mathematics. I conjecture that the lack of data, the game-like

character, and students’ knowledge about the context were important factors, but also the fact

that they had to argue against certain shapes. Such reasoning is safer than choosing the shape

they think is right and defending that one. I also conjecture that the lack of formal definitions

makes it easier for low-achieving students (such as Tobias) to engage in the discussion. This

issue can be illustrated with a metaphor that Frege, who is seen as one of the first modern

21

logicians and philosophers of language, wrote to Hilbert, who is seen as a formalist. The topic

was using and making symbols in mathematical discourse.

I would like to compare this with lignification [transformation into wood]. Where the tree

lives and grows, it must be soft and sappy. If, however, the sappiness does not lignify, the

tree cannot grow higher. If, on the contrary, all the green of the tree transforms into wood,

the growing stops. (Frege, 1895/1976, p. 59; translation from German3)

On the one hand, if statistical concepts are defined before students even have an intuitive idea of

what these concepts are for (such as mean, median, mode), then the tree transforms into wood

and students’ conceptual development is hindered. On the other hand, if teachers and textbooks

do not guide students well in a process of reinvention, the tree stays weak and cannot grow

higher. It is evident that the notions of average, outliers, distribution, and sample of students in

the present research needed to be developed into more precise notions, but at least they

developed a language that was meaningful to them, an image that could be sharpened later on or

a sappy part of the tree that can be lignified soon.

Implications for research, teaching, and assessment

In this discussion, two questions are raised. First, why do almost all school textbooks follow the

same routes? Second, what are the challenges of the approach taken in this paper for research,

teaching, and assessment?

Why do almost all school textbooks first introduce mean, median, and mode as a trinity, and

provide students with graphical tools such as histogram and box plot long before students have

the conceptual understanding to use such tools sensibly? G. Cobb (1993, nr. 53c) compared the

situation with a night picture of a city: “if one could superimpose maps of the routes taken by all

elementary books, the resulting picture would look much like a time-lapse night photograph of

car taillights all moving along the same busy highway.” Apart from the phenomenon of copying

what others do, one important reason could be that mean, median, mode, and graphs seems so

22

easy to teach and, even more importantly, to assess. The approach taken in this paper is much

harder for the teacher because notions stay informal for a while. The learning that results from

such an approach might also be harder to assess than whether students can calculate average

values or draw a histogram. In the approach taken here, the teacher has to accept that students’

notions stay informal for a while, but in my view this is better than building castles in the air. In

countries with a many tests, this might be difficult to accomplish (cf. Makar & Confrey, in

press). Research is needed to find out how and to what extent the approach propagated in this

paper is feasible. Moreover, research is needed into the question of how students can develop

their own informal notions, such as center clumps, spread, and shapes, into conventional

measures of center, variation, and other distribution aspects.

The hypothetical learning trajectory that is envisioned after the design study reported here is the

following. The main goal is still that students enhance their case-oriented views with aggregate

views on data sets and develop a coherent understanding of key concepts such as variability,

sampling, data, and distribution. In well-known contexts students can develop a view on data sets

as consisting of three groups of low, average, and high values. The middle group can be

informally called a clump, cluster, or majority and it can serve both as a measure of center

(mean, median, mode are generally in the modal clump) and as a measure of spread (some

clumps have a larger range than others). By such activities as growing samples reported in this

paper, students can be stimulated to reason about stable features within this process of sampling

(stabilizing clump or shape). If opportunities appear to introduce conventional definitions

(median, mode for example), teachers can take advantage of them. It is important to start with

case-value plots (Konold & Higgins, 2003) that are meaningful for students (bar graph, dot plot)

and wait with aggregate plots such as histogram and box plot until students have enough

conceptual basis. Once students have developed an understanding of center (e.g. by reasoning

with clumps), they can appreciate more formal measures of center such as mean and median.

When comparing distribution, the need can be acknowledged to use a convention of comparing

the middle 50% instead of various locally motivated proportions. Not until then, box plots can

function as meaningful tools in students’ reasoning about distributions. Further research is

needed to test this hypothetical learning trajectory and design instructional means supporting

such learning.

3 Ich möchte dieses [Symbolisieren] mit dem Verholzungsvorgange vergleichen. Wo der Baum lebt und wächst, muss er weich und saftig sein. Wenn aber das Saftige nicht mit der Zeit verholzte, könnte keine bedeutende Höhe

23

Acknowledgments

I thank Corine van den Boer for teaching the eighth-grade class, and Carolien de Zwart, Sofie

Goemans, and Yan-Wei Zhou for assisting in multiple ways. I also thank Nathalie Kuijpers for

translating the transcripts and correcting the English, and Jantien Smit, Anneleen Post, and Katie

Makar for their editing help. The research was funded by the Netherlands Organization for

Scientific Research under number 575-36-003B. The opinions expressed in this chapter do not

necessarily reflect the views of the Organization.

References

Bakker, A. (2002). From data via 'bump' to distribution. Statistics Education Research

Journal, (http://fehps.une.edu.au/serj), 1(1), 35.

Bakker, A. (2003). The early history of statistics and implications for education.

www.amstat.org/publications/jse/v11n1/bakker.html. Journal of Statistics Education,

11(1).

Bakker, A., & Gravemeijer, K. P. E. (in press). Learning to reason about distribution. In D.

Ben-Zvi & J. Garfield (Eds.), The challenge of developing statistical literacy,

reasoning, and thinking. Dordrecht, the Netherlands: Kluwer Academic Publishers.

Cobb, G. (1993). Considering Statistics Education: A National Science Foundation

Conference. Journal of Statistics Education [Online], 1(1),

www.amstat.org/publications/jse/v1n1/cobb.html

Cobb, P. (1999). Individual and collective mathematical development: The case of statistical

data analysis. Mathematical Thinking and Learning, 1(1), 5-43.

Cobb, P., Confrey, J., diSessa, A. A., Lehrer, R., & Schauble, L. (2003). Design experiments

in educational research. Educational Researcher, 32(1), 9-13.

Cobb, P., McClain, K., & Gravemeijer, K. P. E. (2003). Learning about statistical covariation.

Cognition and Instruction.

erreicht werden. Wenn dagegen alles Grüne verholzt ist, hört das Wachstum auf.

24

Cobb, P., & Whitenack, J. (1996). A method for conducting longitudinal analyses of

classroom videorecordings and transcripts. Educational Studies in Mathematics, 30(3),

213-228.

Frege, F. L. G. (1976). Wissenschaftlichter Briefwechsel [Scientific correspondence] (Gabriel,

G., Ed.) (First ed. Vol. 2). Hamburg, Germany: Meiner.

Freudenthal, H. (1991). Revisiting mathematics education: China lectures. Dordrecht, the

Netherlands: Kluwer Academic Publishers.

Garfield, J., & Ben-Zvi, D. (in press). Research on statistical literacy, reasoning, and thinking:

Issues, challenges, and implications. In D. Ben-Zvi & J. Garfield (Eds.), The challenge

of developing statistical literacy, reasoning, and thinking. Dordrecht, the Netherlands:

Kluwer Academic Publishers.

Glaser, B. G., & Strauss, A. L. (1967). The discovery of grounded theory; Strategies for

qualitative research. Chicago: Aldine Publishing Company.

Gravemeijer, K. P. E. (1994). Educational development and developmental research. Journal

for Research in Mathematics Education, 25, 443-471.

Gravemeijer, K. P. E. (1999). An instructional sequence of analysing univariate Data sets.

Paper presented at the AERA 1999. Unpublished manuscript, Montréal, Canada.

Hancock, C., Kaput, J. J., & Goldsmith, L. T. (1992). Authentic enquiry with data: critical

barriers to classroom implementation. Educational Psychologist, 27(3), 337-364.

Jacobs, V. R. (1999). How do students think about statistical sampling before instruction?

Mathematics Teaching in the Middle School, 5, 240-263.

Konold, C., & Higgins, T. (2003). Reasoning about data. In J. Kilpatrick & W. G. Martin &

D. Schifter (Eds.), A research companion to Principles and Standards for School

Mathematics (pp. 193-215). Reston, VA: National Council of Teachers of

Mathematics.

Konold, C., Robinson, A., Khalil, K., Pollatsek, A., Well, A. D., Wing, R., & Mayr, S. (2002).

Students' use of modal clumps to summarize data. In B. Phillips (Ed.), Proceedings of

the International Conference on Teaching Statistics [CD-ROM], Cape Town.

Voorburg, the Netherlands: International Statistics Institute.

Latour, B. (1990). Drawing things together. In M. Lynch & S. Woolgar (Eds.), Representation

in scientific practice (pp. 19-68). Cambridge, MA: MIT Press.

Lehrer, R., & Romberg, T. (1996). Exploring children's data modeling. Cognition and

Instruction, 14(1), 69-108.

25

Lehrer, R., & Schauble, L. (2001). Accounting for contingency in design experiments. Paper

for AERA, April 2001, Seattle, WA.Unpublished manuscript, Madison, WI.

Makar, K., & Confrey, J. (in press). Secondary Teachers’ Reasoning about Comparing Two

Groups. In D. Ben-Zvi & J. Garfield (Eds.), The Challenges of Developing Statistical

Literacy, Reasoning, and Thinking. Dordrecht, the Netherlands: Kluwer Academic

Publisher.

McGatha, M., Cobb, P., & McClain, K. (2002). An analysis of students' initial statistical

understanding: Developing a conjectured learning trajectory. Journal of Mathematical

Behavior, 21, 339-355.

Meletiou, M. (2002). Conceptions of variation: A literature review. Statistics Education

Research Journal (http://fehps.une.edu.au/serj), 1(1), 46-52.

Mokros, J., & Russell, S. J. (1995). Children's concepts of average and representativeness.

Journal for Research in Mathematics Education, 26, 20-39.

Moore, D. S. (1997). New pedagogy and new content: The case for statistics. International

Statistical Review, 65, 123-165.

Noss, R., Pozzi, S., & Hoyles, C. (1999). Touching epistomologies: Meanings of average and

variation in nursing practice. Educational Studies in Mathematics, 40, 25-51, 1999.

Peirce, C. S. (1976). The new elements of mathematics (Eisele, C., Ed.) (Vol. I-IV). The

Hague-Paris/Atlantic Highlands, N.J.: Mouton/Humanities Press.

Petrosino, A. J., Lehrer, R., & Schauble, L. (2003). Structuring error and experimental

variation as distribution in the fourth grade. Mathematical Thinking and Learning,

5(2&3), 131-156.

Reading, C., & Shaughnessy, J. M. (2000). Student perceptions of variation in a sampling

situation. In T. Nakahara & H. Koyama (Eds.), Proceedings of the 24th Conference of

the International Group for the Psychology of Mathematics Education (Vol. 4, pp. 89-

96). Hiroshima, Japan: Department of Mathematics Education, Hiroshima University.

Reading, C., & Shaughnessy, J. M. (in press). Reasoning about variation. In D. Ben-Zvi & J.

Garfield (Eds.), The challenge of developing statistical literacy, reasoning, and

thinking. Dordrecht, the Netherlands: Kluwer Academic Publishers.

Roth, W.-M. (1996). Where is the context in contextual word problems? Mathematical

practices and products in grade 8 students' answers to story problems. Cognition and

Instruction, 14, 487-527.

26

Rubin, A., Bruce, B., & Tenney, Y. (1990). Learning about sampling: Trouble at the core of

statistics. Paper presented at the Third International Conference on Teaching Statistics,

Dunedin, New Zealand.

Russell, S. J., & Corwin, R. B. (1989). Statistics: The shape of the data. Used numbers: Real

data in the classroom. Grades 4-6. Washington, DC: National Science Foundation.

Schwartz, D. L., Goldman, S. R., Vye, N. J., Barron, B. J., & Vanderbilt, C. a. T. G. a. (1998).

Aligning everyday and mathematical reasoning: The case of sampling assumptions. In

S. P. Lajoie (Ed.), Reflections on statistics: Learning, teaching, and assessment in

grades K-12 (First ed., pp. 233-273). Mahwah, NJ: Lawrence Erlbaum Associates.

Shaughnessy, J. M., Garfield, J., & Greer, B. (1996). Data handling. In A. J. Bishop & K.

Clements & C. Keitel & J. Kilpatrick & C. Laborde (Eds.), International handbook of

mathematics education (pp. 205-237). Dordrecht, the Netherlands: Kluwer.

Shaughnessy, J. M., Watson, J. M., Moritz, J., & Reading, C. (1999). School mathematics

students' acknowledgment of statistical variation. NCTM Research Presession

Symposium: There's more to life than centers. Paper presented at the 77th Annual

NCTM Conference, San Francisco, California.Unpublished manuscript.

Simon, M. A. (1995). Reconstructing mathematics pedagogy from a constructivist

perspective. Journal for Research in Mathematics Education, 26, 114-145.

Steinbring, H. (1980). Zur Entwicklung des Wahrscheinlichkeitsbegriffs - Das

Anwendungsproblem in der Wahrscheinlichkeitstheorie aus didaktischer sicht [On the

development of the probability concept --The applicability problem in probability

theory from a didactical perspective. Bielefeld: Institut für Didaktik der Mathematik

der Universität Bielefeld.

van den Boer, C. (2003). Als je begrijpt wat ik bedoel. Een zoektocht naar verklaringen voor

achterblijvende prestaties van allochtone leerlingen in het wiskundeonderwijs [If you

know what I mean. A search for an explanation of lagging results of mathematics

education among ethnic minority students] CD Beta Press: Utrecht, the Netherlands.

van Oers, B. (2000). The appropriation of mathematics symbols: A psychosemiotic approach

to mathematics learning. In P. Cobb & E. Yackel & K. McClain (Eds.), Symbolizing

and communicating in mathematics classrooms; Perspectivies on discourse, tools, and

instructional design (pp. 133-176). Mahwah, NJ: Lawrence Erlbaum Associates.

Walker, H. M. (1931). Studies in the history of statistical methods with special reference to

certain educational problems. Baltimore: Williams & Wilkins Company.

27

Watson, J. M. (2002). Creating cognitive conflict in a controlled research setting: Sampling.

In B. Phillips (Ed.), Developing a Statistically Literate Society; Proceedings of the

Sixth International Conference of Teaching Statistics Cape Town [CD-ROM].

Voorburg, the Netherlands: International Statistics Institute.

Watson, J. M., & Moritz, J. (2000). Developing concepts of sampling. Journal for Research in

Mathematics Education, 31, 44-70.

Wild, C. J., & Pfannkuch, M. (1999). Statistical thinking in empirical enquiry. International

Statistics Review, 67, 223-265.

Wilensky, U. (1997). What is normal anyway? Therapy for epistemological anxiety.

Educational Studies in Mathematics, 33, 171-202.

Zawojewski, J. S., & Shaughnessy, J. M. (2000). Mean and Median: Are They Really so

Easy? Mathematics Teaching in the Middle School, 5(7), 436-440.

Translations of transcripts

Lesson 4 on growing samples

Video fragment 4.1: students drawing graphs plus two interviews [43.30 to 49.32]

Interv: Why do you have that shape, Joshua? [dotplot of 67 students]

Joshua: Because it’s easier that way to see what the average of 67 children is

[This answer is typical; it occurs quite often.]

Interv: Can you see more than just the mean? You’re talking about the mean

now, but I can see a lot more.

Joshua: Yes, there are outliers in there too, that they don’t all weigh the same.

Interv: And, eh, I can see, the top is more to the left than to the right, right?

Joshua: Yes, because around average children [makes a more-or-less gesture] are

something like around 50 kilos, in eighth grade. So that would be the

highest, that’s where most of the dots are.

Interv: Yes, but why isn’t there more to the left or the right or in the middle?

Joshua: I think most will be between 50 and 60 , if you measure it [looks around]

I think most are around 50 and 60.

Video fragment 4.2: discussion after second cycle [49.46 to 53.13]

[Lynn draws while teacher starts discussion]

28

Teacher: Tobias, what was yours like? Do they look like that?

Tobias: It’s almost the same, see. [It was remarkable how good their predictions

were.]

Teacher: That’s really quite remarkable, isn’t it, that, well, not with Rikkert, but for

most say: ‘Here there was some variation and here they begin to match.’

Does anyone have an explanation for it? Rikkert, how would you explain

it?

Rikkert: Well, if there are a lot more, then uh it’ll be spread out more, you could

say.

Teacher: You think it will be more spread out when there are more.

Rikkert: Yes, they will all be from 50 to somewhere up at 80, will be a lot of

students, because there are so many

Teacher: We had been talking about this one, many students said that this one

looked a lot like the one they drew. How could it be that we all found so

many different ones, and here, that looks a bit like it for everyone. What

would you think of that?

Niels: Well because, see, because. Yes, many are a bit in the middle. Also

around 60 and 50 there are many, with like, for the weight. So if you do

more, you will get more and more.

Teacher: Elaine, do you have anything else?

Elaine: Well, uh, between those 3 you have there, there can be small people, light

people or something. And when you have a large group, you have more

small people and more people who are a bit heavier, so then it would be

.... [is talking softer and softer] [good argument which I expected to

appear: wider range?]

Teacher: So it would be more noticeable or something like that, is that what you

mean?

Elaine: Yes, it doesn’t get noticed now, because there are more between 40 and

60.

Teacher: Good, we have heard all kinds of arguments now about comparing these

things. You will now continue with the problem. We have 10 minutes

left. You go to d again, describe the difference. The same you had to do

29

for question b and then you will do e, make a weight graph of the sample

of all eighth grade classes in Utrecht. What might that look like?

You don’t have to draw dots now of course. It’s about the SHAPE

(emphasis) of the graph on? Do you understand what they mean? What

would the shape be like here?

Student: Pyramid, something like that.

Lesson 6 Reasoning about shapes

The teacher’s explanation of the activity

Teacher: Now we will look at the homework. There were 5 graphs and then it said:

Which of these do you think matches the weight of many eighth graders

the best? So I’m going to turn that question around. If one is the best

match according to you, that means four don’t have the best match.

Yes? That’s logical, isn’t it?

Who says this isn’t the best match, it certainly isn’t this one. [many

students raise their hands] [the semicircle] Ricardo.

Who says it certainly isn’t this one? [the pyramid] Maarten.

Who says it certainly isn’t this [the normal (symmetrical) distribution]

Marielle.

Who thinks it certainly isn’t this one? [with the hump on the left] Michiel.

And who thinks it certainly isn’t this one [with the hump on the right]

Elaine.

We’ll talk them through one by one, Ricardo will come over here with

this graph. And what is the plan? Ricardo will explain to you why this

certainly isn’t the right one. After he’s finished his explanation, and you

can do this in any way you like, you have to convince the class that this

isn’t the right one. Because I didn’t see everyone raise their hands. When

you are done, I will ask the class who agrees with you. When someone

says ‘no, I don’t agree with him,’ you will sit down and the other person

will come to the front. Yes? We’ll do all five graphs like that. So the

others already know ‘I’ll have to think of good arguments.’

Give it a try; this is difficult, because we’ve never done something like

this before.

30

Video fragment 6.1: semicircle [25.18 to 26.41]

Ricardo: Well, I thought that it was a strange shape and I thought the third one was

the best (...)

Teacher:

No, I said to turn it around, you can’t say which one you thought was the

best, but why is it not this one? What’s wrong with it?

Ricardo: For example, I thought that the average was about here [a little to the right

of the middle] and I thought this one [top of the hill] was a little too high.

It has to be lower. And I thought that here, that it was about 80, 90 [kg],

and I don’t think that so many people weigh that much or something

[points at the height of the graph at the part of the graph with higher

values].

Teacher: Don’t look at me, you don’t have to convince me [makes a gesture to the

class] (...) Does everybody agree with what Ricardo says?

Tobias: Yes, but I also had something else. That there are no outliers. That it is

straight and not that [he makes a gesture with two hands that looks like

the tails of a normal distribution]. I would expect that it would be more

sloping if it goes to the outside more [makes the same gesture in the air].

Teacher: Ha, Tobias says more outliers. We’ll take these two arguments together.

Ricardo’s story and Tobias’s story. Are here any more people who still

say ‘but I can still think of something that could be wrong. Anyone who

says: ‘But it could still be the right one’ [someone says no] Okay, we’ll

get to that [?] Excellent

Video fragments 6.2: pyramid shape [27.17 to 28.00; 28.20 to 29.12; 29.39 to 31.08]

Maarten: Well, I thought this one not, because yes, I think that a graph cannot be so

square, or so rectangular.

Teacher: The graph is not so rectangular? [inviting him to say more]

Maarten: No, there are no outliers or things like that.

Sander: Yes, there are

Maarten: Like here [?]

Sander: Yes, there are.

Teacher: It is, a distribution is never that rectangular, you say, and I don’t see any

31

outliers. Raise your hands please, because I think there are people who

want to react. Sander, when Maarten said there are no outliers, you said

there were.

Maarten: No, there are no outliers or stuff.

Sander: It does have outliers; right at the end of both it does have outliers.

Teacher: What’s an outlier.

Sander: One that/ [interrupted]

Student: That is just the bottom [of the graph].

Sander: At the end of the slanting line, there is an outlier, isn’t it? (...)

Maarten: But that’s all, yes, do you have an outlier at the bottom in the middle.

Sander: [sigh]

Maarten: Well, you come to the front and explain it then.

Teacher: Let’s have a look, Annemarie.

Annemarie: It does have outliers, because that point is the mean and everything that’s

there. The outsides are the outliers.

Student b: Yes, Maarten...

Tobias: No, nooooo, because when you suddenly...

Teacher: So you’re saying outliers are in the middle, they don’t have to be on the

outside at all [class in confusion].

Annemarie: This is the mean, and if you go more to the [?]/

Teacher: I think you’re working really well here, but I say who gets to speak or it’ll

be a mess. Annemarie can finish her story and then Tobias gets to react

first, and then Maarten will defend his position, or not.

Annemarie: But the middle is the mean and everything else is outlier. [Other students

say they do not agree, e.g. Iris:]

Iris: Who says that the middle is the mean?

Annemarie: Yes, yes, roughly then. [A number of students make it quite clear that

they don’t agree with Annemarie by calling out loud ‘yeah right’ and

similar comments. No clear arguments can be discerned as yet.]

Teacher: Tobias, it’s your turn to reply. Anyone else who wants to react, raise your

hand, or no one will be able to follow what is going on

Tobias: Look, if you have an outlier, then it has to go straight a bit [makes a

horizontal movement with his hands]; otherwise it would not be an

32

outlier/

Student: Two.

Tobias: (...) but that is not what I wanted to say. I wanted to react, that it [this

graph] could not be the right one, because the peak is completely sharp

and then the mean would be too many of exactly the same, or something.

Michiel: So what he means is that all the children at one weight all weigh exactly

the same, so when the peak is at who knows how many kilos [he makes

an upward gesture], maybe 60 kilos, immediately all those children weigh

exactly 60 kilos.

Teacher: Does anyone have something to add to this? Anyone who says it could

still be the right one? Well done.

Video fragments 6.3: normal shape [31.20 to 31.55]

Mariëlle: I had it that this was not the one, because there are also kids who have

overweight. Therefore, I thought that it should go a bit like this [draws the

right part a little more to the right, thus indicating a distribution skewed to

the right, like figure 4.4]. (...)

Teacher: Why don’t you draw roughly how it should be with another color?

Rogier: That means that there are more kids much heavier, but there are also kids

much lighter so the other side should also go like that [this would imply a

symmetrical graph].

Tobias: Guys, this is the right graph! [general agreement from the group to

Tobias’ comment]

Teacher: Who agrees with Marielle that this might not be the right one? [little

reaction]

Teacher: So I won’t wipe it off the blackboard yet. It’s a bit doubtful, isn’t it.

Rogier: Doubt, I just think it’s the right one. [people are calling “doubt???”]

Student: It IS the right one.

Video fragments 6.4: skewed to the right [32.40 to 33.06; 35.32 to 36.X; 37.40 to 39.20]

Teacher: Does anyone want to react to this? Or are you saying ‘no, that is

completely wrong?’ [pause] Everyone agrees.

Researcher: Michiel, maybe you want to put in the numbers, what you think it should

33

be.

Michiel: What do you mean?

Researcher: You said 120, where the average is and things like that. [the group calls

out all kinds of suggestions and numbers. Michiel’s numbers are

remarkably close to reality]

Teacher: Okay, the numbers are there now.

Teacher: Michiel, are you done? You believe you have convinced the class.

Everybody agrees that this might not be the right one? By the way, what

is going on with that kind of peak?

Rikkert: Yes, the mean, that most people weigh that much.

Teacher: Now you’re saying two different things. You’re saying mean, and you’re

saying that most people weigh that much.

Student z: That isn’t the mean.

Student q: Yes, it is. [discussion about whether or not it is the mean, impossible to

follow]

Teacher: Okay, back to raising your hands. Take a breather. Is this the mean or

not? Rogier.

Rogier: No, it’s just, it’s just what most people weigh.

Michiel: You calculate the mean by just taking all the people, adding their weight

and divide that by the number of people.

Teacher: And it doesn’t necessarily have to be this? [Points at the highest point in

the graph; Teacher is in a hurry, is looking at the clock, even appears to

find the discussion not as valuable as researcher does, taking too long,

loss of concentration?]

Boy: No, it could be a bit more or less.

Researcher: When the mean and what occurs most are different, in the fourth graph.

Would the average be higher or lower?

Student: Lower.

Class: [the others all call ‘higher’] Higher.

Researcher: So some are saying that the mean is something else than that which occurs

most. The value that occurs the most has a name also; it is called the

34

mode [pointing at the value where the distribution has its peak]. (...)

Teacher: Mode, is what occurs most [‘modus’ in Dutch]. And you can remember

that because there is an English word in it. What word does it resemble?

Class: Modern.

Teacher: Most [the class is not enthusiastic about this] So now we will use the

word mode every time. Mode is the weight that occurs most. So when you

say that most eighth graders weigh exactly 48 kilos. You’re saying that 48

kilos is the mode.

Researcher: So how do you see most are here anyway?

Maarten: That’s where it’s highest.

Researcher: So the most are where the graph is highest. My question is, I’ll repeat it, is

the mean higher or lower than the mode. Who says higher [most students

raised their hands for higher], who says lower [no one for lower. So there

were some hands missing].

Teacher: Who doesn’t know? [one student admits]

Researcher: Who can explain in this graph [skewed to the right] why the mean is

higher or lower than the mode? (...)

Anthony: Most of what comes after, it’s more than is on the left, on the low side. So

there are more people with a higher weight than with a lower weight.

Researcher: And why would the mean be higher?

Anthony: Because uh. There are more people so you do [...]

Researcher: Does anyone understand what Anthony means? [general laughter from

the class, because no one understood] Ah, there must be some. Can

anyone repeat it in their own words?

Rogier: There are just more heavy people than light people, and therefore the

mean is higher.

Researcher: In that graph you mean? [graph with clump on the left, right-skewed]

Rogier: Yes.

Researcher: Who understands what Rogier is saying?

Tobias: Me!

Researcher: Who doesn’t [no hands?].

35

Video fragments 6.5: skewed to the left [39.48 to 41.15; 41.45 to 42.26]

Elaine: Well, I think this one is also wrong, because there are more heavy people

than light people. And I think that eighth graders are more around 50

kilos. That’s it. [Elaine goes back to her place.]

Teacher: Wait, I don’t know if there are people who want to react to this. Does

everyone agree with Elaine?

Tobias: I didn’t even hear what she said, it was a bit short, but I think I know why

that one isn’t the right one

Teacher: Wait wait wait, first Elaine again. Elaine, could you repeat it?

Elaine: I think it’s not that one because there are more heavy people, that one has

more heavy people and I think most eighth graders would weigh around

50.

Tobias But it doesn’t say anywhere that that isn’t 50.

Researcher: Yes, which numbers do you think go with this? Write it down briefly.

[While Elaine is busy on the blackboard, Tobias has a discussion with

someone. The class is fairly noisy anyway.]

Tobias: But they can also be a short distance.

Student: What do you mean, a short distance?

Tobias: Well, it could be between 54 and 64. Then it would be possible.

[There is a vehement reaction to this, but it is inaudible because

everybody is talking at once. Teacher stops the discussion and gives the

last word to Tobias.]

Tobias: Now, look, if it’s a whole for example, if there in the beginning, it’s 40 or

whatever and there at the end is 60. That there isn’t that much distance

between. Then it could be right.

Elaine: Yes, but look, there it goes suddenly, here it would be 40 and suddenly it

would be 50 here.

Tobias: Yes, because a lot of people weigh 50.

Elaine: Yes, but then it suddenly goes mmmm [sound of a racing car], all the way

to 50 here and then suddenly 60.

Tobias: Yes, because many weigh a lot less than a few.

Elaine: Yes, but that isn’t possible.

Boy: That’s impossible, because 50 goes to 60 very fast and that’s impossible.

36

Tobias: Well, it could be 70 as well, I don’t know. But I’m not saying it’s right.

reasoning about shape as a pattern in variability arthur ... · 3 need to develop in order to...

Documents