native and non-native vowel contrasts in early second language … · 2012-12-05 · the critical...
TRANSCRIPT
Ghent University
Faculty of Arts and Philosophy
English Linguistics Department
Academic year 2011-2012
Native and non-native vowel contrasts in early
second language phonology: The production of the English vowel contrasts
/ɪ/-/ɛ/-/æ/ and /ʊ/-/u:/ by young native speakers of Dutch
Esther Holleman
Supervisor:
Dr. Ellen Simon
Master dissertation submitted
in partial fulfilment of the
requirements for the degree of
Master in Linguistics and
Literature: English – Swedish
1
Acknowledgments
The writing of this research paper would not have been possible without the encouragement
of many people. First of all, I would like to show my utmost gratitude to my supervisor, Dr.
Ellen Simon, for her constant support and guidance. Her insightful comments contributed
immensely to the completion of this research paper. I would also like to thank her for
conducting the experiment, without which this paper could not have been written. I am also
truly thankful to the children who participated in this study and to their school, without whose
corporation I would not have required the necessary data. Special thanks also to my friends
for their continuous support. Finally, I would like to express my gratitude to my parents and
sister for their help and understanding through the duration of my studies.
2
Contents
Acknowledgments ..................................................................................................................... 1
Contents ..................................................................................................................................... 2
Introduction .............................................................................................................................. 4
1 Age and second language acquisition .................................................................................. 7
1.1 Critical Period Hypothesis ............................................................................................... 7
1.2 Remarks on the Critical Period Hypothesis ..................................................................... 9
1.3 Conclusion ...................................................................................................................... 11
2 Second language phonology ................................................................................................ 13
2.1 Factors influencing L2 phonology ................................................................................. 13
2.2 The role of the native language in second language phonology .................................... 15
2.2.1 Early vs. more recent research on L2 phonology .................................................... 16
2.2.2 Flege’s Speech Learning Model .............................................................................. 17
2.2.2.1 Flege’s Equivalence Theory ............................................................................. 17
2.2.2.2 Remarks on Flege’s Equivalence Theory ......................................................... 19
2.2.2.3 Younger vs. older learners in Flege’s Interaction Hypothesis ......................... 21
2.3 The role of universal principles ...................................................................................... 22
3 The role of instruction on second language phonology .................................................... 24
3.1 The Integrated Approach ................................................................................................ 24
3.2 Explicit Instruction ......................................................................................................... 25
3.3 Form Focused Approach ................................................................................................ 26
3.4 Comparison of the different approaches ........................................................................ 27
3.5 The role of language awareness, input and output in pronunciation instruction ............ 28
3.6 Conclusion ...................................................................................................................... 29
4 Vowels ................................................................................................................................... 30
4.1 Vowel analysis ............................................................................................................... 30
4.2 Duration .......................................................................................................................... 34
4.3 The vowels of English .................................................................................................... 35
4.4 The vowels of Dutch ...................................................................................................... 37
4.5 Predicting difficulties for Dutch-speaking learners of English ...................................... 39
3
5 Case study ............................................................................................................................ 41
5.1 General aim and procedure ............................................................................................ 41
5.2 Methodology .................................................................................................................. 42
5.2.1 Participants .............................................................................................................. 42
5.2.2 Design and stimuli ................................................................................................... 43
5.2.2.1 Session 1 ........................................................................................................... 43
5.2.2.2 Session 2 ........................................................................................................... 46
5.3 Coding and analysis ....................................................................................................... 47
5.4 Hypotheses ..................................................................................................................... 48
6 Results .................................................................................................................................. 51
6.1 Dutch .............................................................................................................................. 51
6.2 English ............................................................................................................................ 57
6.2.1 Repetition vs. picture-naming task .......................................................................... 58
6.2.2 Pre-instruction vs. post-instruction vowels ............................................................. 61
6.2.3 Intra- and interspeaking variation............................................................................ 64
6.2.4 Native speakers ....................................................................................................... 65
7 Discussion ............................................................................................................................. 68
7.1 Differences and similarities between the L1 and L2 ...................................................... 68
7.1.1 The front vowels ...................................................................................................... 69
7.1.2 The back vowels ...................................................................................................... 77
7.2 The influence of age ....................................................................................................... 84
7.3 Effects of instruction ...................................................................................................... 86
Conclusion ............................................................................................................................... 91
References ............................................................................................................................... 93
Appendices .............................................................................................................................. 99
4
Introduction
The accuracy and speech in which the phonology of a foreign language can be acquired
depends on a number of different factors, such as the age of the learner, the learner’s native
language (henceforth L1), and the amount of experience in the new language (henceforth L2)
the learner has. For instance, in a study by Højen and Flege (2006) on the discrimination of
English vowel contrasts by native speakers of Spanish, it was revealed that mainly the early
learners were able to successfully perceive the differences between the foreign vowels.
Nevertheless, Højen et al. concluded that, besides early acquisition, other factors may also be
of significance in second language acquisition, as not all young learners showed similar high
results.
This research paper will focus mainly on three subjects in the field of native and non-
native L2 vowel acquisition.
Firstly, the question whether vowel production in the beginning stages of L2 learning
benefits from early exposure will be addressed. In many studies it has been argued that solely
young learners are able to achieve an accurate L2 phonology level, while late learners will
inevitably maintain a foreign accent (Munro 2008). Next to the ability to acquire an L2 more
accurately, it is also claimed that young learners can learn an L2 more rapidly than older
learners (Tsukada et al. 2004). These claims are mainly based on the critical period
hypothesis, which suggests that language learning must be accomplished before a certain age.
Whether this critical period hypothesis can be applied on L2 acquisition is, however, subject
to more debate (Ioup 2008). Some studies have, for example, provided evidence that even
young learners have to be continually exposed to the L2 in order to attain a high level of
proficiency (Bongaerts et al. 1997).
Moreover, it is suggested in Flege’s (1995) Speech Learning Model that accurate L2
phonology acquisition is not entirely impossible for late learners. The difficulty a learner has
in acquiring a particular phoneme depends rather on whether this phoneme is similar or
dissimilar from the phonemes found in the learner’s L1 than on age of acquisition. This
suggestion is mostly of significance for the second matter in this research paper, which deals
with the influence of the L1 sound system on that of the L2.
The fact that the L1 plays an important role in the pronunciation of the L2 sounds is
widely agreed upon, but the issue of how this influence manifests itself is less certain, as
many different suggestions have been made with respect to which sounds are most
5
problematic for L2 learners (Eckman 2004). In the majority of studies, it is nevertheless
assumed that L2 learners will generally be able to produce new L2 vowels (i.e., vowels which
do not occur in the L1) more accurately than similar L2 vowels (i.e., vowels which have a
close equivalent in the L1). To this end, this research paper will examine the vowel contrasts
/ɪ/-/ε/-/æ/ and /ʊ/-/u:/, which contain both new and similar vowels.
A third subject which will be touched upon in this study is the question whether L2
learners’ L2 phonology may benefit from instruction. In order to form an answer to this issue
several different approaches to L2 phonology instruction will be compared, such as the
integrated and form focused approach, which each have led to some positive results.
However, it should be remarked that in general little evidence seems to indicate that
instruction has a direct effect on a learner’s L2 pronunciation (Piske et al. 2001), although
some other studies have underlined the findings that both focused phonology instruction and
immersion may be beneficial for L2 phonology acquisition (Piske et al. 2001; Lord 2010).
These subjects lead to the following research questions which will be discussed in this
research paper.
The first question posed is whether young second language learners are indeed capable
of rapidly acquiring foreign vowel sounds accurately, or if early exposure does not have any
significant effects yet on L2 vowel production at the onset of learning.
The second question concerns the different vowel contrasts, in particular /ɪ/-/ε/-/æ/ and
/ʊ/-/u:/. Are beginning L1 Dutch learners of English able to contrast these vowels from one
another both in terms of their acoustic properties and in terms of their duration? If so, are the
contrasts made native-like, or rather influenced by the learners’ L1? In addition, do the
productions of the learners follow regular patterns, or do they show intra- or interspeaker
variation?
The third question regards the role of instruction in L2 phonology. Does instruction
have some effect on young learners’ L2 production, even when it does not specifically focus
on phonology, but more on vocabulary and grammar?
To be able to discuss these questions the data from an experiment conducted by Dr.
Ellen Simon were analysed and examined. In total, ten children participated in the
experiment, all of which attended the same primary school where they received a small
amount of English instruction. For the experiment a Dutch picture-matching task, followed by
an English repetition and picture-naming task were carried out at two different sessions. The
first session was held before the participants had received any English instruction, while the
second session was held after the participants had received a total of ten hours of instruction.
6
Outline
The first chapter of this paper will give a general overview of the relationship between second
language acquisition and age. The focus will mainly lie on the discussion of the critical period
hypothesis, and the presentation of previous findings both supporting and opposing the
suggestion that early exposure is paramount for accurate second language acquisition.
The second chapter will focus more specifically on which factors influence second
language phonology. In particular, the role of the first language sound system on the second
language sound system will be discussed, followed by a brief discussion of universal language
principles.
The third chapter will provide a comparison of three different approaches to
phonology instruction, to be precise the integrated, explicit and form focused approach will be
compared. Furthermore, this chapter will also discuss the role of language awareness, and the
role of input and output in second language instruction.
The fourth chapter will deal with the qualitative and quantitative aspects of vowels,
both with respect to how vowels are analysed and with respect to the characteristics of
English and Dutch vowels. This chapter also discusses the pronunciation difficulties which
are expected for Dutch-speaking learners of English.
The fifth chapter is concerned with the construction of the experiment, the
methodology used and its participants, and also discusses the hypotheses for this research
paper, which are based on the findings from the previous chapters.
The sixth chapter will present the results obtained from the experiment.
The last chapter will discuss the results as presented in the previous chapter in relation
to the hypotheses made in the fifth chapter.
7
Chapter 1. Age and second language acquisition
In the field of second language acquisition, many researchers claim that early learning leads to
better results than late learning, since it has been observed that L2 learners who were exposed
to the L2 in childhood often have higher levels of L2 proficiency than L2 learners who
learned the L2 in late adolescence or adulthood (Høyen & Flege 2006). This observation is
often associated to the critical period hypothesis (henceforth CPH), which suggests that
learning a new language accurately after a certain age becomes virtually impossible
(Marinova-Todd et al. 2000). However, many other researchers do not attach any believe to
the critical period hypothesis, but rather suggest that young learners perform better because of
other factors, such as, for instance, education (Binghadeer 2010).
1.1 Critical Period Hypothesis
According to Marinova-Todd et al. (2000), the majority of psycholinguists agree that a critical
period exists for L1 acquisition. The term critical period (henceforth CP) can be defined as
the period in which it is relatively easy to acquire a language completely. It is widely believed
that the critical period also applies for second language acquisition (henceforth SLA) and that
younger second language learners consequently have an advantage over older second
language learners. It is accordingly hypothesized that children are able to acquire an L2 easier
and more rapidly, and are additionally more likely to achieve a native-like level, whereas
older adolescents and adults have more difficulty in acquiring the L2 and generally maintain a
foreign accent. Tsukada et al. (2005) note that children can learn an L2 “quickly,
automatically, effortlessly, and to a level indistinguishable from that of native speakers”
(264). The general belief in the CPH is that native-like L2 acquisition can solely be achieved
if the learner is continually exposed to the L2 before the age of 6 years (Anderson 2004).
The idea of a critical period was first proposed by Lenneberg (1967; in Bongaerts et
al. 1997), who stated that after the beginning of puberty language acquisition becomes
extremely difficult due to biological and neurological reorganizations in the brain which occur
around the onset of puberty. He suggested that the language-processing abilities in the brain
become firmly localized in the left hemisphere. According to Lenneberg (1967; in Bongaerts
et al.,1997), this brain lateralization makes ultimate attainment in first or second language
learning virtually impossible (Marinova-Todd et al. 2000). Others have assigned the CP to a
8
loss of neural plasticity during puberty. This brain maturation, which makes the brain less
“flexible”, makes it similarly less suitable for the acquisition of languages. The exact age at
which these changes in the brain occur is, however, subject to much debate. Krashen (1973; in
Marinova-Todd 2000) suggested that lateralization already takes place around the age of 5,
whereas Scovel (1988, 2000; in Tsukada et al. 2005) placed the CP around the age of 12.
Patkowski (1988; as cited in Bongaerts 1997) likewise places the end of the CP between ages
of 12 and 15. Long (1990; as cited in Bongaerts 1997), however, states that it is impossible
for most language learners to acquire a native-like accent after the age of 6, although some
can acquire it until the age of 12. For Ioup (2008), L2 learners are regarded as ‘young’ before
the age of 8 and ‘old’ after the age of 16. Ioup nevertheless also observes that there is still
much uncertainty on whether the difficulty of language learning after a certain age is in fact
caused by neurobiological changes.
Since it is difficult to decide on an exact closing age for the CP and since it is not sure
whether L2 acquisition is even impossible after the CP, Lamendelle (1997; in Marinova-Todd
2000) introduced the notion of a sensitive period which suggests an optimal period for
language learning, but does not exclude the possibility of language learning after puberty. For
Mack (2003; in Ioup 2008), the idea of a critical period implies that the L2 learner can no
longer acquire the language completely when the CP ends, whereas the idea of a sensitive
period indicates that partial acquisition is theoretically still possible after puberty.
There is much evidence in support of the critical and sensitive period, indicating that it
is relatively likely that younger learners can achieve higher levels of L2 proficiency than older
learners. Munro et al. (1996) studied the effect of age on the production of English vowels by
a group of native speakers of Italian living in Canada at the time of the study. Their age of
arrival (henceforth AOA) in Canada ranged from 2 to 23 years old. Although some of the
older learners had lived in Canada for 32 years, none of them managed to achieve a native-
like level. Additionally, Munro et al. (1996) indicated that none of the English vowels were
completely acquired by the late-arriving group. By contrast, the production of the vowels by
the early-arriving learners was often perceived as unaccented by a group of native listeners.
Tsukada et al. (2005) similarly found that children had a higher level of correct L2 production
and perception than adults. They examined a group of native Korean early and late learners of
English living in North America and compared the results of their production and perception
tests to those of native speakers of English. As opposed to the results of Munro et al. (1996),
the native Korean children still differed from the native speakers of English in that their
accent was not regarded as unaccented or native-like. However, their level of L2 proficiency
9
was higher than that of the adult L2 learners, who failed to produce and discriminate the
English vowels correctly. Flege et al. (1999) also examined the effect of AOA in the United
States in a group of native speakers of Korean, who were all experienced in English. This
study revealed that AOA affected the proficiency level of both the pronunciation and
morphosyntax of the L2 learners.
It should be noted that many researchers agree that the CP solely influences the level
of pronunciation of L2 learners and not the level of morphosyntax. Whether an L2 learner can
acquire a native-like level of morphosyntax depends less on the amount of education (s)he
receives and on the degree of L1 use than for phonology acquisition, both of which are, as a
rule, higher for children than for adult learners (Flege et al. 1999). Other explanations for the
greater influence of age on the level of phonology than on morphosyntax are, among others,
the earlier completion of phonological learning than morphosyntactic learning in the L1
(Bever 1981; in Flege et al. 1999), the greater difficulty for L2 learners to separate their
different phonological systems than separate their different morphosyntactic systems (Cook
1992; in Flege et al. 1999) and the “greater overall perceived similarity of phonological
structures in the L1 and L2 than of corresponding morphosyntactic structures” (Felix 1980,
Ioup 1984, MacWhinney 1987; in Flege et al. 1999: 98). In addition, Scovel (1988; in Ioup
2008) indicated that pronunciation is controlled by neuromuscular movements for the
controlling of which age plays a key role. Wong and MacWhinney (2009) even state that,
based on earlier research (for example, Swain 1981, Ervin-Tripp 1974, Bongaerts 1999; in
Wong et al. 2009), “there is no reason for parents or educators to insist that children must
obligatory begin L2 learning at a very early age”, with the exception of phonological learning.
1.2 Remarks on the Critical Period Hypothesis
Other studies contradict these findings in support of the CP and argue that adult learners
possess the same ability of learning an L2 as children. Snow and Hoefnagel-Höhle (1978)
tested the CPH (i.e. that solely before puberty SLA will be relatively fast and successful,
leading to a similar proficiency level as native speakers) by studying the evolution of L2
learning of Dutch at three different intervals by five age groups ( 3-5, 6-7, 8-10, 12-15 years
and adults), who were all native speakers of English. The results revealed that the group of
12-15-years-olds performed best in all the tests, both phonologically and morphosyntactically,
while the youngest group scored worst. Snow et al. (1978) conclude from these findings that
10
the CPH cannot be maintained, since the age groups which were in the “optimal period” did
not display a faster or better language acquisition than the two older groups. In a study by
MacLeod and Stoel-Gammon (2010), simultaneous bilinguals, who had learned both
languages before the age of three, were compared to sequential bilinguals, who learned their
second language when they were 8-12 years old. Both groups used both languages to the same
extent and were equally proficient. In the study, the bilinguals’ production of the stop
consonants and vowels was studied in order to examine to what extent age plays a role in L2
production. Contrary to the presupposed hypothesis that the simultaneous bilinguals would be
better in making a distinction between similar sounds in both languages than the sequential
bilinguals, the results showed that both groups produced the consonants and vowels equally
well. As in the study by Snow et al. (1978), the CPH was not supported here, since both
groups had the same level of pronunciation in both languages.
Marinova-Todd et al. (2000) also question whether a critical period exists for L2
learning and argue that most evidence in support of the CPH is based on three fallacies:
misinterpretation, misattribution and misemphasis.
With misinterpretation Marinova-Todd et al. (2000) refer to the fact that many
researchers only take into account the final result of language acquisition by children and
from these results draw the conclusion that they can learn a language quickly and without
difficulties, whereas many studies have shown that in reality adolescents and adults are more
successful than younger learners in the early stages of language learning (Snow 1978, Rivera
1988; in Marinova-Todd et al. 2000). A second form of misinterpretation often made is based
on the assumption that the ability of language learning is associated with the ability of
problem solving. Once children would have acquired this last ability, the ability of language
learning would equally become impossible (Long,1990; in Marinova-Todd et al. 2000). A
study by Bialystok and Hakuta (1994; in Marinova-Todd et al. 2000), however, showed that
only after the age of 20 language learning becomes more difficult, which implies that no
correlation exists between the abilities of language learning and problem solving. It should
nevertheless be noted that Bialystok et al. examined proficiency in L2 grammar which is, as
already mentioned, less heavily influenced by age than accent.
The second fallacy concerning age and L2 learning discussed by Marinova-Todd et al.
(2000) is misattribution, which denotes the false assumption that the localization of language
in the brain affects the proficiency level of the speaker. For younger learners both the L1 and
L2 are assumed to be located in the same area in the brain, whereas the L1 and L2 of older
learners are located in different areas in the brain. (Kim et al. 1997; in Marinova-Todd 2000).
11
Marinova-Todd et al., however, dismiss this idea as irrelevant for the question whether late
learners can acquire an L2 successfully, since the study of Kim et al. did not take into account
the proficiency level of the participants. Late learners, who are proficient in their L2, would
display similar brain localization as early learners. Moreover, Marinova-Todd et al. (2000)
also doubt whether the maturation of the brain can be associated with the higher L2
proficiency levels of children, since neuroscientists cannot explain the precise relationship
between these two factors.
Lastly, Marinova-Todd et al. argue that most studies on the differences between early
and late L2 learners leave out those late L2 learners who have attained a native-like level and
solely emphasize the less successful learners. Adult learners should, however, not be regarded
as a single homogeneous group, but rather as a group in which the L2 learners vary greatly
from each other, as opposed to young learners who often show a more similar level of L2
proficiency.
1.3 Conclusion
These findings suggest that native-like achievement in language learning is not entirely
impossible after puberty and that some late learners eventually acquire an L2 accurately.
Flege et al. (1997) in the same line claim that “the capacity to learn new forms of speech
remains intact over the life span” (438). Nevertheless, although early learners are often not
indistinguishable from L1 speakers, they often do have an advantage over late learners. Two
main explanations may be offered for why L2 learning is easier for children with regards to
foreign pronunciation learning. Firstly, children learn a new language in a different manner
than adults. Children learn an L2 more directly, since they are “naive and inexperienced
learner[s]” (7), whereas adults tend to learn an L2 in terms of their L1 and of what they
already know about language in general (Pennington 1996). Secondly, it is argued that
“children may be better equipped than adults to imitate and reproduce speech as they hear it”
(Pennington 1996: 7), as they tend to react to those elements of the L2 which are most salient.
Adults on the other hand tend to learn an L2 from a deeper level.
In addition, Marinova-Todd et al. (2000) found in their study that the ability to learn
an L2 decreases slowly and not abruptly as the CPH suggests. The reasons they offer for this
decrease is the influence of the L1 which becomes stronger as age increases, which is most
clear in the onset of L2 learning, since the L2 is then still more dependent on the L1 system.
12
Furthermore, it is likely that better language learning is only associated with age, because of
social, educational and motivational factors which influence L2 proficiency (Binghadeer
2010).
13
Chapter 2. Second language phonology
With respect to second language phonology, there is a general consensus that one of the most
significant factors influencing a learner’s L2 pronunciation is his/her native language (Baker
et al. 2002). However, how the L1 affects the L2 productions is still subject to much debate,
as not all researchers agree on which sounds form the greatest difficulties for L2 learners
(Major 2008). Furthermore, other factors besides the L1 and age (as discussed in Chaper 1),
seem to influence learners’ L2 productions too, such as length or residence, motivation and
language use (Piske et al. 2001).
2.1 Factors influencing L2 phonology
In a study on the perception of L2 vowels by young learners, Højen et al. (2006) found that
few of the participants were able to perform on a native-like level, which indicates that early
exposure alone is not necessarily a key to success in L2 learning. In other words, age does not
guarantee a native-like performance of the L2 in the area of sound perception and production.
Even simultaneous bilingual children do not always have an accurate pronunciation of the
sounds of their non-dominant language (Mack et al. 1995; in Ioup 2008). Højen et al. (2006)
suggest that besides learning an L2 as a child, other factors may be important as well in
acquiring the language sufficiently. One of the first studies on other factors which may affect
L2 proficiency was carried out by Purcell and Suter (1980; in Ioup 2008). They found that the
L1, aptitude for oral mimicry, residency and attitude are of most significance in L2
pronunciation learning (Ioup 2008). Piske et al. (2001) similarly indicate that for accurate L2
pronunciation other factors besides age may be of influence, some of which will be discussed
here.
A first factor Piske et al. (2001) suggest is the length of residence (henceforth LOR) of
the L2 learner, which they define as “the number of years spent in a community where the L2
is the predominant language” (197). There is some criticism on the use of LOR as a factor
affecting foreign accent, since a long LOR is not always necessarily correlated with much L2
experience. Moreover, studies on LOR seem to lead to conflicting results: some have indeed
found effects of LOR on L2 pronunciation (e.g., Asher & Garcia 1969, Purcell & Suter 1980,
Flege & Fletcher 1992; in Piske et al. 2001), while others have not (e.g., Oyama 1976, Tahta
et al. 1981, Flege 1988; in Piske et al. 2001). According to Flege (1988; in Piske et al. 2001),
14
a possible explanation for this discrepancy is the fact that LOR has no longer any influence on
late L2 learners, whereas the pronunciation of younger learners is more easily influenced by
LOR. The factor LOR is therefore not as significant as age in L2 learning, or is, at least,
dependent on age.
Another factor that, according to Piske et al. (2001), possibly contributes to a higher
level of accurate L2 pronunciation is the motivation of the learner to achieve a native-like
accent. Bongaerts et al. (1997) note that “if it is of vital importance to [learners] to sound like
a native speaker of the language they are learning, it is possible that they will attain a native-
like accent, in spite of a late start” (449). However, similarly to the previous factor discussed,
studies on the influence of motivation have led to conflicting results. In those studies which
proposed that motivation may be of significance in L2 learning, there was a correlation found
between ‘professional motivation’ and the learner’s degree of foreign accent. ‘Professional
motivation’ here indicates that it is essential for the L2 learner to sound as native-like as
possible either because of studies or work. In the study by Binghadeer (2010), for instance,
the participants were all university students who would become English teachers. After a
period of L2 training, the students achieved better results in L2 production. Binghadeer
associates these positive effects with both the L2 input given and the ‘professional
motivation’ of the participants. Nevertheless, ‘professional motivation’ alone does not
necessarily lead to accurate pronunciation. Moreover, Piske et al. (2001) also remark that it is
difficult to ‘measure’ motivation. It is therefore often quite problematic to compare the results
of the different participants.
A last factor discussed by Piske et al. (2001) is the amount of influence language use
has on the degree of foreign accent. Most studies have shown that frequent use of the L2 does
not necessarily improve the learner’s L2 pronunciation. Cebrian (2006), for example,
examined the hypothesis that adult learners who have extensive L2 experience can produce
L2 sounds more accurately, but found that the experienced adult learners did not show a
greater ability in discriminating between the different L2 sounds. Flege et al. (1997), on the
other hand, noted that their study on the production of L2 vowels by adults proved that “adults
who learn a second language will come to produce and perceive certain vowels in their L2
more accurately as they gain experience in the L2” (466). In a study on the difference between
similar vowels which are articulated either close to or distant from a new vowel by Bohn and
Flege (1992), it was suggested that the influence of experience differs in both kinds of vowels.
According to their study, those vowels which benefit from experience are usually new L2
vowels, which do not have an equivalent in the learner’s L1. Baker and Trofimovich (2005),
15
finally, claim that experience in the L2 affects the L2 pronunciation of early learners more
than the pronunciation of late learners. Additionally, many studies have shown that especially
the amount of L1 use of the learner exerts influence on the degree of L2 accent. The more an
L2 learner continues to use his/her L1 frequently, the more likely it is (s)he will have a
stronger L2 accent (e.g., Flege et al. 1999b, Flege et al. 1995; in Piske et al. 2001). An
explanation for this finding is that greater L1 use often implies less L2 use and thus less
experience (Flege and MacKay 2004; in Munro & Derwing 2008).
Also MacLeod et al. (2010) propose that besides age other factors are involved in L2
learning, such as competence in and use of both languages. As already mentioned in Chapter
1, they investigated in their study if simultaneous and sequential bilinguals still differed in
their production of stop consonants and vowels if both groups used both languages equally
and both displayed a high level of proficiency. Their results revealed that there were indeed
no significant differences between the two groups of bilinguals. Next to use of the L1 and L2,
MacLeod et al. emphasize that the social value of the languages can influence the proficiency
of L2 learners as well. The participants in this study spoke both French and English fluently
and lived in Canada, where both languages have a high social value. In this case, the high
social value of the languages can be linked to motivation.
Although there are still many debates on the influence of the above discussed factors
on L2 learning, it is not unreasonable to assume that age in itself is not a guarantee for
accurate L2 learning, especially in the field of phonology. Moreover, it has been shown that it
is not entirely impossible for older learners to acquire an L2 accurately. Early learners do
have a clear advantage over older learners, since they can learn the L2 quicker and better, but
during the early stages of L2 learning early and late learners will often perform similarly. In
most cases, late learners are even more successful (Anderson 2004). Marinova-Todd et al.
(2000) also claim that the sole reason age affects language learning is because “it is associated
with social, psychological, educational, and other factors that can affect L2 proficiency” (28),
as was also suggested in Chapter 1.
2.2 The role of the native language in second language phonology
In the area of L2 phonology, there is a general consensus that the L1 influences both the
perception and production of L2 sounds, resulting in a foreign accent. Many researchers have
remarked that L2 learners frequently replace L2 phones with phones occurring in their L1.
16
The sounds produced are neither identical to the sounds of the L1 or the L2, but are a
combination of the values of both or fall in between both pronunciations (Lord 2010). A
foreign accent can also appear when the L2 learner inserts or omits phones or replaces one
phone with another (Munro 2008). These problems arise especially frequently when the L2
sound has a corresponding sound in the learner’s L1, leading to the misidentification of the L2
sound by the learner (Flege & Hillenbrand 1984). Moreover, phonology seems more open to
this type of influence than other areas of L2 acquisition. A study by Ioup (1984; as cited in
Eckman 2004) revealed that “native speakers of English can identify different groups of non-
native speakers on the basis of pronunciation but are not able to do so reliably on the basis of
only written or syntactic evidence” (515). This also implies that different native languages
lead to different problems in L2 perception and production.
2.2.1 Early vs. more recent research on L2 phonology
O’Brien & Smith (2010) mention that there is a discrepancy between earlier and more recent
research on L2 pronunciation. Whereas earlier research perceived foreign accents as the result
of direct transfer from the L1 to the L2, more recent research has shown that the influence of
the L1 on the L2 is often not as direct as first thought. Major (2008) notes that most
researchers tend to follow the hypothesis that dissimilar sounds are easier to learn than similar
sounds, thus following the theory of direct L1-to-L2-transfer. This can be explained by the
fact that those L2 sounds which are located further away from the sounds in the L1 are more
noticeable for learners than those sounds which are similar. Consequently, learners only
attempt to acquire those L2 sounds in which they can perceive a difference from their L1.
According to Eckman et al. (2003) there are three different situations of phonemic
contrast between the L1 and L2, which influence the language acquisition of the L2 learner. In
the first situation (i), neither of the contrasting phonemes of the L2 exist in the L1. In the
second situation (ii), the L1 has one of the phonemes which contrast in the L2. Lastly, in the
third situation (iii), the different phonemes, which contrast in the L2, also occur in the L1, but
do not contrast. According to Eckman et al. (2003), following from these situations several
instances concerning relative learning difficulty can be hypothesized:
(i) [those cases] where the native language has neither of the phonemes in question,
would be more difficult than (ii) [those cases] where the native language has one of
17
the relevant phonemes, [which] (ii), in turn, would be more problematic than (iii)
[those cases] where the native language has both of the phones in question, albeit as
allophones of the same phone.
(Eckman et al. 2003: 171)
In other words, the situations where the L1 and L2 differ the most from each other are
estimated as the most difficult to acquire. Lado (1957; in Eckman 2004), on the other hand,
emphasized the fact that errors develop in the L2 phonology when the sound systems between
the two languages differ only minimally. He claimed that learners face the greatest difficulties
when their L1 has two or more allophones for different phonemes in the L2. For instance,
Lado stated that for Spanish-speaking learners of English the sounds [d] and [ð], which are
allophones of /d/ in Spanish, but are in contrast to each other in English, are the most difficult
to acquire. Hamerly (1982; in Eckman 2004) similarly found that L2 learners regard
allophones as the most problematic element in L2 pronunciation learning. The learners in his
study had the most difficulties with suppressing their L1 allophones when they spoke the L2.
Also producing L1 allophones which are distributed differently in the L2 and producing L2
allophones which do not occur in the L1 were encountered with great difficulty.
O’Brien et al. (2010) too remark that researchers more recently have claimed that
those sounds between the L1 and L2 which are relatively alike cause more difficulties for L2
learners, because they generally do not perceive a difference between the similar sounds and
do therefore not make an effort to acquire those specific sounds, as they will assume they can
already pronounce them correctly. It is in this last case that transfer can occur, which means
that the L1 sometimes indirectly influences the perception and production of L2 sounds.
2.2.2 Flege’s Speech Learning Model
2.2.2.1 Flege’s Equivalence Theory
Similarly, this theory is further elaborated in Flege’s ‘equivalence theory’ (1992, 1995; as
cited in Major 2008), which forms a tenet of his Speech Learning Model (henceforth SLM).
One of the hypotheses of the SLM is that L2 learners can add new sound categories for both
‘similar’ and ‘new’ sounds until the age of 5-6 years, but are no longer capable of establishing
sound categories for ‘similar’ sounds when they are older (Flege 1997). The difference
18
between new and similar sounds in the L2 is, according to Flege (1997), that both vary in
degree of dissimilarity from the L1 both qualitatively and quantitatively. The equivalence
theory then is based on the idea that similar sounds are more difficult to acquire and can
therefore cause transfer. The theory maintains that L2 learners will analyse the similar L2
sounds as L1 sounds and, as a result, L2 learners will also pronounce them as L1 sounds.
In addition, Flege (1995, 2003; in O’Brien et al. 2010) also hypothesizes that the L2
learner will more rapidly create new sound categories for those L1 and L2 vowels which
differ greatly from each other, since bilinguals generally attempt to separate their L1 and L2
sounds in their combined phonetic space, so that they do not overlap. A study conducted by
Flege & Hillenbrand (1984) on the vowel production of American English-speaking learners
of French supports this equivalence theory. The L2 learners of French could pronounce the
phoneme /y/, which is a ‘foreign’ sound for English speakers, relatively accurately, but had
more difficulties with the French phoneme /u/, which sounds similar to the English phoneme
/u/.
Again, similar to the proposition of Eckman et al. (2003), there are three different
situations which can occur between the sounds of the L1 and L2.
In the first case, the L2 sound is ‘new’ for the L2 learner, indicating that the learner’s
L1 has no equivalent for the L2 sound. Flege (1997) predicts that most late learners will
encounter difficulties with the production of these sounds in the early stages of learning, but
will eventually master them. O’Brien et al. (2010) caution, however, that this does not
necessarily imply a native-like performance for late learners. Although they found in their
study on the production of German vowels by speakers of English that the participants were
able to create a separate category for the ‘new’ German vowel /y:/, they also found that their
productions still differed from those of the native German speakers. For younger learners, on
the other hand, new sounds will be easier, since the sounds in their phonetic system are not
yet established as firmly as with older learners. These new sounds can nevertheless also
undergo “category assimilation”, which means that the L2 learner will produce the unfamiliar
L2 sounds as L1 sounds (O’Brien et al. 2010).
In the second situation, the sounds between the L1 and L2 are ‘identical’, having the
same phonetic values. These sounds are therefore generally not very problematic for L2
learners (Collins & Mees 1984; in Flege 1997). This situation does however not occur very
frequently and Keys (2002) rightly questions whether it is even realistic “to assume an exact
phonemic match across languages” (43).
19
In the third and last situation described by Flege (1997), the L1 and L2 phonemes are
‘similar’ to each other. Here L2 learners mostly fail to perceive the small differences between
the L1 and L2 sounds and subsequently fail to produce the L2 sounds in a native-like way.
It should be noted that this theory presupposes that production follows perception in
language acquisition. Ioup (2008) observes that Flege’s SLM theorizes that “the mechanisms
needed to produce new sounds remain intact, but as with the other models [i.e. Best’s (1994)
‘Perceptual Assimilation Model’1 and Kuhl’s (1992) ‘Native Language Magnet Model’
2], it is
perception that changes with development” (49). To perceive and produce L2 sounds
accurately, the L2 learner has to be able to detect the differences between the L1 and L2 even
when they are similar (Baker et al. 2002). For Flege (1995; in Ioup 2008), younger learners
are still able to analyse the L2 independently from their L1, whereas older learners use their
L1 as a reference. Perkell et al. (2004) likewise hypothesize that perception influences
production and that those people who are capable of discriminating correctly between the
different L2 sounds will also be capable of producing those contrasting L2 phonemes
correctly. When speakers can detect the subtle differences between the L2 sounds, they will
learn how to produce these sounds, even when these are pronounced closer to or further apart
from each other than in their L1. When an L2 learner, on the other hand, has poorer
perception abilities, his/her L2 production will suffer from it accordingly. Trubetzkoy (1969;
in Flege 1997) and Logan (1989; in Flege 1997) remarked that learners will solely perceive
those sounds which are relevant in their L1 and which are important for distinguishing
between meaning differences in their L1. Consequently, L2 learners will only produce those
L2 sounds which they are able to detect.
2.2.2.2 Remarks on Flege’s Equivalence Theory
Concerning the difficulty of learnability of specific L2 sounds, Munro et al. (1996) make two
observations. First, they argue that it is not possible or desirable to generalize the inequalities
in learnability as “vowel x, but not vowel y, is readily learnable” (326). In their study on the
vowel productions of Italian learners of English, they found that none of the participants was
1 This model combines both elements from perception and production. At the onset of language acquisition the
L1 sounds will become categorized when the child has acquired their correct pronunciation. L2 sounds, then,
will be assigned to these L1 categories, based on similarities (Ioup 2008). 2 According to this model, the infant learner creates phonetic prototypes, which can be defined as “idealized
representations of phonetic categories”. These prototypes form a link between perception and production and
function for L2 learners as “a magnet forcing the learner to perceive the [closest] new sound as the prototype”
(Ioup 2008: 49).
20
able to produce the vowels accurately, but rather that they displayed individual levels of
success for each vowel. The second observation made by Munro et al. (1996) is that no
straightforward explanation can be given for why certain vowels need less effort to acquire
than others. According to Munro et al. (1996), the absence of a certain vowel in the L1 sound
system did not influence its production in the L2. Moreover, neither do the values of the
vowel productions in the L1 and L2 affect its pronunciation. For instance, the hypothesis that
the Italian vowels similar to the English ones are more difficult to produce than the dissimilar
vowels was not supported in this study. Therefore, it should be noted that not all research
supports the hypothesis that similar sounds are more difficult to acquire than dissimilar
sounds. A study by Bohn et al. (1992), for instance, revealed that certain German-speaking
learners of English could pronounce the similar sounds better than the dissimilar sounds.
Moreover, the Similarity Differential Rate Hypothesis proposes that “dissimilar
phenomena are acquired at faster rates than similar phenomena”, rather than being less
difficult than similar sounds (Major & Kim 1996; as cited in Major 2008: 74). Eckman (2004)
explains this hypothesis with the argument that beginning and advanced learners do not
encounter the same problems in L2 learning: for beginning learners, whose L2 is still subject
to much L1 transfer, similar sounds seem relatively easy to use. On the other hand, for
advanced learners, who are more aware of the differences between the similar sounds, this is
no longer the case.
In support of the findings that some learners do pronounce similar sounds better, it
should also be noted that L2 learners whose L1 resembles the sounds patterns of the L2 are
more likely to have a more accurate pronunciation than L2 learners whose L1 phonological
system differs strongly from the L2 system (Ioup 2008). The size of the vowel inventory of
the two languages might have an influence as well on L2 acquisition: according to Munro et
al. (2008) those learners whose L1 vowel system has a similar size as that of as their L2
vowel system have an advantage over those learners whose L1 and L2 vowel systems differ to
a greater extent in size, especially if the L1 vowel inventory is smaller than the L2 vowel
inventory.
21
2.2.2.3 Younger vs. older learners in Flege’s Interaction Hypothesis
Flege’s (1995) Interaction Hypothesis (henceforth IH) is a part of his SLM, which accounts
for how language learners acquire the different sound segments of the L2. According to the
IH, the differences between early and late learners can be explained by the interaction that
occurs between the learners’ L1 and L2 sound system. In younger learners, this interaction is
smaller than in older learners, because the development of the younger learners’ L1 has not
been completed yet. The sound categories in younger learners are not fixed yet and can still be
adapted, which enables them to learn foreign sounds more easily than older learners who have
fixed sound categories. Similarly, Baker and Trofimovich (2005) argue that “early bilinguals
may be more likely than late bilinguals to restructure their L1 phonetic system” (22). In other
words, the degree of interaction between the L1 and L2 depends on the degree in which the
L1 of the language learners has developed. In addition, the SLM also suggests that “children’s
L1 sound categories are less powerful attractors of L2 sounds than those of adults” (Baker et
al. 2002; 41), since children less often perceive L2 sounds in terms of L1 sounds.
Furthermore, the fact that the L1 sound categories of younger learners are still
developing also affects the direction of influence between the L1 and L2. For younger
learners the influence of the L1 and L2 is mostly bidirectional, which implies that not only
does the L1 exert influence on the L2, but the L2 also influences the sounds of the learner’s
L1. For older learners this influence is mostly unidirectional L1-to-L2, because older learners’
L1 phonetic system has become less flexible and is consequently less easily influenced by a
foreign language (Baker et al. 2005). It should be noted that according to Baker et al. (2008)
the SLM as intended by Flege does not apply for beginning learners, because the differences
between early and late learners in the initial stages of learning differ from the differences
between these two groups in later stages, as already mentioned above.
Baker et al. (2008) consider two different tenets of the IH, which will be discussed
here, mainly based on Holleman (2011).
The first tenet highlights the fact that the L1 categories in fact develop rather slowly.
Baker et al. (2008) argue that already early on in life – around 10 months – children tend to
prefer those sounds which occur in their L1, but this preference to those consonants and
vowels develops fairly slowly and continues even into adolescence. Especially between the
ages of 6 and 12 years children seem to become more skilled in discriminating between the
L1 sounds than earlier in childhood (Hazan & Barrett 2000; in Baker et al. 2008). However,
there remains a difference between children and adults in terms of accuracy in L1 perception
22
and production. It is the increasing capacity to discriminate between the L1 phonemes that
diminishes the capacity of L2 learners to correctly perceive and produce those sounds which
do not occur in their L1.
The second tenet suggests that because of the development of the L1 categories,
leading to more ‘robust’ sound categories, older adolescents and adults are better able to
discriminate and produce L1 sounds accurately, but have more difficulties in discriminating
those phonetic sounds which do not occur in their L1 and will additionally analyse L2 sounds
in terms of L1 sounds. By contrast, children, whose L1 sound categories are still developing,
can better differentiate between unfamiliar sound differences. Flege (1997) notes that it is
easier for adolescents and adults than for children to classify different L1 sounds under non-
ideal listening conditions (such as with background noise), because their sound categories are
more firmly set and the categories’ “tolerance limits [...] expand” (14). The downside of this
being that these fixed categories make it more difficult to discriminate between L1 and L2
sounds in L2 learning.
In the study by Baker et al. (2008) on the interaction between L1 and L2 sound
systems in early and late bilinguals, these tenets of the IH were partially confirmed. Baker et
al. (2008) examined both the perception and production of vowels by early and late Korean
learners of English. The results revealed that the children did not outperform the adults in the
perception of the L2 vowels, but did produce two of the four vowels more accurately than the
adults. The two vowels which were produced accurately were those English vowels which are
absent in Korean (/ɪ/, /ʊ/), indicating that children are better in discriminating those sounds
which do not appear in their L1 sound system. This study therefore supported the hypothesis
that there is a correlation between age and the ability to accurately perceive and produce L2
sounds which are not present in the L1 or sounds which are similar, and that this ability is
associated with the state of development of the learner’s L1 sound system.
2.3 The role of universal principles
Another L2 phonology model is Major’s Ontogeny Phylogeny Model (2001; in Major 2008)
(henceforth OPM). This model was originally designed for adult L2 phonology, but was
applied on child L2 learning by Anderson (2004). According to Major (2008), L2 speech
consists of three different elements: the L1, the L2 and ‘universal principles’. Illustrations of
these universal principles are, for instance, markedness (e.g., voiced obstruents in initial
23
position are acquired before obstruents in final position) and hypercorrection (e.g., the
exaggeration of the pronunciation of American English /r/ by certain L2 learners) (Major
2008). Each of these three elements which affect L2 speech exerts more influence during a
particular stage in the L2 acquisition. In the model as developed for adult L2 learners, the L1
will exert more influence in the early stages of learning, while the L2 and the universal
principles will be of stronger influence in the later stages. For children, however, Anderson
(2004) suggests that the universal principles will in general play a more significant role in the
acquisition process, since the L1 sound system of children is not fully established yet and can
therefore not yet function as a starting point for language learning.
Anderson (2004) specifically examined the phonological L2 acquisition by young
children via immersion. Her study led to three conclusions: (i) there is transfer of the sound
systems from the L1 and L2, (ii) this transfer is unidirectional from the L1 to the L2,3 and (iii)
the children separate the phonological systems of their L1 and L2 from one another. At the
onset of her study she suggested two different outcomes for child L2 acquisition: either their
L1 may affect the development of their L2 (because the children already have established
their L1 sound system), or universal patterns of development, together with elements from the
L2, will influence their language acquisition. A previous study by Piper (1984; in Anderson
2004) had already revealed that the L2 guides the development of the early learners: the
participants in the study all had different L1 backgrounds, but showed similar productions of
the L2. In the study by Anderson (2004), similar results were found. Furthermore, Anderson
notes that the older children were more proficient in L2 learning, possibly drawing on their
previously acquired knowledge of their L1 phonological system. However, in general, all
young learners in this study appeared to acquire the L2 relatively effortlessly in the immersion
context. The author assumes that the L1 primarily exerted influence on the articulatory level
of the L2, implying that similar L2 sounds were influenced by the L1 sounds. The influence
on the phonological level, which is defined as “the level of rules that govern how sounds are
sequenced and used” (Anderson 2004: 199), however, was not as high. Anderson concludes
that, because of the still ongoing development of the children’s L1 sound system, the
universal principles were of stronger influence than the L1 on the L2 acquisition.
3 Note that this conclusion is not in line with Flege’s (1995) Interaction Hypothesis, as discussed in Section
2.2.2.3.
24
Chapter 3. The role of instruction on second
language phonology
In the structural approach to teaching, to obtain accurate L2 skills was paramount and was
estimated as more important than achieving fluency, and this translated itself into an emphasis
on repetition in the courses and an intolerant attitude towards any L1 influence entering the
L2 speech (Isaacs 2009). Because of the stress that was put on accuracy, much attention was
devoted to pronunciation teaching. Later, however, when both the naturalistic approach to
teaching and the communicative language teaching were introduced, pronunciation was
regarded as a part of linguistics rather than communicative competence, and therefore no
longer seen as important. It was thought that even without any pronunciation teaching, the
students would achieve a sufficient level of L2 pronunciation through input received in class.
Therefore, pronunciation was often disregarded in the communicative language courses,
although it plays a central role in the success of communication (Isaacs 2009). Another
problem in the field of pronunciation instruction is that teachers seem to “see the rare learner
who achieves a native-like accent as an achievable ideal, not an exception” (Levis 2005; as
cited in Saito 2011: 55), although Piske et al. (2001) have observed that there is no evidence
that L2 teaching is a guarantee for a native-like accent. This aim, set by the L2 teachers, may,
according to Field (2005; in Saito 2011), be seen as an obstacle for many language students.
The following sections will introduce some of the several different approaches to
pronunciation teaching which can be distinguished.
3.1 The Integrated Approach
A first approach is the integrated approach as discussed by Wong et al. (2009). They
underline the idea that language teachers should organize their courses in relation to how
language is acquired and should also adjust the learning activities so that the children can
more easily learn the foreign speech sounds. In order to do so, teachers have to create “a
language-rich environment” (Wong et al. 2009: 20) where the children receive the
opportunity to use the L2 (in this case English) both passively and actively.
The motive behind this idea is that this learning environment appeals more to the
children’s interest and gives them more motivation. Wong et al. based this approach on
Bredekamp and Rosegrant’s (1995; in Wong et al. 2009) concept of integration, which
25
proposes that children will learn more efficiently when the subjects taught are integrated with
their personal, social and pre-academic experiences of languages, mathematics and science,
and creative subjects. In classrooms these activities are in general already applied for learning
L2 vocabulary and grammar, but here they serve to improve pronunciation.
To illustrate this, Wong et al. offer some articulation activities which can be used in
this approach, such as “circle time”, where teachers focus on the articulation of the individual
speech sounds by means of a story or pictures. It is paramount here that the children do not
notice that they are in fact practising their pronunciation. Other activities are, for instance,
asking certain questions to which the children need to answer with a sentence containing the
particular sounds, using arts and crafts, music and movement, game, and mathematics and
science.
3.2 Explicit Instruction
A second approach to pronunciation is the explicit instruction, which DeKeyser (2003; in
Saito 2011) explains as an instructional treatment in which “rule explanation forms part of the
instruction (i.e., deduction) or [in which] learners are asked to attend to particular forms and
try to find the rules themselves (i.e., induction)” (46). Possible advantages of explicit
pronunciation teaching are that students will become more aware of their own pronunciation
and the pronunciation of native speakers (Derwing and Munro 2005; in Saito 2011). In other
words, explicit phonological instruction helps to shape the ‘phonological awareness’ of the
students, which Saito (2011) defines as “the conscious knowledge of segmentals and
suprasegmentals” (46).4
In his study, Saito specifically examined the effects of segmental-based instruction,
which he values because of its teachability. The advantages of this kind of instruction for
teaching are that these features can easily be explained to the students by means of the
phonetic characteristics (articulator organs, place of articulation and manner of articulation)
and that possible pronunciation problems can be foreseen, if the students share the same L1.
Although the participants in the study by Saito did not display an improvement of their
L2 accent after having received pronunciation instruction, the participants in the study by
Lord (2010) on explicit pronunciation instruction did obtain higher results than those students
4 Words can be distinguished from one another on both the segmental level (e.g., bellow vs. mellow, or rusty vs.
trusty) and on the suprasegmental level, which refers to differences in pitch, amplitude and duration of syllables
(Koster & Cutler, 1997).
26
who did not receive any pronunciation instruction. However, it should be noted that the
results by Lord may be influenced by the fact that the participants were also enrolled in a
language immersion program, which was highly advantageous for the L2 pronunciation of the
participants. Lord concluded from these findings that the combination of both explicit
instruction and immersion leads to the best results for the L2 learners. With respect to the
study by Saito (2011), it should be noted that the participants were all adult learners, which
may imply that they faced greater difficulties physically compared to younger learner.
3.3 Form Focused Approach
Both Isaacs (2009) and Trofimovich and Gatbonton (2006) discuss the form-focused
instruction in pronunciation teaching. In form-focused instruction the stress is on grammar,
vocabulary, or pronunciation “in a meaning-oriented instructional context” (Isaacs 2009: 5)
and its aim is to “[push] learners to notice the gap between their un-target-like interlanguage
utterances and the target form” (Isaacs 2009: 43). This means that teachers should attempt to
make their students aware of the different linguistic features which exist between their L1 and
L2.
According to Isaacs, there are two different manners in which form-focused
instruction can be taught: either reactively, where the teacher gives corrective feedback to
mistakes made by the student, or pre-emptively, where the teacher draws attention to the
differing linguistic features beforehand. Trofimovich et al. (2006) remark that these form-
focused language activities have a positive effect on the students’ pronunciation, since they
will be able to process the different sound system of the L2 faster and possibly also more
efficiently. However, Isaacs (2009) observes that it is difficult in pronunciation teaching to
strike the right balance between focus on forms (where the form is more important than the
meaning) and focus on meaning. According to DeKeyer (1998; in Isaacs 2009), L2 learners
need instruction to achieve a native-like L2 production of sounds, whereas instruction is not
necessarily required for vocabulary learning. A too strong emphasis on meaning could also
obstruct students from correctly detecting and acquiring the L2 sounds (Trofimovich et al.
2006). Therefore it would be more logical to focus instruction on forms rather than on
meaning. However, although L2 phonology learning benefits from repetition, this would have
negative consequences for the communicative aspect of L2 teaching and would imply a return
to rote learning. A solution Isaacs (2009) offers for this problem is the ‘zoom principle’ by
27
Firth (1997; in Isaacs 2009), which means that teachers should focus their teachings on certain
problems in pronunciation when they appear in the communicative activities.
Trofimovich et al. (2006) found in their study on L2 learners of Spanish that their
pronunciation had improved after having received both spoken L2 input on a repetitive basis
and specific form-focused instruction. They, similarly to Isaacs (2009), note that “L2
pronunciation learning [...] can be conceptualized in the context of ‘mindful repetition in an
engaging communicative context by motivated learners’ (Ellis 2002: 177; original emphasis)”
(as cited in Trofimovich et al. 2006: 532).
3.4 Comparison of the different approaches
Pennington and Richards (1986; in Derwing et al. 1998) remarked that there is still much
uncertainty about which kind of instruction is the most optimal for improving L2 learners’
pronunciation and not all scholars accept that there is even any advantage in pronunciation
instruction. Derwing et al. (1998) examined the results of three different approaches to
pronunciation instruction: segmental accuracy (i.e., instruction at the level of the word or
smaller units), general speaking habits and prosodic factors (i.e., global strategies), and no
specific instruction. In their study, they also draw attention to the difference for L2 learners
between the dimensions of accentedness, comprehensibility, and fluency. Accentedness refers
to the degree of accuracy of the production of the L2 learners. Comprehensibility indicates
how easily listeners can understand the production of the learners. Fluency specifies the
extent to which learners can produce utterances in the L2 both smoothly and rapidly
(Kennedy & Trofimovich 2010).
Derwing et al. (1998) conducted two different experiments to examine whether
accentedness, comprehensibility and fluency improved after having received either
segmental-based, global or no pronunciation-specific instruction. In the first experiment they
collected speech samples of students at two different times: once at the beginning of the
courses and once after 11 weeks. The speech samples consisted of simple statements, which
the participants repeated from native speakers, after having read them first. Immediately after
the first experiments, the participants had to perform the second experiment, in which they
were asked to describe a standard picture story. For the first experiment, all groups displayed
a slight improvement, but only the group who had received segmental-focused instruction
showed a significantly better result. For the second experiment, only the group who had
28
received global instruction had improved on the levels of comprehensibility and fluency. The
degree of accentedness did, however, not increase in any of the groups.
These results led to the conclusion that both segmental-based and global instruction
are beneficial for the pronunciation of L2 students, but both in different situations. The
segmental-based approach is especially useful in case of a mispronunciation which has caused
a breakdown in communication. Segmental-based instruction may help the L2 learner to
notice his/her mistake and correct him/herself. However, it appeared that the attention given
to the accurate production of the L2 sounds was at the expense of the language processing
abilities of the learners (Derwing & Rossiter 2003). Students who have received global
instruction, on the other hand, seem to be better prepared to perform in spontaneous speech
(Derwing et al. 1998).
3.5 The role of language awareness, input and output in
pronunciation instruction
In the study by Kennedy et al. (2010), the relationship between the level of language
awareness of L2 learners and their level of pronunciation was tested. In addition, they also
studied whether the different approaches of the L2 learners to language awareness (i.e., the
quantitative angle and the qualitative angle)5 had a different effect on pronunciation. The
language awareness of the students was tested through entries in dialogue journals they had to
make on language and learning.
The experiment revealed that there is a correlation between the accuracy of
pronunciation and the number of qualitative language awareness comments the student made.
Kennedy et al. connect this finding with the results of previous studies on L2 grammar which
also found a relation between grammatical awareness and correct use of the grammatical
features. An explanation for this, proposed by Kennedy et al., is that those students who
displayed the highest degree of qualitative language awareness comments were already more
proficient in accentedness, comprehensibility, and fluency from the beginning of the course
onwards. However, Kennedy et al. also suggest that a quantitative approach might be
beneficial in the early stages of language learning for those students who show problems with
5 In the quantitative approach to language awareness, language learning can be seen “as a set of items
(grammatical concepts, word patterns, etc.) which need to be taken in and remembered”, whereas the qualitative
approach sees language “as something that carries meaning” and “as a context in which learning can occur”
(Kennedy et al. 2010: 177).
29
L2 phonology, since the quantitative approach is more helpful in teaching basic pronunciation
skills than the qualitative approach.
Furthermore, Kennedy et al. emphasize the importance of input in L2 acquisition.
They claim that, in pedagogy, L2 learners should first attain a certain level of proficiency in
L2 listening before they can show L2 awareness and consequently improve their
pronunciation.
For Saito (2009), on the other hand, output is equally important for L2 pronunciation
learning, because it increases the likelihood that the L2 learners will become more aware of
their own mistakes and of the correct pronunciation of the L2 forms. Types of production
activities he proposes are either controlled practice, such as reading tasks, or communicative
practice, such as picture-describing tasks. The aim of these activities is that, since the L2
students will have to notice their own accuracy on the level of syntax, word choice and
pronunciation, they will participate more in spontaneous speech, similarly to the qualitative
approach of pronunciation instruction discussed by Kennedy et al. (2010).
3.6 Conclusion
All these findings indicate that pronunciation instruction is mostly more beneficial for the L2
learners’ production than no attention to pronunciation at all. However, there exists no
definite agreement on which kind of approach leads to the best results. To achieve
communicative competences, a global approach appears to be the optimal method. However,
if the segmental elements of L2 pronunciation need to be improved, explicit or form-focused
instruction are better suited. The most important aim of pronunciation instruction is
nevertheless to create an awareness of the L2 sounds in the language learners. Consequently,
when the L2 learners are able to notice and correct their mistakes, thus improving their
accentedness, their comprehensibility and fluency will eventually improve as well.
30
Chapter 4. Vowels
Vowels can be described as vocalic sounds, which are “produced by egressive pulmonic
airflow through vibrating or constricted vocal folds in the larynx and through the vocal tract,
and the sound generated at the larynx is modified by the cavities of the tract” (Clark & Yallop
1995: 22). The size and shape of the tract can be modified by varying the positions of the
tongue and lips, resulting in different vowels, each with their own specific value. To produce
a particular vowel, the tongue has to adjust its shape and position, whereas the lips have to
adjust their shape and degree of protrusion (Clark et al. 1995).
4.1 Vowel analysis
In vowel analysis, it is especially difficult to determine the exact shape and position of the
tongue, since the tongue does not make “any specific construction in the area surrounding the
midline of the oral cavity” (Clark et al. 1995: 22). This also poses problems for the plotting of
the position of the tongue in a vowel diagram. A traditional method of plotting the position of
the tongue is a two-dimensional diagram, corresponding to the articulatory space, in which
both the tongue height (on the vertical axis) and the tongue fronting (on the horizontal axis)
are represented.
Vowel analysis can be visualized with the help of speech spectrograms (see Figure
4.1), which are two-dimensional formats which represent the changes which occur in the
speech spectrum over time. On the spectrogram three different dimensions can be found:
frequency (on the vertical axis), time (on the horizontal axis), and the magnitude of acoustic
energy (shown by the either bright or dark intensity of the display) (Clark et al. 1995).
31
Figure 4.1. Spectogram of the vowel [ɜ] (taken from Clark et al. 1995: 253)
Figure 4.2 shows the different acoustic properties which together are necessary for the
productions of vowels, in this case the vowel [ɜ]: (a) the harmonic spectrum of the phonation
source; 6
(b) the resonant response of the vocal tract; (c) the phonation spectrum which results
from the effects of tract resonance; (d) the spectrum of the final radiated acoustic sound
pressure wave, which denotes “the acoustic radiation properties of the human head” (Clark et
al. 1995: 246); and (e) the time domain pressure wave. The phonetic spectrum (a) and the
resonance peaks (b) together result in a spectrum (c), having peaks of energy caused by the
resonance. These peaks of energy, following from resonance, are called formants (Clark et al.
1995). In other words, “the formant frequencies refer to the characteristic ‘pitch overtones’ of
a given vowel as a function of the size and shape of the articulatory tract” (Zampini 2008:
226). They contain information on the quality of vowels and each type of formant has
received a number (F1, F2 etc.) according to its degree of frequency. The position and shape of
the tongue play a key role in the value of the formants (Clark et al. 1995). F1 refers to the
vowel height, determined by the position of the tongue. As the height of the vowel decreases,
the F1 will increase. This implies that a low vowel will have a higher F1 than a high vowel. F2
is also dependent on the position of the tongue and denotes the frontness or backness of the
vowel. The F2 value decreases as the vowel is produced more back. Front vowels, on the other
hand, have higher F2 values (Zampini 2008).
6 Phonation can be defined as “the periodic vibration of the vocal folds” (Clark et al., 1995: 237).
32
Figure 4.2. Acoustic properties of the vowel [ɜ] (taken from Clark et al. 1995: 247)
In a standard diagram of the vowel quality (see Figure 4.3), the F2 is plotted on the horizontal
axis, with the lowest values starting from the right increasing to the left. The F1 is plotted on
the vertical axis, with the lowest values starting from the top increasing to the bottom (Clark
et al. 1995). One of the problems this two-dimensional visualisation causes is that, for
instance, vowels which have similar F1 and F2 values but different lip positions, and therefore
different pronunciations, will be plotted on the same place in the diagram (Ladefoged 1967; in
Clark et al. 1995).
Using these formant frequencies is an objective way to describe the qualities of the
vowels. However, it should be noted that not the absolute values of the formant frequencies
are of significance in perception and production but rather their relative relationship to one
another, because of the individual differences between speakers. Languages only use a select
number of sounds of the total human articulatory potential. The speakers of a certain language
are obliged to use these sounds their language prescribes. Clark et al. (1995), therefore,
indicate that languages are normative in nature. This implies that certain language productions
can be regarded as either ‘normal’ or as strange or foreign. The same reaction to sounds can
also emerge when language users come into contact with another language and consider
certain words or names as ‘unpronounceable’. However, even within the same language it can
be expected that various productions of the same sound occur, since “the nature of the
33
articulatory mechanism, the precision and coordination needed to control it, and the fineness
of auditory discrimination” (Clark et al. 1995: 83) all control language pronunciation.
This variation of productions does not pose a problem for the everyday language users,
but it does for phoneticians, because it hinders the possibility of indicating the exact values
for, for instance, vowels which hold for each individual speaker. The most significant cause
for pronunciation differences is the variation in the size and the shape of the vocal tract and
articulators. This is most prevalent in the different productions of adults and children, since
the length of the vocal tract is higher in adults than in children. This difference in length of the
vocal tract has a rather significant influence on the values of the formant frequencies (Clark et
al. 1995).
However, speakers do tend to preserve the specific contrast existing between the
different vowels in their sound system. Even when speakers display different formant values
for their vowels, the contrast between them will be maintained. This allows listeners to
understand the productions of the speaker accurately, because language users can change their
mental sound map which they have of the language for each different speaker (Clark et al.
1995). Formant frequencies should therefore also be analysed within the vowel system of the
individual speaker and not generally, since productions vary according to the speaker and can
lead to overlap between the vowels (Ladefoged & Broadbent 1957; in Clark et al. 1995), as
displayed in Figure 4.3, representing English vowels produced by native speakers of
Australian English.
Figure 4.3. The F1 and F2 values of the English monophtongs as produced by 172 adult male speakers
of Australian English. The lines surrounding each vowel show the different distributions (taken from
Clark et al. 1995: 316)
34
4.2 Duration
Besides differences in the qualitative aspect (i.e., the formant frequencies), vowels can also
differ from one another in quantity, which is of phonological significance in some languages
(Zampini 2008). The duration of a particular vowel depends on, among other things, the
quality of the vowel and the consonants preceding and following it. However, according to
Lehiste (1976; in Clark et al. 1995), each vowel also has its own specific length, independent
from other factors. Lower vowels, for instance, generally have a longer duration than higher
vowels, because low vowels require greater effort to be produced.
The influence from the neighbouring consonants on a vowel can be either language
specific or universal because of its place of articulation. Some effects of consonants on vowel
duration only occur in some particular languages. In English, for instance, voiced stops and
fricatives lengthen the preceding vowel more than voiceless consonants. According to Clark
et al. (1995), this seems a natural effect for native speakers of English, but is in fact not a
universal language feature. However, the influence of the place where a consonant is
produced on the adjacent vowel does seem to be a universal feature. When the production of a
particular consonant requires tongue movement, the complete production of the consonant
will consequently take longer and the duration of the neighbouring vowel will therefore have
a longer duration as well. Alveolars or velars will, for instance, have a lengthening effect on
the preceding vowel, as opposed to bilabials (Clark et al. 1995).
Similar to the difference in quality between the vowels, the difference in quantity is
also relative rather than absolute, since the duration of vowels depends on their quality and
the adjacent consonants. Furthermore, Clark et al. (1995) remark that long vowels do not have
a minimum length of duration, or short vowels a maximum length. What is more important is
that similar vowels, solely differing in length, display relative duration differences when they
are placed in comparable contexts (Clark et al. 1995).
In the field of L2 learning, it has been suggested that the quantitative aspect of vowels
may play a significant role in L2 learners’ discrimination of L2 vowel contrasts which do not
occur in their L1. For instance, some studies (e.g., Minnick-Fox & Maeda 1999, Altenburg &
Vago 1987, Wang 1997; in Cebrian 2006) have revealed that to contrast the English vowels /i/
and /ɪ/ from one another, L2 learners will make greater use of duration to discriminate
between the vowels than native speakers of English would. Furthermore, other research has
shown that duration is used more for those vowel contrasts which do not occur in the
speaker’s L1 than for those contrasts which are similar between the L1 and L2 (Flege & Bohn
35
1989; in Cebrian 2006). According to Cebrian (2006), the use of duration by L2 learners can
be explained by the fact that “learners may detect some difference between the target vowels
and interpret it as a temporal contrast only” (384). L2 learners will eventually notice the non-
native L2 features and will use these features to discriminate between the L2 sounds, but
when the L2 contrast has no L1 equivalent L2 learners will only rely on the quantitative
aspect to discriminate rather than on the qualitative aspect. On the other hand, Cebrian also
remarks that when the sounds of the L2 vowel contrast only differ from one another in their
duration and do not occur in the learner’s L1, the L2 learner will not be able to discriminate
between the sounds.
4.3 The vowels of English
Compared to other languages, the English vowel system is rather large and contains a
relatively complex system of vowel contrasts. Moreover, it should be noted that the
pronunciation of the English vowels varies greatly depending on the variety of English that is
spoken. There is no standardized articulation of the English vowel system (Clark et al. 1995).
English has seven checked steady-state vowels, meaning that both the tongue and the
lip shape are held constant during the articulation (see Figure 4.4). Furthermore, the checked
steady-state vowels can be distinguished from the free vowels in that they cannot occur in
word-final stressed open syllables. However, they can occur before the consonant /η/, and
have a relatively shorter length than free vowels (Collins & Mees 2003). In addition, the short
and long vowels in English are not only distinguished from each other quantitatively, but they
also show different qualitative values (e.g., the F1 value of the free steady-state English vowel
/i:/ is lower than the one of the checked steady-state vowel /ɪ/), as opposed to some other
languages, such as Finnish, where the vowel qualities of long and short vowels are the same
(Zampini 2008).
The seven checked steady-state vowels in English are /ɪ/, /ɛ/, /æ/, /ʌ/, /ɒ/, /ʊ/ and /ə/
(see Figure 4.4). The vowels /ʌ/, /ɒ/, and /ə/ or schwa, will not be discussed here, since they
will not be included in the study presented in Chapters 5-7. The checked-steady state vowels
which will be discussed here are /ɪ/ or KIT-vowel, which is a front-central, close-mid,
unrounded vowel; /ɛ/ or DRESS-vowel, which is a front, above-open mid, unrounded vowel;
/æ/ or TRAP-vowel, which is a front, slightly above-open, unrounded vowel; and /ʊ/ or FOOT-
vowel, which is a back-central, close-mid and slightly rounded vowel (Collins et al. 2003).
36
Figure 4.4. The checked steady-state vowels of English (taken from Collins & Mees 2003: 89)
The English free vowels can be separated into two groups: the steady-state vowels and the
diphthongs. The free steady-state vowels maintain the same tongue and lip position during the
production, as opposed to the diphthongs. They can occur in stressed open syllables, but do
not occur before the consonant /η/ (except as the result of assimilations, e.g., ‘green card’ [gri:
η ka:d]). Their duration is relatively longer than the one of checked vowels. The English
vowel system has five free steady-state vowels.
The five English free steady-state vowels are /i:/, /u:/, /ɑ:/, /ɔ:/ and /ɜ:/ (see Figure 4.5).
Only the vowel /u:/ or GOOSE-vowel, which is a centred from back, close, rounded vowel, will
be discussed here as it is examined in the present study, presented in Chapters 5-7 (Collins et
al. 2003).
Figure 4.5. The free steady-state vowels of English (taken from Collins & Mees 2003: 100)
As mentioned above, the place of articulation of vowels can be measured by formant
frequencies. The mean formant values of the English vowels are presented in Table 4.1 and
4.2, presenting data of native speakers of American English and RP, respectively. For the
American-English speakers, also vowel duration data are added.
37
Table 4.1. The average duration (in ms) and formant frequencies (in Hz) of vowels produced by 46
American-English speaking children (taken from Hillenbrand et al. 1995)
/ɪ/ /ɛ/ /æ/ /ʊ/ /u:/
Duration 248 235 322 247 278
F1 511 749 717 568 494
F2 2552 2267 2501 1490 1345
Table 4.2. The mean formant frequencies (in Hz) of vowels produced by native RP speakers, aged 20-
25 years old (taken from Hawkins & Midgley 2005)
/ɪ/ /ɛ/ /æ/ /ʊ/ /u:/
F1 393 600 917 413 289
F2 2174 1914 1473 1285 1616
4.4 The vowels of Dutch
Dutch has six checked steady-state vowels (see Figure 4.6), which, similarly to English,
cannot occur in word-final stressed open syllables. All these vowels have a relative short
duration. The six checked steady-state vowels of Dutch are /ɪ/, ɛ/, /ɑ/, /ɔ/, /Y/ and /ə/ (Collins
et al. 2003).
The vowels which will be discussed here are /ɪ/ or VIS-vowel, which is a front central,
above close-mid and unrounded vowel; and /ɛ/ or FLES-vowel, which is a front, open-mid and
unrounded vowel.
Figure 4.6. The checked vowels of Dutch (taken from Collins & Mees 2003: 128)
38
Dutch has four free steady-state vowels (see Figure 4.7), which, as opposed to the checked
vowels, can occur in word-final stressed open syllables. In the Flemish varieties of Dutch,
these vowels are generally produced with a longer duration than in the Northern Dutch
varieties.
The four free steady-state vowels of Dutch are /u/, /y/, /i/ and /a:/. The free steady-state
vowels which will be discussed here are /u/ or BOEK-vowel, which is a back-central, closed,
rounded vowel; and /y/ or NU-vowel, which is a front-central, between close and close-mid,
rounded vowel (Collins et al. 2003).
Figure 4.7. Free steady-state vowels of Dutch (taken from Collins & Mees 2003: 132)
As in English, there exists considerable accent variation in the pronunciation of the Dutch
vowels. These accent variations in Standard Dutch are affected by the dialects spoken in the
different areas (Adank et al. 2004). Although Adank et al. note that especially Southern
Standard Dutch (i.e., the variety of Dutch spoken in Belgium) shows less variety in tongue
height than tongue position (i.e., front versus back), Collins et al. (2003) suggest that vowel
height slightly differs between the Dutch accents.
With reference to the present study, it should be noted that the vowel /ɪ/ is often
pronounced more closed in the dialect of Antwerp than in Standard Dutch and that its
pronunciation resembles the free vowel /i/; the vowel /ɛ/ is likewise more closed in the area of
Antwerp compared to the other Southern Dutch dialects, which often have a production of /ɛ/
closer to /æ/; the vowel /u/ is produced more front and lower in the dialect of Antwerp; and
the formant values of the free vowel /y/ are close to those of the checked vowel /Y/
(Verhoeven & Van Bael 2002; Collins et al. 2003). Furthermore, the vowel duration also
varies between the different dialects. Of significance here is that the vowel /y/ is produced
longer in the dialect of Antwerp, compared to the other Southern Standard Dutch dialects. The
duration of the other vowels which will be discussed does not differ to such a great extent
39
(Verhoeven et al. 2002). Data of vowel formant frequencies and duration by speakers from
the area of Antwerp are presented in Table 4.3.
Table 4.3. The average duration (in ms) and formant frequencies (in Hz) of vowels produced by
speakers of the area of Antwerp (taken from Verhoeven & Van Bael 2002)
/ɪ/ /ɛ/ /u/ /y/
Duration 128 102 246 290
F1
Male speakers
Female speakers
329
326
473
548
350
374
297
302
F2
Male speakers
Female speakers
2122
2600
1894
2250
840
935
1833
2051
4.5 Predicting difficulties for Dutch-speaking learners of English
As hypothesized by Flege (1987) in his SLM, most difficulties for Dutch-speaking learners of
English will arise for those sounds which are similar to one another but not identical. New
sounds will usually pose problems at the onset of learning, but will eventually be mastered by
the majority of the L2 learners (see Section 2.2.2.1).
Tops et al. (2001) propose the following learning schedule for the English vowels by
speakers of Dutch, in which the shaded phonemes are almost identical in both languages and
therefore cause only minor learning difficulties, and in which the unshaded phonemes are
similar or new phonemes and are therefore relatively problematic to learn. Note that Figure
4.8 only accounts for the acoustic properties of the vowels.
Figure 4.8. The English vowels (taken from Tops et al. 2001: 2)
40
According to Figure 4.8, the English vowel /ɪ/ should generally not form any obstacle for L1
Dutch speakers, since the Dutch phoneme is almost identical to its English equivalent.
However, as was already mentioned above, in the Antwerp accent /ɪ/ is often produced more
closed and front, which could lead to a confusion between the vowels /ɪ/ and /i:/ (Collins et al.
2003; Tops et al. 2002).
The Dutch vowel /ɛ/, however, falls between the English vowels /ɛ/ and /æ/, which
tends to lead to the confusion of both vowels by Dutch-speaking learners of English. This
confusion is further enhanced, because the vowel /æ/ does not occur in Dutch. In the dialect
spoken in Antwerp, the Dutch vowel /ɛ/ is produced quite closed, and as a result the
production of English /ɛ/ may resemble English /ɪ/ for speakers with an Antwerp accent. Since
the vowel /æ/ does not exist in the Dutch vowel system, it is often produced identical to /ɛ/,
which may lead to a breakdown in intelligibility (Collins et al. 2003; Tops et al. 2002).
In Dutch there is no equivalent to the vowel /ʊ/. Confusion often occurs between the
vowels /ʊ/ and /u:/, as Dutch speakers produce these as identical vowels, both on the
qualitative and quantitative level, perceiving both of them as instances of the Dutch vowel /u/.
The Dutch vowel /u/ is, however, produced less central than English /ʊ/, and closer and
shorter than English /u:/ (Collins et al. 2003; Tops et al. 2002).
41
Chapter 5. Case study
5.1 General aim and procedure
The aim of the present case study was to examine the productions of the English vowel
contrasts /ɛ/-/æ/-/ɪ/ and /ʊ/-/u:/ by Dutch-speaking children, as these vowel contrasts have
been reported to be difficult for speakers of Dutch (see Chapter 4). Besides the influence of
the L1, also the influence of age and instruction on L2 pronunciation were examined. In order
to obtain the data needed for this study, a group of ten school children were asked to perform
several production tasks, both in English and Dutch. These experiments made it possible to
examine both the influence of the L1 and the influence of instruction on L2 vowel production.
The informants performed the experiments at two different intervals. In the first
session, the participants first carried out a Dutch picture-matching task, where they had to
produce existing Dutch monosyllabic words. Secondly, an English repetition task and an
English picture-naming task were conducted, containing existing English monosyllabic
words. In the second session, the same English and Dutch words were used which had been
used in the first session. For the Dutch part a slightly different picture-matching task was set
up.
The experiments were conducted in the school of the participants and all participants
were tested individually in each session. Although it is often claimed that when L2 learners
are tested in the L2 their L1 will interfere less with their L2 productions (Escudero et al.
2008), the language used during the current tests was Dutch, the informants’ native language.
The L1 of the participants was chosen as the principal language for explanation during the
experiments in order to ensure that the participants would sufficiently comprehend all the
tasks and additional explanations, since their proficiency level of English was not very high
yet. Furthermore, Escudero et al. (2008) remark that mostly advanced L2 learners may
experience L1 interference when their L1 is used. In the current study, all participants were,
however, beginning learners.
42
5.2 Methodology
5.2.1 Participants
Ten children participated in the tests, all of whom were native speakers of Dutch. None of
them were reported to have any speech deficits, hearing or learning disabilities. At the time of
the experiments, the participants were in the fifth grade of primary school and were all
between nine and twelve years old. This implies that they could still be counted as young L2
learners at the time of testing (Bongaerts et al. 1997). The school took part in a pilot project of
the Flemish community to promote English instruction in primary school. As part of the
programme, the children received two hours of English instruction every two weeks, mostly
in the form of games and songs. As English is not part of the official school curriculum, the
children did not take any tests or exams in English.
Prior to the experiment, none of the participants had received any formal education in
English, had lived in an English-speaking environment, or used English on a regular basis. It
should nevertheless be noted that some of the participants already displayed a higher
knowledge of English vocabulary than others. They reported to have learned some English
through television programs and other media. Nonetheless, since the first experiment was
conducted in September 2010, before the English classes had started, and the second
experiment in May 2011, the level of proficiency in English of all participants was quite low
at both times of testing. Mainly, because in all the informants had only received
approximately ten hours of instruction. The data of these experiments were gathered as part of
a larger research project.7
Although some of the parents of the participants had a different linguistic background,
all participants spoke Dutch at home. Only one participant reported to speak both Dutch and
Portuguese with his parents and siblings. All informants lived in the area of Mol, Flanders,
where they also went to school. This may have an influence on the participants’ productions
of the English and Dutch vowels, as discussed in Section 4.4. The children were not aware of
the actual aim of the English production tasks, but believed it was an English vocabulary test.
7 The data were gathered by Dr. Ellen Simon as part of an FWO project on ‘Child L2 Development’. The
analysis of the data (see Chapter 6) was performed by myself.
43
Table 5.1 presents information on the gender, age and linguistic background of the
informants.
Table 5.1. Informants’ background information
Informant Gender Age Other language
knowledge8
1 M 10 -
2 M 10 -
3 M 10 -
4 M 10 -
5 F 9 -
6 M 9 -
7 M 10 -
8 F 12 Romani
9 F 11 Moroccan (fluent)
10 M 10 Portuguese (fluent)
5.2.2 Design and stimuli
5.2.2.1 Session 1
Before the experiments were started, it was first explained to the participants that they would
perform games in English and Dutch, but that these were not connected to any schoolwork or
would not have an influence on any of their school grades. After the Dutch and English tasks
were performed, the participants were asked if they found the tasks difficult, especially with
reference to the English words used.
The Dutch task
In Session 1, the participants were first asked to perform a Dutch production task, in order to
examine the informants’ pronunciation of the Dutch vowels. The method used was through a
picture-matching task which consisted of two parts. In the first part, the participants were
given two sets of pictures, of which each picture could be semantically linked to another
picture of the other set. For instance, the picture of a ball of wool [wol] could be linked to a
picture of a cat [kat]. The instructions for the tasks were provided orally in Dutch. The
8 The category ‘other language knowledge’ does not include Dutch, French and English, since Dutch is the
native language of all participants and they were taught English and French at school.
44
participants were told that the pictures were jumbled and were instructed to match these
pictures again and in doing so say the word the picture represented. To ensure that the
participants understood the task, they were first tested with an example sentence. After this,
the informants were asked to repeat the words of each picture before placing them, one at a
time, back in an envelope which was handed to them, in order to have two productions of
each stimulus.
In the second part of the Dutch production task, the participants were given a sheet on
which pictures were randomly linked to other pictures. The participants were asked to follow
the lines that were drawn between the pictures and name the two connected pictures by saying
the sentence “the X is connected to the Y” [“de X is verbonden met de Y”]. For instance, the
picture of a wall [muur] was connected to the picture of a bag [zak].
The stimuli used in these production tasks were all monosyllabic words with a CVC-
structure. Since the participants could choose the order in which they named the pictures
themselves, the stimuli were arranged randomly. In this experiment, twelve Dutch steady-
state vowels were used, of which four will be examined in the current study, as they may have
an influence on the production of the English vowel contrasts discussed in this study (see
Chapter 7). In total, 24 stimuli were used. When a participant did not know a word or could
not immediately find the word which was intended, they were first given a description,
because repetition of the word by another speaker could influence the participants’ natural
pronunciation (Munro, 2008). In most cases this enabled the participants to name the correct
word. The stimuli of the examined vowels are presented in Table 5.2.
Table 5.2. The words containing the relevant vowels used in the Dutch picture-matching tasks
Target vowel Dutch words
/ɪ/ kip, vis
/ɛ/ bed, zes
/u/ boek, voet
/y/ muur, vuur
The English tasks
After the Dutch production tasks, the participants were asked to perform two English tasks.
The aim was to test the participants’ productions of the English vowels, before they had
received any formal instruction in English.
45
The first task was a repetition task. The participants were told that they would see
pictures on a computer screen and would hear the matching word in the meantime, spoken by
a native speaker of English, through the headphones they were wearing. After having heard
the word, they were instructed to repeat it loud and carefully. The informants repeated each
word twice. Again, similar to the Dutch production tasks, the instructions were given orally
and in Dutch, to ensure that the participants fully understood everything. Furthermore, the
participants were advised to attempt to memorize the words they heard, because most of them
would return in the following task. The words were placed in a random order, with none of
the words containing the same vowel occurring in a sequence.
The second task was a simple picture-naming task, where the participants were shown
pictures and were asked which of them they could name in English.9 Not all participants were,
however, able to remember all the words asked. In case the participants could not name the
picture, the correct word was given to them and they were asked to repeat it. These repetitions
were not used in the analysis.
The pictures were shown to the participants in a random order, each time in groups of
four. In these two tasks, eleven English steady-state vowels were tested. In both tasks 24
stimuli were given. For each vowel, three stimuli were used, except for /æ/ and /ʊ/, which had
two stimuli in both tasks. The stimuli were all monosyllabic words with a CVC-structure.
Table 5.3 presents the stimuli which contain the vowels which will be discussed in the present
study.
Table 5.3. The words containing the relevant vowels used in the English repetition and
picture-naming tasks
Target vowel English words
/ɪ/ fish, pig, six
/ɛ/ bed, head, pen
/æ/ cat, hat
/ʊ/ book, foot
/u:/ goose, pool, shoes
9 The initial idea was to conduct a quartets card game, in which the participants would have to ask questions and
accordingly name the pictures in order to collect a complete set of four cards of the same colour. However, since
the participants had not yet received any English instruction, this task proved to be too difficult. Furthermore,
there was a rather big difference between the participants in their knowledge of English vocabulary.
46
5.2.2.2 Session 2
The aim of the tests in the second session was to examine whether the pronunciation of the
English vowels had changed after the participants had received English instruction. Besides
the production of the English vowels, the Dutch vowels were also tested. Before the tests were
carried out by the participants, all participants were told that, similarly to the first session,
they would be asked to play some language games, the results of which would not affect any
of the participants’ school grades.
The English tasks
In the second session, first the participants’ production of the English vowels was tested in a
repetition and picture-naming task. The words used and the order in which they appeared
were completely identical to the English repetition and picture-naming tasks conducted in
Session 1 (see Section 5.2.2.1). The instructions were given in Dutch, because the English
proficiency level of the participants was still quite low. However, since the participants had at
that moment already received several English classes, their knowledge of the English
vocabulary had slightly improved compared to Session 1.
The Dutch task
After the two tasks for English were conducted, the participants were asked to perform a
Dutch production task. The instructions for this task were given orally in Dutch. Again, the
same words were used in this task as in the Dutch production tasks conducted in the first
session (see Section 5.3.1).
This time, the participants were handed a set of cards with words on them and a set of
cards showing pictures. Each word could be connected with a picture, having some form of
relationship to one another. The participants were instructed to place the cards with the
pictures next to those cards with the matching words. For example, the word ‘midnight’ was
matched with a picture of a ghost [spook]. When doing so, they were asked to name the words
which the pictures represented. As already mentioned above, the stimuli used in this task were
the same as in the first session. In total, 24 stimuli were used, of which 8 contained one of the
vowels which are discussed in the present study (see Table 5.2). In case the participants could
not immediately find the correct term, the word was described to them in Dutch or a Dutch
synonym was given.
47
5.3 Coding and analysis
The vowels in the present study were analysed with the help of the computer program
PRAAT (Boersma & Weenink 2012), which is specially developed for doing phonetics by
computer. In the present study, both the formants and duration were calculated by the use of
PRAAT.
As a first step, each word was separated by putting boundaries roughly at the onset and
end of it. Secondly, the vowels of each word were selected, similarly by putting boundaries
around them. Both the words and their vowels were selected manually. This results in a
waveform with two textgrids, as can be seen in Figure 5.1, which represents the English word
‘pen’.
The duration and formants of each vowel were then measured automatically by
running scripts, developed for PRAAT (Boersma et al. 2012). The scripts indicate the
beginning and end point of the vowels, their duration, and the mean value of each vowel’s
first, second and third formant (F1, F2, F3). In the present study, the duration and F1 and F2 of
both the English and Dutch vowels will be taken into account. As mentioned in section 4.1,
the F1 value gives information on the openness or closeness on the vowel, while the F2 value
gives information on the frontness or backness of the vowel.
Those vowels which were spoken too softly or which had background noise were not
used, because this may affect the formant frequencies. Words in the English picture-naming
which were repeated rather than produced spontaneously tasks were not analysed either.
Figure 5.1. The waveform and textgrid for the word ‘pen’ as produced by Participant 1 in the English
repetition task (hence ‘R’) during the first session
48
5.4 Hypotheses
Based on earlier studies discussed in Chapters 1-4 a number of hypotheses can be formulated.
At first, since this hypothesis may possibly carry implications for the other hypotheses, the
influence of the L1 of the informants on their L2 vowel production will be discussed. The
hypothesis states that the English vowel production of the informants will be influenced by
their L1 both on the level of the formant frequencies and on the level of duration. Because the
informants in this study are children, the second hypothesis concerns the advantage of young
learners over older learners in L2 learning, more specifically in the field of phonology. It is
hypothesized that the vowel production of the participants will not be native-like yet, as they
performed in the experiments at the very onset of learning, despite their relatively young age.
A third hypothesis relates to the effects of instruction on the pronunciation of the informants,
for which they were tested at two different intervals. As seen in Section 5.2, the participants
received only a rather limited amount of instruction between the two sessions, which leads to
the hypothesis that the participants will not show a considerable improvement of their vowel
production at the second time of testing.
Differences and similarities between the L1 and L2
For this study, not only the English vowel productions of the participants were examined, but
also their Dutch vowels in order to be able to study the influence of differences and
similarities between a learner’s L1 and L2. As discussed in Chapter 2, researchers have
proposed opposing ideas on whether similar or dissimilar sounds form greater difficulties for
L2 learners. In this study, the hypothesis of Flege’s SLM (1997) will be followed, stating that
similar sounds are more difficult for L2 learners than dissimilar sounds. The hypothesis,
among others proposed by O’Brien et al. (2010), that new sounds will be easier for younger
learners than for older learners, because the former do not have established their phoneme
system yet, will however not be followed here. It is expected that the informants at the age of
ten will already have established their vowel system fairly well, although intraspeaker
variation is still possible. Rather, it is hypothesized that the new vowels will equally pose a
problem for the present informants, as they may undergo “category assimilation”, which
implies that the informants perceive them as L1 sounds (O’Brien et al. 2010).
With respect to duration, it was proposed that duration is often used by L2 learners for
those vowel contrasts which do not occur in their L1 (see Section 4.2). It is therefore
49
hypothesized that the duration contrasts will be most visible between the front vowels /ɪ/-/ε/-
/æ/, because /æ/ does not occur in the informants’ L1. Furthermore, it is hypothesized that the
English vowels /ɪ/ and /ε/ will be reasonably influenced by the duration of their closest Dutch
equivalents /ɪ/ and /ε/. The vowel /æ/, which does not have an equivalent in Dutch, may be
subject to more production variation. Regarding the two back vowels /ʊ/ and /u:/, it can be
hypothesized that the participants will perceive the contrast between the short vowel /ʊ/ and
the long vowel /u:/ and will accordingly make this contrast in their productions.
Concerning place of articulation, the following hypotheses are proposed (see Section 4.5):
Because the vowel /ɪ/ occurs both in Dutch and in English, it is hypothesized that the
participants will make use of their L1 pronunciation for the production of the English
vowel, although /ɪ/ is pronounced somewhat differently in both languages. In Dutch,
/ɪ/ is produced more back.
A similar hypothesis can be made for /ε/, which also has an equivalent in both
languages which differ slightly in place of articulation, as Dutch /ε/ is more open.
For the production of the new vowel /æ/, it is proposed that the participants will not
notice a difference between /ε/ and /æ/ and will consequently rely on their production
of /ε/ for the pronunciation of /æ/, thereby mispronouncing the vowel, as it should be
produced more open than /ε/.
Lastly, because /ʊ/ nor /u:/ have an identical equivalent in Dutch, it is hypothesized
that the participants will not be able to contrast them (regarding formant frequencies,
not duration) and will produce them similarly to Dutch /u/. Theoretically, Dutch /u/
differs from English /u:/ solely in length, and should therefore not pose a problem
acoustically.
The influence of age
The hypothesis that younger learners perform better in L2 acquisition, because of social,
educational and motivational advantages, is not followed here. Besides age, it was suggested
that other factors may be important for L2 learning too, most significantly the amount of
experience the learner has in his/her L2. Furthermore, most studies which stated that young
learners perform better than late learners examined L2 learners who were immersed in an L2
speaking country (for example, Snow et al. 1978; Munro et al. 1996; Baker et al. 2008). The
informants in the present study, however, have very limited experience in speaking English,
50
as they had received only a small amount of instruction at the time of testing, and are not
immersed in a English-speaking environment.
The influence of age is also of significance concerning the variation in articulation of
the participants, both for their Dutch and English vowels. Because the phoneme system of
children is not firmly fixed yet (see Section 2.2), they may display intraspeaker variation in
their L1, which also has implications for the production of their L2. Since the informants were
all approximately ten years old at the time of testing (i.e. before puberty), it is hypothesized
that the participants will show a small degree of intraspeaker variation in the production of
their Dutch vowels, although not comparable to the variation found in younger children’s
speech. This variation will consequently also cause intraspeaker variation in the production of
their English vowels, especially since the informants’ production is examined at the onset of
L2 learning, which implies that their vowel system is not established yet.
Effects of instruction
In the field of pronunciation instruction, there is no consensus about which approach
generally leads to the best results in accurate L2 phonology, but some attention must be
devoted to pronunciation in instruction in order to find improvements in the learner’s
articulation (Derwing et al. 1998). At the first time of testing, the participants had not
received any English instruction yet. Therefore, they were asked to perform the same tasks at
a second interval to examine whether their production of the English vowels had benefited
from the instruction they had received. However, most attention in the lessons was devoted to
vocabulary and grammar and the amount of instruction received was very limited (see Section
5.2. This leads to the hypothesis that in general for the present participants no effects of
instruction will be found on their L2 vowel production. However, it should be remarked that,
since the knowledge of vocabulary of the participants had increased between the two sessions,
their awareness of the L2 may have developed too. A greater awareness of the informants of
the differences between Dutch and English would lead to a more accurate pronunciation.
However, this last hypothesis is formulated with caution, since the proficiency of English of
the informants was still relatively low at the second session.
51
Chapter 6. Results
This section presents the results obtained from the experiments as discussed in Chapter 5.
First the results from the Dutch tasks will be presented in Section 6.1, followed by the results
from the English tasks in Section 6.2. The discussion of the English tasks is divided according
to the hypotheses. All relevant vowels were analysed with the use of PRAAT (Boersma et al.
2012).
All tables presented show the average (-Outliers) values and standard deviation (-
Outliers) values. These values are based on the mean values of the vowels’ F1, F2 and
duration, and their standard deviations. The standard deviations revealed that the productions
of the participants showed great variation, especially in the F2 values, which may be due to
measurement errors or by accidental untypical productions by the participants which led to
outlying values. As it is essential for the comparison with the English formant values to find
the most representative results, a second average was calculated to reduce the influence of
potential uncharacteristic values on the results. In this case, all F1 and F2 values which were
greater or smaller than 1.5 standard deviations from the average value were omitted. In the
tables, the standard deviation (-Outliers) values will be placed between brackets. Only these
last results will be used in the presentation of the results for both the Dutch and English tasks.
6.1 Dutch
As discussed in Section 2.2, the L1 plays a significant role in the production of an L2, which
led to the hypothesis in this study that the participants’ English vowel productions will be
influenced by their Dutch vowel production. Therefore, the participants in this experiment
were asked to perform two Dutch picture-naming tasks. The results of these tasks are used to
compare the influence of the L1 on the L2 in the present study. Additionally, it should also be
noted that, since the L1 accent of the participants is regionally influenced, these results may
lead to more accurate observations than data retrieved from existing literature (see Section
4.4).
52
1. Formant frequencies
Table 6.1 shows the average (-Outliers) and standard deviation (-Outliers) values of the F1 and
F2 frequencies of the Dutch vowels from Session 1.
Table 6.1. Formants for the Dutch spread vowel productions recorded during Session 1 (in Hz)
/ɪ/ /ɛ/ /u/ /y/
F1 F2 F1 F2 F1 F2 F1 F2
Average
531
(33)
1777
(543)
646
(85)
1769
(197)
398
(70)
806
(184)
405
(80)
2039
(371)
According to these results, the F1 values show less variation in production than the F2 values
of the participants, implying that the F1 frequencies are more stable in the productions of the
informants. The F1 values seem to represent the typical productions relatively well, as the
standard deviation values are rather low. The F2 values, on the other hand, still display rather
high standard deviation values, even after the exclusion of the outlying values. The high
standard deviations denote that there is much variation found in the informant’s productions.
A possible explanation for why the F2 values on average show higher variation than the F1
values may be linked to the articulatory vowel space in which the vowels are produced. The
F1 values for the Dutch vowels of the first session range from 398 Hz (for /ʊ/) to 646 Hz (for
/ε/), while the F2 values range from 806 Hz (for /u/) to 2039 Hz (for /y/). This might lead to a
smaller chance of overlap between the vowels concerning fronting than height, which may
accordingly allow more variation in the F2 values.
These values indicate that the highest, and therefore most closed, vowels of the
informants are /u/ and /y/, whose F1 values do not differ significantly. Of the two spread
vowels /ɪ/ and /ε/, the highest vowels is /ɪ/, as was expected. With the respect to the F2 values,
/y/ is produced most front, followed by /ɪ/ and /ε/ who have almost equal results. Most
striking are the F2 high standard deviation values of particularly /ɪ/ and /y/.
In order to be able to provide a more visual representation of the spectral quality of the
vowels, the formant frequency values can be placed into a graph. Figure 6.1 displays the mean
values of the four Dutch vowels discussed in the present study, as produced by the
participants during Session 1, and also includes the formant frequency values of speakers with
an Antwerp accent (taken from Verhoeven & Van Bael 2002), as discussed in section 4.4, for
53
comparison. Although Verhoeven et al. separated the data of the male and female speakers, an
average result of both groups is used here.
Figure 6.1. The formant values for the Dutch vowels produced by the participants during Session 1
and formant values retrieved from Verhoeven & Van Bael (2002)
Figure 6.1 reveals that the average (-Outliers) values of the participants in this study and the
average values retrieved from the study of Verhoeven et al. (2002) are considerably different.
The difference in vowel productions is most apparent in the comparison of the formant values
of the vowels /ɪ/ and /u/. A main reason for this variation may be the difference in age of the
informants of the present study and the informants of the study of Verhoeven et al., as age
influences the size and the shape of the vocal tract and articulators, which influence the nature
of the sound productions (Clarke et al. 1995). Another explanation may be the differences in
accent between the two groups, since Mol, where the informants live, is not located in the
centre of the province of Antwerp. Therefore, the data concerning the formant frequencies
from Verhoeven et al. (2002) will not be further used in the discussion on the influence of the
L1 on the L2, as these data are not relevant for the participants in the present study.
Furthermore, Figure 6.1 shows that for the current participants the F2 values of the
vowels /ɪ/ and /ε/ are relatively similar, both pronounced reasonably front, but that these
vowels contrast in their F1 values, with /ɪ/ produced higher. For /u/ and /y/, on the other hand,
whereas the F1 values are approximately identical, the F2 values differ greatly, with /y/ as the
most fronted vowel and /u/ as the most backed vowel in this study for these speakers.
200
400
600
800
5001000150020002500
F1
(in
Hz)
F2 (in Hz)
Participants /ɪ/ Participants /ɛ/ Participants /u/ Participants /y/
Literature /ɪ/ Literature /ɛ/ Literature /u/ Literature /y/
54
To examine whether these data are accurate, the participants also performed a Dutch picture-
naming task during the second session. Of these results again the average (-Outliers) and
standard deviation values will be presented (see Table 6.2).
Table 6.2. Formants for the Dutch vowel productions recorded during Session 2 (in Hz)
/ɪ/ /ε/ /u/ /y/
F1 F2 F1 F2 F1 F2 F1 F2
Average
542
(46)
1845
(308)
682
(96)
1610
(246)
443
(53)
998
(148)
366
(68)
1889
(386)
As expected, the formant values from the second session are not completely identical to those
of the first session, but some similarities can be found. First of all, the F2 values show greater
variety than the F1 values. Secondly, the mean F2 value of /u/ is the lowest, while the mean F2
value of /y/ is the highest. Thirdly, the mean F1 values of /ɪ/ and /ε/ are approximately the
same to those of Session 1. Different, however, are the mean F2 values of these two vowels.
Whereas these vowels hardly differed from each other in their F2 values in the first session,
they show a greater contrast in the second session. The F2 values for /ɪ/ are higher compared
to those of the first session, which accordingly causes the F2 values of /ɪ/ and /y/ to be more
similar. Here too the average formant values can be placed into a graph to provide a more
visual representation of the results.
Figure 6.2. The formant values for the Dutch vowels produced by the participants during Session 2
Figure 6.2 supports the findings which were deduced from the results presented in Table 6.2.
However, most importantly, the same production tendencies of the first session (see Figure
6.1) are also represented in this graph. The highest and most fronted vowel is /y/, whereas the
200
400
600
800
5001000150020002500
F1
(in
Hz)
F2 (in Hz)
Participants /ɪ/ Participants /ε/ Participants /u/ Participants /y/
55
most backed vowel is /u/ and the lowest vowel is /ε/. The vowel /ɪ/ takes in a middle position
between /y/ and /ε/. These results indicate that the vowel productions of the informants are
relatively consistent.
2. Duration
Next to the formant frequencies of the vowels, the duration of the Dutch vowels of the
participants was measured, to examine to what extent the informants’ L1 vowels influence the
duration of the informants’ L2 vowels. The mean values and standard deviations (between
brackets) are presented in Table 6.3.
Table 6.3. Duration of the Dutch vowels recorded during Session 1 (in ms)
/ɪ/ /ε/ /u/ /y/
Average 113
(28)
126
(30)
117
(39)
184
(45)
One of the most striking features of the results presented above is that the standard deviation
for all vowels is quite small. These findings suggest that the production of the vowels is fairly
consistent and similar between the different informants regarding duration. Figure 6.3
presents these average duration values of each vowel as produced by the participants during
Session 1 in a bar graph.
Figure 6.3. The average duration of the Dutch vowels recorded during Session 1 (in ms)
Figure 6.6 clearly shows that the vowel /y/ has the longest duration of the selected vowel
group, thereby visibly contrasting with the vowel /u/. Both these vowels will be used in the
comparison with the English back vowels /ʊ/ and /u:/. Figure 6.6 further indicates that /ε/ has
0 50 100 150 200
(in ms)
/ɪ/
/ε/
/u/
/y/
56
a longer pronunciation than /ɪ/, both of which are short vowels in Dutch. It would, however,
be interesting to examine if this tendency holds for each individual participant, since the
difference in length between these two vowels is relatively small. The values per participant
are therefore included in the appendices (see Tables 8.3 and 8.4). They show that there is
indeed some variation in duration between these vowels. In addition, it may be noted that the
informants in the study of Verhoeven et al. (2002) display a longer pronunciation for /ɪ/,
instead of /ε/ (see Section 4.4).
Furthermore, the duration values as measured during the first session can also be
compared to those measured during the second session, to examine whether the findings are
accurate and whether the participants display a consistency in their pronunciation. Table 6.4
presents the average (-Outliers) and standard deviation (-Outliers) values of the duration of
these vowels.
Table 6.4. Duration of the Dutch vowels recorded during Session 2 (in ms)
/ɪ/ /ε/ /u/ /y/
Average 122
(29)
150
(24)
134
(16)
223
(37)
In relation to the values retrieved during the first session, these data do not show much
variation, as the standard deviations are relatively small. Striking is that, compared to the first
session, the pronunciation of the vowels appears to be longer here, leading to a higher average
result for each vowel. Most important for the comparison with the English vowels, however,
is the question whether the average values of the first session correspond to those of the
second session, with respect to the relative contrast in duration between the vowels.
Therefore, the values are again placed into a bar graph (see Figure 6.4).
Figure 6.4. The average duration for the Dutch vowels recorded during Session 2 (in ms)
0 50 100 150 200 250
(in ms)
/ɪ/
/ε/
/u/
/y/
57
As can be clearly seen from Figure 6.4, the mean values of the vowels retrieved during the
second session show a greater contrast from each other than those recorded during the first
session, especially for /ɪ/, /ε/ and /u/. Nevertheless, the general tendencies found in the first
session can again be found here. Of importance for the present discussion is the confirmation
of the finding that the informants seem to produce /ε/ with a longer duration than /ɪ/.
Therefore, as was already proposed in the presentation of the results of the formant
frequencies, the data of the participants in the study of Verhoeven et al. (2002) do not hold for
the participants in this study and will consequently not be used.
6.2 English
The participants’ productions of their L1 vowels, as presented in the previous section, were
examined in order to compare them with the productions of their English vowels. Therefore,
the participants also performed two English production tasks during the first and second
session. This section will first compare the results from the repetition and picture-naming
task, followed by a comparison of the vowels of the first and second session. Thirdly, the
amount of variation found in the informants’ productions will be presented. Finally, formant
and duration values of native speakers will be provided for comparison. In each part, first the
results of the formant values will be presented and then the results of the vowels’ duration.
In order to provide a visual depiction of the qualitative and quantitative aspects of the
vowels, the results will be presented in graphs, using the mean values from the repetition and
picture-naming tasks of the first and second session, which are displayed in Table 6.5 for the
formant frequency values and Table 6.6 for duration. In the presentation of the duration
results, the data of the repetition and picture-naming task were taken together, because it was
found that the difference in results between these tasks was not statistically significant (p >
0.5).
58
Table 6.5. The formant values of the English repetition (= Rep.) and picture-naming (= P.N.) task in
the first and second session (in Hz)
/ɪ/ /ε/ /æ/ /ʊ/ /u:/
F1 F2 F1 F2 F1 F2 F1 F2 F1 F2
Rep. Session 1 526
(43)
1742
(676)
695
(93)
1851
(448)
738
(102)
1812
(280)
528
(49)
1100
(177)
491
(68)
956
(303)
P.N. Session 1 538
(26)
2371
(227)
628
(83)
1879
(297)
658
(65)
2004
(96)
405
(78)
963
(228)
384
(30)
776
(75)
Rep. Session 2 516
(38)
2179
(543)
725
(72)
1860
(324)
753
(81)
1906
(146)
535
(51)
1270
(240)
448
(42)
1026
(172)
P.N. Session 2 534
(52)
2212
(704)
730
(52)
1716
(347)
709
(67)
1685
(420)
474
(55)
1011
(185)
436
(61)
866
(269)
Table 6.6. The duration values in the first and second session (in Hz)
/ɪ/ /ε/ /æ/ /ʊ/ /u:/
Session 1 146
(48)
186
(50)
207
(73)
150
(48)
208
(37)
Session 2 149
(31)
207
(45)
198
(33)
132
(19)
219
(49)
6.2.1 Repetition vs. picture-naming task
The results from the repetition and picture-naming tasks will be compared, because repetition
tasks where an informant repeats the stimuli after a native speaker “may produce better-than-
normal output by closely imitating the model voice” (Munro 2008: 202). In this part only the
acoustical quality of the vowel productions by the informants will be discussed, as the results
for duration were taken together. Figure 6.5 presents the results from both tasks of the first
session.
59
Figure 6.5. The vowels produced during the first session
(Rep. = Repetition task; P.N. = Picture-naming task)
With respect to the front vowels, Figure 6.5 shows that the productions of /ε/ and /æ/ hardly
differ in the repetition task from the picture-naming task. Moreover, it seems that these two
vowels are not contrasted from one another, but rather that the informants produce them as if
they were the same vowel. However, more striking are the results for /ɪ/. Although the
informants show that they are able to contrast this vowel from /ε/ and /æ/, they produce the
vowel rather differently in both tasks. Whereas the informants produced /ɪ/ very back in the
repetition task, the vowel becomes very front in the picture-naming task. Taken together with
the fairly high standard deviations for the F2 values of this vowel, as presented in Table 6.5, it
appears that the informants show a rather inconsistent production for /ɪ/, with respect to its
fronting.
The formant frequencies of the back vowels seem to follow a quite regular pattern in
the repetition and picture-naming tasks. In both cases, the vowel /ʊ/ is produced relatively
more front and lower than /u:/, although the absolute values of the repetition and picture-
naming task differ from each other. However, these results indicate that the informants are
able to contrast between /ʊ/ and /u:/, both when repeating an L1 English speaker in the
repetition task and independently in the picture-naming task.
To be able to examine whether the findings from the first session are typical of the
informants’ production, the results from the second session are presented too (see Figure 6.6).
200
400
600
800
5001000150020002500
F1
(in
Hz)
F2 (in Hz)
Rep. /ɪ/ Rep. /ε/ Rep. /æ/ Rep. /ʊ/ Rep. /u:/
P.N./ɪ/ P.N. /ε/ P.N. /æ/ P.N./ʊ/ P.N./u:/
60
Figure 6.6. The vowels produced during the second session
(Rep. = Repetition task; P.N. = Picture-naming task)
When comparing both tasks, the values for /ɪ/, as presented in Figure 6.6, seem to be
relatively regular, as the informants produced this vowel virtually identical in both tasks. The
vowel /ɪ/ is also again clearly contrasted from /ε/ and /æ/. Striking in these results is that the
values of /ε/ and /æ/ lie fairly close to each other in the two tasks, suggesting that these
vowels influence each other’s production. Furthermore, it is noteworthy that their relation to
one another seems similar in both separate tasks: /ε/ is in both cases produced lower and more
front. The absolute values of both vowels differ, however, between the repetition and picture-
naming tasks, as the values of the picture-naming tasks are lower for all front vowels.
As is shown in Figure 6.6, the contrast between /ʊ/ and /u:/ appears to be relatively
similar in both tasks: /ʊ/ is on average produced more front and lower than /u:/ in both
session. This finding would suggest that the participants do maintain a contrast between these
two vowels, as the same observation was made in the results of the first session for the back
vowels.
Summary
The comparison of the results from the repetition and picture-naming task indicates that the
informants’ back vowels show most variation. In both sessions, the pronunciation of the back
vowels differs in the repetition and picture-naming task, although the relative contrast
between the vowels seems to be maintained, which may indicate that the informants are aware
that there is a contrast between these two vowels, but do not have a fixed production yet.
200
400
600
800
5001000150020002500
F1
(in
Hz)
F2 (in Hz)
Rep. /ɪ/ Rep. /ε/ Rep. /æ/ Rep. /ʊ/ Rep. /u:/
P.N./ɪ/ P.N./ε/ P.N. /æ/ P.N./ʊ/ P.N. /u:/
61
The vowels /ε/ and /æ/ were in both tasks in both sessions produced very similarly,
hardly showing any contrast, implying that the informants are not able to produce any
acoustical contrast. In the first session, the informants produced /ɪ/ very differently in both
tasks, as opposed to the second session, where the mean values of both tasks were virtually
identical, which indicates that the informants are in fact able to produce this vowel fairly
consistently.
6.2.2 Pre-instruction vs. post-instruction vowels
To examine whether the vowel productions of the informants had improved after having
received some English instruction, the mean vowel values of the first and second session will
be compared.
1. Formant frequencies
For clarity, the front and back vowels will be presented separately in this part. Figure 6.7
displays the formant values of the front vowels of both tasks.
Figure 6.7. The front vowels produced during the first and second session
(Rep. = repetition task; P.N. = picture-naming task)
Figure 6.7 shows that the formant values of the vowels produced in Session 2 provide a rather
different image than the formant values of the Session 1. First of all, the irregular production
of /ɪ/ in the first session, does not occur in the second session, as the informants produce this
400
600
800
150020002500
F1
(in
Hz)
F2 (in Hz)
Session 1
Rep. /ɪ/ Rep. /ε/ Rep. /æ/
P.N. /ɪ/ P.N. /ε/ P.N. /æ/
400
600
800
150020002500F
1 (
in H
z)
F2 (in Hz)
Session 2
Rep. /ɪ/ Rep. /ε/ Rep. /æ/
P.N. /ɪ/ P.N. /ε/ P.N. /æ/
62
vowel almost identically in both tasks. In both sessions, /ε/ and /æ/ seem to be produced quite
similarly by the informants in both tasks, although their relative relations to one another differ
slightly. In the picture-naming task of the first session /ε/ and /æ/ are produced more front
than in the repetition task, while they are produced more back in the picture-naming task of
the second session. This difference may, however, not be very significant, as on average
hardly any contrast is made between these vowels.
Figure 6.8 presents the formant values of the back vowels from both tasks.
Figure 6.8. The back vowels produced during the first and second session
(Rep. = repetition task; P.N. = picture-naming task)
It was suggested in Section 6.2.1 that the informants contrasted /ʊ/ and /u:/ fairly regularly in
both tasks in both sessions. Figure 6.8, however, also shows that the informants do not
contrast these vowels similarly in the first and second session. In both sessions, /ʊ/ is
produced more open and front than /u:/ in both tasks, but while /ʊ/ is produced more closed in
the picture-naming task than /u:/ in the repetition task in the first session, it is produced
slightly more open in the picture-naming task than /u:/ in the repetition task of the second
session. It may be even remarked that in the second session there is virtually no contrast
between the formant values of /u:/ in the repetition task and the formant values of /ʊ/ in the
picture-naming task. Furthermore, all values from the second session seem to be produced
more front than those from the first session.
200
400
600
50010001500F
1 (
in H
z)
F2 (in Hz)
The first session
Rep. /ʊ/ Rep /u:/ P.N. /ʊ/ P.N. /u:/
200
400
600
50010001500
F1
(in
Hz)
F2 (in Hz)
The second session
Rep. /ʊ/ Rep. /u:/ P.N. /ʊ/ P.N. /u:/
63
2. Duration
Besides formant frequencies, the duration of the English vowels was measured to examine
whether the child L2 learners reached native-like values for duration and whether the vowels’
duration was influenced by the participants’ L1. The mean duration values from Session 1 and
2 for both the front and back vowels are displayed in Figure 6.9.
Figure 6.9. The average duration for the English vowels in the first and second session (in ms)
According to 6.11, the participants seem to produce the vowels /ɪ/ and /ʊ/ and the vowels /æ/
and /u:/ almost equally long in the first session, barely displaying any contrast in length of
production. The vowel /ε/ falls in between these two groups. The results from the second
session, however, show rather different values for duration. The vowel /u:/ is produced
considerably longer than /æ/ in the second session, while these two vowels had an almost
equal length in the first session. A similar tendency occurs between /ɪ/ and /ʊ/, with /ɪ/ being
produced longer in Session 2. Also noteworthy is that /ε/ has a somewhat longer duration than
/æ/ here, as opposed to Session 1. It appears that the informants’ productions of the front
vowels differ more from each other in the first and second session than their back vowel
productions, which are more stable throughout both sessions.
3. Summary
The comparison of the first and second session revealed that the informants’ pronunciation
did not differ significantly after having received instruction. Particularly the formant
frequency values seemed rather similar between the two sessions, with some exceptions, such
0 50 100 150 200 250
(in ms)
The first session
/ɪ/ /ε/ /æ/ /ʊ/ /u:/
0 50 100 150 200 250
(in ms)
The second session
/ɪ/ /ε/ /æ/ /ʊ/ /u:/
64
as the production of /ɪ/ in the repetition tasks. With respect to duration, there seemed to be
more difference between the first and second session, especially in the production of the front
vowels. Most striking was the finding that while /æ/ had a longer duration than /ε/ in the first
session, it was produced shorter than /ε/ in the second session.
6.2.3 Intra- and interspeaking variation
As the informants are young learners at the onset of L2 acquisition, it is likely that their
productions will show variation. Both the productions of the same informant may vary (i.e.
intraspeaker variation) and the productions between all informants (i.e., interspeaker
variation). The variation may be examined through the standard deviation values, which were
already presented in Table 6.5 for the formant frequencies and in Table 6.6 for duration.
1. Formant frequencies
The standard deviations, which were first presented in Table 6.5, are repeated here in Table
6.7.
Table 6.7. The standard deviation (-Outliers) values of the formant frequencies of the English
repetition (= Rep.) and picture-naming (= P.N.) task in the first and second session (in Hz)
/ɪ/ /ε/ /æ/ /ʊ/ /u:/
F1 F2 F1 F2 F1 F2 F1 F2 F1 F2
Rep. Session 1 43 676 93 448 102 280 49 177 68 303
P.N. Session 1 26 227 83 297 65 96 78 228 30 75
Rep. Session 2 38 543 72 324 81 146 51 240 42 172
P.N. Session 2 52 704 52 347 67 420 55 185 61 269
Similarly to the results of the Dutch vowels, the standard deviations are quite high, as can be
seen in Table 6.7. In general, the standard deviation values of the F1 are smaller than those of
the F2, implying that the informants’ productions vary more in terms of frontness or backness
than in vowel height. This may suggest that the informants are relatively aware of how high or
low they should pronounce a vowel, but are less sure on the vowels’ frontness or backness.
There seems to be no significant difference between the standard deviations from the
first and second session, and between the standard deviations of the repetition and picture-
65
naming tasks, which indicates that the informants’ productions are not influenced by repeating
an L1-model or by instruction with respect to variation. Striking in this case is that the lowest
overall standard deviations are found for the picture-naming task of the first session, implying
that the informants’ vowel productions seem to have varied least when they were not
influenced by the productions of a native speaker or by L2 instruction. However, it should be
remarked there were fewer data for this picture-naming task, as the participants did not readily
remember all the words asked (see Table 8.12 in the Appendices).
2. Duration
The values given in Table 6.8 were first displayed in Table 6.6, and present the standard
deviation values for duration, taking together the results from the repetition and picture-
naming tasks.
Table 6.8. The standard deviation (-Outliers) values of duration (in ms)
/ɪ/ /ε/ /æ/ /ʊ/ /u:/
Session 1 48 50 73 48 37
Session 2 31 45 33 19 49
As opposed to the values of the formant frequencies which did not show any significant
difference between the first and second session, it appears that there is less variation in the
second session than in the first session regarding duration, with the exception of /u:/. Overall,
in both sessions, the variation seems to be relatively small, implying that the informants have
a rather stable L2 production with respect to duration.
This finding may be explained by the assumption that it is likely that the informants do
not have any problems in distinguishing between the general groups of checked and free
vowels, and will accordingly pronounce the vowels either short or long. It is therefore more
interesting to examine whether the informants are able to contrast the vowels accurately from
one another in terms of duration, as was discussed in Section 6.2.2.
6.2.4 Native speakers
To compare the results of the child L2 learners in the present study with the average
productions of adult L1 speakers, it would be interesting to also have a visual representation
66
of the formant values and duration of native speakers of English. The values used were
already discussed in Section 4.3.
1. Formant frequencies
Figure 6.10 presents the formant values of native speakers of American English and RP. The
exact values were presented in Tables 4.1 and 4.2 (see Section 4.3).
Figure 6.10. Formant values of the vowels for American English (data taken from Hillenbrand et al.
1995) and RP (data taken from Hawkins & Midgley 2005)
The formant values of /ɪ/ and /ε/, as presented in Figure 6.10, show a regular pattern for both
American English and RP, with the American English vowel in both cases being more fronted
and lower than the RP production. Naturally, because the difference in accent, the American
English and RP speakers do not produce the vowels with the same acoustic values, but in the
productions of both the American English speakers and the speakers of RP /ɪ/ is more fronted
and produced higher than /ε/. Interesting for this study, however, is the production of /æ/,
which has a different place of articulation in these two accents. Whereas the speakers of
American English produce this vowel roughly in between the productions of /ɪ/ and /ε/, the
speakers of RP produce /æ/ rather backed and lower than the other two front vowels. In the
vowel diagram presented in Section 4.3, the American English production is followed, as /æ/
is produced lower and slightly more backed than /ε/ (Collins et al. 2003).
As opposed to the formant frequencies of the front vowels, the values of the back
vowels seem to follow a more irregular pattern between the different accents of English. Most
200
400
600
800
1000
10001500200025003000
F1
(in
Hz)
F2 (in Hz) American English /ɪ/
American English /ε/
American English /æ/
American English /ʊ/
American English /u:/
RP /ɪ/
RP /ε/
RP /æ/
RP /ʊ/
RP /u:/
67
notable in Figure 6.10 is the articulation of /u:/ of the speakers of RP, which is relatively high
and more front compared to the other back vowels here. However, the data used here are
those of the age group of 20-25 years old and when looking to the mean formant frequencies
of the older age groups, it appears that the older the participants are, the lower the F2 values of
/u:/ become, indicating that they are produced more backed. The same trend is also found for
the F2 values of /ʊ/ for the speakers of RP (Hawkins et al. 2005: 195). Similar to the findings
of the front vowels is that the RP back vowels are on average produced higher than the
American English vowels.
2. Duration
For duration, only data of speakers of American English will be used, as the study of
Hawkings et al. (2005) did not present any duration values for the native speakers of RP.
These results are displayed in Figure 6.11. The exact values were presented in Table 4.1 (see
Section 4.3).
Figure 6.11. The average duration for the English vowels produced by native speakers of American
English (in ms) (data taken from Hillenbrand et al. 1995)
Most striking in Figure 6.11 is that the speakers of American English have the longest
duration for /æ/, which is produced noticeably longer than the long vowel /u:/. The shortest
vowel, although shortly followed by /ɪ/ and /ʊ/, is /ε/, which does not correspond to the results
of the productions of the participants in the present study. Similar to the findings of the
participants is that the vowels /ɪ/ and /ʊ/ differ from each other only minimally.
0 100 200 300 400
(in ms)
/ɪ/
/ε/
/æ/
/ʊ/
/u:/
68
Chapter 7. Discussion
In this chapter the findings of the previous chapter will be discussed in relation to the
proposed hypotheses discussed in Chapter 5. The hypotheses concerned the role of the L1,
age and instruction on the vowel production of young Dutch-speaking learners of English.
Firstly, it was suggested that the L1 of the informants will influence the formant frequencies
and duration of the vowels discussed. The degree of influence depends on the similarity
between the two languages. The second hypothesis stated that although the informants are
young learners, their pronunciation will not necessarily be native-like. However, the young
age of the informants will affect the amount of variation in their productions. Lastly, it was
hypothesized that the vowel production of the informants will not have visibly improved
through the instruction received.
This chapter will be divided according to these three hypotheses. For the hypothesis on
the influence of the L1, the results from the Dutch and English tasks will be taken together for
the discussion, to examine whether or not the predicted vowel pronunciation difficulties were
found in these participants. For the hypothesis on the role of age, the focus will mainly be on
inter- and intraspeaker variation. The findings from these discussions will subsequently be
used for the discussion of the hypothesis on the effects of instruction, where the changes in
vowel production which arose between the first and second session of data collection will be
discussed.
7.1 Differences and similarities between the L1 and L2
One of the aims of this study is to examine if and how the L1 influences the production of the
foreign sounds of a learner’s L2. It was hypothesized, based on Flege’s SLM (1997), that
those L2 vowels which are non-native, but which nevertheless have a close equivalent in the
L1 would form the greatest pronunciation difficulties for the L1 learners. Regarding the
English vowels examined in this research paper, this implies that the informants’ production
of the front vowel /æ/ and the back vowel /ʊ/ will be most problematic. For /æ/, it was
suggested that both its formant frequencies and duration would be influenced by the
pronunciation of the Dutch vowel /ε/. Regarding /ʊ/, it was proposed that this vowel would be
produced similarly to /u:/, and that both these back vowels would be influenced by either the
Dutch vowel /u/ or, albeit less likely, /y/. Finally, it was hypothesized that the two English
69
front vowels /ɪ/ and /ε/ would not be very problematic for the informants in that they would be
able to contrast these two vowels sufficiently, although it was also expected that their
pronunciation would show influence of their L1.
In this section, the productions of the Dutch vowels by the informants will be
compared to those of the English vowels, as presented in Chapter 6. For clarity, the front and
back vowels will be separated into two different parts. First, the formant frequencies and
duration of the Dutch and English front vowels will be compared, followed by the formant
frequencies and duration of the Dutch and English back vowels. The first and second session
will be presented separately too, although they will be discussed together, as the discussion of
the differences between these two sessions is provided in Section 7.3.
7.1.1 The front vowels
1. Formant frequencies
The results of the Dutch tasks indicated that the informants clearly contrasted the vowels /ɪ/
and /ε/ from each other in terms of F1. Both in the first and second session /ɪ/ was produced
higher than /ε/. The contrast in F2 was not as obvious between these two vowels in the first
session, but the results from the second session displayed a greater difference in
pronunciation, with /ɪ/ produced more front than /ε/ (see Table 6.5).
The clear contrast made by the informants between these two Dutch vowels is
visualized in Figure 7.1, which shows every individual vowel production by the informants
during the Dutch tasks.
70
Figure 7.1. The Dutch formant values of Session 1 and 2 by the participants
Although Figure 7.1 shows that some productions of the two vowels partly overlap, the
overall picture indicates that the informants generally are able to maintain a contrast between
these two vowels in their native language. Besides some outlying values, which can be
ascribed to accidental faulty articulations or calculation errors, the vowel /ɪ/ is in general
produced higher than /ε/ by the informants during these two sessions. This may lead to the
conclusion that concerning the informants’ L1, their vowel system already seems reasonably
well established in that they are able to distinguish between two closely pronounced vowels,
according to a regular pattern, which may also carry consequences for the pronunciation of
their L2 vowels.
As it is now ascertained that the informants contrast their L1 front vowels, it seems
also likely that they will be able to distinguish between the English vowels /ɪ/ and /ε/, as it is
hypothesized that the Dutch equivalent vowels will influence their English pronunciation. The
pronunciation of the new English vowel /æ/ will, however, most likely be influenced by the
pronunciation of the Dutch vowel /ε/, as already proposed in Section 5.5.
The mean formant values of the English vowels of the first session (see Table 6.5)
indicated that on average in both tasks /ɪ/ seems to be produced higher than /ε/ and /æ/,
similarly to its Dutch equivalent production. The most consistent production for both the F1
and F2 is found for /ε/, while /æ/ shows most variation between the two tasks. Combined with
the finding that the participants’ L1 front vowel pronunciation seems quite consistent, these
results may indicate that the participants base their pronunciation of the L2 vowels on their
0
200
400
600
800
1000
5001000150020002500
F1
(in
Hz)
F2 (in Hz)
Session 1
/ɪ/ /ε/
0
200
400
600
800
1000
5001000150020002500
F1
(in
Hz)
F2 (in Hz)
Session 2
/ɪ/ /ε/
71
pronunciation of their L1 vowels, which enables them to have a more consistent vowel
production for those vowels which have an equivalent in their L1. Striking in this case then is
the difference in F2 value for /ɪ/ between the repetition and picture-naming task. However, as
discussed in Chapter 6, the F2 values for all the informants’ productions, including their Dutch
productions, are subject to wide variation. In order to ensure that these suggestions are
accurate, all individual vowel productions by the participants made in the first session were
placed into a graph (see Figure 7.2).
Figure 7.2. The formant values of the English front vowels by the participants
As can be seen in Figure 7.2, the hypotheses made concerning the production of the three
English front vowels seems to be confirmed in the case of the informants in this study. The
production of /ɪ/ is rather high and front in both the repetition and picture-naming task, similar
to the informants’ Dutch productions of /ɪ/, and also in line with the native speakers’
productions (see Figure 6.9). The same conclusions can be drawn for the productions of the
English vowel /ε/, which is also produced similarly to the informants’ Dutch /ε/, and is
consistent with the native speakers’ pronunciation, although here there seems to be a slight
0
200
400
600
800
1000
1200
3008001300180023002800
F1
(in
Hz)
F2 (in Hz)
Session 1
Repetition task /ɪ/ Repetition task /ε/ Repetition task /æ/
Picture-naming task /ɪ/ Picture-naming task /ε/ Picture-naming task /æ/
72
difference in pronunciation of the vowel between the repetition and picture-naming task. As
was predicted, the pronunciation of /æ/ by the participants seems to overlap almost
completely with the pronunciation of /ε/, which supports the hypothesis that the informants
are not able to produce a contrast between /ε/ and /æ/, as this contrast does not occur in their
L1. According to Collins et al. (2003), /æ/ in English should be produced lower and slightly
more backed than /ε/.10
Remarkable for all three vowels is that they have relatively many
outlying productions. This may be due to the fact that the informants are beginning learners,
tested at the onset of learning, but this observation may also be due to calculation errors or
mispronunciations, which will be further discussed in Section 7.2.
Now focusing on the second session, the results showed that the formant frequencies
of /ɪ/ and /ε/ were quite consistent, with the F2 values of /ε/ showing slight variation (see
Table 6.5). Also the F1 values of /æ/ seemed constant between the two tasks, while the F2
values again are more varied. It is generally suggested that informants’ pronunciations may be
positively influenced in a repetition task by the speaker model, thereby giving better, but
consequently less representative results (Munro 2008). Therefore, it is a bit surprising here
that the F2 values of /æ/ are higher than /ε/ in the repetition task, implying that they are
produced more front, while /æ/ is generally produced more back than /ε/ by native speakers
(Collins et al. 2003). In the picture-naming task, on the other hand, the contrast seems to be
produced more accurately.
The individual productions of the repetition and picture-naming task of the second
session by the participants were also placed into a graph to offer a better picture on whether
the participants make a distinction in the production of the three English front vowels (see
Figure 7.3).
10
Note that this suggestion is consistent with the mean values of the native speakers of American English
(Hillenbrand et al. 1995), but not with the mean values of the native speakers of RP (Hawkins et al. 2005), who
produce /æ/ in between /ɪ/ and /ε/ (see Figure 6.10).
73
Figure 7.3. The formant values of the English front vowels by the participants
Looking at the results of the repetition task, the informants make a rather obvious contrast in
their productions of /ɪ/ and /ε/, leading to further support for the hypothesis that these two
vowels do not pose a problem for Dutch-speaking learners of English. Concerning /æ/, there
again seems to be overlap with /ε/, but there also seem to be some more accurate productions
here, where /æ/ is produced a bit lower, thus differing from /ε/. The results for /ε/ and /æ/ for
the picture-naming task, on the other hand, clearly show that the informants are not able to
contrast these two vowels, as their productions in this case are virtually identical. Between the
productions of /ɪ/ and /ε/ then, the contrast again appears to be maintained by the informants.
The slight difference in pronunciation of /æ/ between the repetition and picture-naming tasks
supports the suggestion by Munro (2008) that speaking the L2 after a model generally leads to
better results by the L2 learners.
As the results from both the first and second session indicate that the pronunciation of
the English vowel /æ/ is problematic for the informants in the present study in that they are
not able to distinguish between /ε/ and /æ/, it may be interesting to examine to what extent the
Dutch vowels influence the production of /æ/. The results from the English repetition and
0
200
400
600
800
1000
1200
3008001300180023002800
F1
(in
Hz)
F2 (in Hz)
Session 2
Repetition task /ɪ/ Repetition task /ε/ Repetition task /æ/
Picture-naming task /ɪ/ Picture-naming task /ε/ Picture-naming task /æ/
74
picture-naming task for /æ/ and the results of the Dutch task for both /ɪ/ and /ε/ were therefore
placed into a graph (see Figure 7.4).
Figure 7.4. The formant values of /æ/ of the first and second session and the formant values of Dutch
/ɪ/ and /ε/ by the informants
In general, there seems to be little overlap between Dutch /ɪ/ and English /æ/, implying that
the informants do not confuse between these two vowels in their productions. On the other
hand, when comparing the values of Dutch /ε/ and English /æ/, it can be concluded that the
informants consider /æ/ almost as a complete equivalent to Dutch /ε/, as their values partly
cover each other, although it may be also noted that in some cases the informants seem to
produce Dutch /ε/ slightly more backed than /æ/, indicating that their productions are not
entirely identical. Since many productions of /ε/ and /æ/ do nevertheless overlap each other,
the back productions of /ε/ may be considered as accidental or untypical productions of the
informants.
2. Duration
For the discussion of the results of the vowels’ duration, the values as presented in Table 4.1
for the native speakers, and the average (-Outliers) values as presented in Tables 6.3 and 6.4
for the results of the Dutch tasks, and Table 6.6 for the results of the English tasks will be
0
200
400
600
800
1000
1200
05001000150020002500
F1
(in
Hz)
F2 (in Hz)
Session 1 /æ/
Repetition task English /æ/
Picture-naming task English /æ/
Dutch /ɪ/
Dutch /ε/
0
200
400
600
800
1000
1200
05001000150020002500
F1
(in
Hz)
F2 (in Hz)
Session 2 /æ/
Repetition task English /æ/Picture-naming task English /æ/
Dutch /ɪ/ Dutch /ε/
75
used (see Figure 7.5). It was remarked in Chapter 6 that the informants’ English vowel
productions seemed rather inconsistent, displaying different results at both sessions, which did
neither seem to correlate directly with their Dutch productions or with the productions of the
native speakers of English. This irregularity is, however, typical of early L2 speech. Similarly
to the formant frequencies, it was hypothesized that the informants would rely on their Dutch
vowel system for the production of the English vowels, meaning that /ɪ/ and /ε/ would be
similar in length to their Dutch equivalents, and the duration of /æ/ would be similar to /ε/.
Figure 7.5. The duration of the L1 Dutch children’s English vowels of the first session compared to
the results of the Dutch vowels (left) and the results of the native speakers (data taken from
Hillenbrand et al., 1995) (right) (in ms)
When comparing the Dutch and English vowel productions of the informants, it becomes
clear that the informants tend to produce the English vowels longer than their Dutch
equivalents. This finding could be linked to the prediction that L2 learners will often make
use of duration to distinguish between foreign vowels. For instance, the contrast between
English /ɪ/ and /ε/ appears to be longer than the contrast between Dutch /ɪ/ and /ε/. While the
native speakers produce /ɪ/ longer than /ε/, the informants followed the production of the
Dutch vowels and produced /ε/ longer than /ɪ/, thereby supporting the hypothesis on the
influence of the L1.
Striking is the duration of /æ/ produced by the informants, as they produce this vowel
relatively long, as opposed to the hypothesis stating that the informants would not be able to
differentiate between /ε/ and /æ/. By producing /æ/ longer than /ɪ/ and /ε/, the informants seem
0 50 100 150 200 250 300 350(in ms)
Informants /ɪ/ Informants /ε/
Informants /æ/ Native speaker /ɪ/
Native speaker /ε/ Native speaker /æ/
0 50 100 150 200 250
(in ms)
L1 Dutch /ɪ/ L1 Dutch /ε/ L2 English /ɪ/
L2 English /ε/ L2 English /æ/
76
on first sight to display a quite accurate pronunciation of this vowel, as the native speakers’
/æ/ also has a longer duration compared to /ɪ/ and /ε/. In this case too, the hypothesis that
duration is used to distinguish between vowel contrasts which do not occur in the speaker’s
L1 can be applied, although it may also be noted that low vowels are generally produced
longer than high vowels (Clark et al. 1995), implying thus that the participants are in fact
aware of the contrast between /ε/ and /æ/.
To be able to ascertain whether these findings are typical of the informants’
pronunciation or, rather, are accidental findings, the results of the second session will be
discussed too. The values are represented in Figure 7.6.
Figure 7.6. The duration of the L1 Dutch children’s English vowels of the second session compared to
the results of the Dutch vowels (left) and the results of the native speakers (data taken from
Hillenbrand et al. 1995) (right) (in ms)
The duration values from the second session as displayed in Figure 7.6 seem to be more in
line with the expected results, as the influence of the L1 is more strongly visible, especially
with reference to the duration of the new vowel /æ/. Similar to the first session, the informants
produced /ε/ longer than /ɪ/, as opposed to the native speakers, which clearly indicates that the
informants use their knowledge of their L1 for the production of their L2. As hypothesized,
/æ/ is produced almost equally long as /ε/, and is on average even produced shorter in this
session than in the first session, suggesting that the informants are not able to make an
accurate contrast between these two vowels.
0 50 100 150 200 250 300 350
(in ms)
Informants /ɪ/ Informants /ε/
Informants /æ/ Native speaker /ɪ/
Native speaker /ε/ Native speaker /æ/
0 50 100 150 200 250
(in ms)
L1 Dutch /ɪ/ L1 Dutch /ε/ L2 English /ɪ/
L2 English /ε/ L2 English /æ/
77
3. Summary
Regarding the vowels /ɪ/ and /ε/, it can be concluded that the informants on average rely on
their L1 productions for the productions of the English equivalents. In the case of the formant
frequencies, this leads to /ɪ/ being produced more front and higher than /ε/. Because this
contrast is similar in Dutch and in English, it may seem that this contrast is therefore produced
quite accurately by the informants. In the case of duration, on the other hand, the informants
produce /ε/ longer than /ɪ/, as opposed to the native speakers of English, and in this respect,
their vowel productions are hence less native-like.
For the non-native contrast /ε/-/æ/, it was clearly shown that concerning the formant
frequencies the informants were not able to distinguish between these two vowels, supporting
the hypothesis on the influence of the L1. In the case of duration, more ambiguous results
were found. Whereas it seemed in the results of the first session that the informants made an
accurate contrast in duration, the results of the second session indicated that the informants
made hardly any contrast in duration. Since /æ/ is a new vowel for speakers of Dutch, it is
logical that much variation exists in its pronunciation, especially considering that the
informants are beginning learners.
7.1.2 The back vowels
1. Formant frequencies
The Dutch vowels /u/ and /y/ were examined, as they might be of influence for the
informants’ production of /ʊ/ and /u:/. It should nevertheless be noted that in Dutch /y/ is a
front vowel, and therefore less likely to influence the production of English /ʊ/ and /u:/
(Collins et al. 2003). The mean results of the vowels /u/ and /y/ from the Dutch tasks of both
Session 1 and 2 were presented in Tables 6.1 and 6.2, respectively.
According to the results displayed in Tables 6.1 and 6.2, the informants pronounced
/u/ in both cases more back and lower than /y/, although the absolute mean values of the
formants in Session 1 concerning the F1 differed only minimally. The contrast made by the
informants between these two vowels is presented in Figure 7.7, which presents the individual
productions of the informants during Sessions 1 and 2.
78
Figure 7.7. The Dutch formant values of Session 1 and 2 by the participants
While Figure 7.7 indicates that there are some identical productions between /u/ and /y/ made
by the informants, in general it appears that the informants maintain a contrast between these
two vowels in Dutch. Especially /y/ is occasionally produced more back, partly covering the
productions of /u/, but these results could be caused by calculation errors or unintentional
mispronunciations by the informants. These results therefore show that the informants have a
relatively fixed L1 vowel system, in which they clearly distinguish between the front vowel
/y/, on the one hand, and the back vowel /u/, on the other hand. It is hypothesized that
especially this last vowel will influence the productions of the English vowels /ʊ/ and /u:/.
Figure 6.10 showed that the acoustic quality of /ʊ/ and /u:/ of the native speakers of
American English and RP is not identical. For the speakers of RP the contrast between /ʊ/ and
/u:/ was more outspoken than for the speakers of American English, and while the RP
speakers produced /u:/ higher and more front than /ʊ/, the speakers of American English
showed the exact opposite relationship between these two vowels. As the vowel diagrams
provided in Section 4.3 (Collins et al. 2003) seem to follow the findings of the L1 American
English speakers, the results from the informants will be compared to these (i.e., /ʊ/ more
back and lower than /u:/, with only a relatively small contrast).
Looking at the mean results in Table 6.5 for the first session, it seems on first sight
that the informants make an accurate distinction between /ʊ/ and /u:/, as in the repetition and
the picture-naming tasks, both the F1 and F2 values for /u:/ are smaller, indicating that this
vowel is produced higher and more back.
0
200
400
600
800
0100020003000
F1
(in
Hz)
F2 (in Hz)
Session 1
/u/ /y/
0
200
400
600
800
0100020003000
F1
(in
Hz)
F2 (in Hz)
Session 2
/u/ /y/
79
However, because it was remarked in Chapter 6 that much variation exists between the
productions, visible through the high standard deviation values, and because the difference in
value between the F1 and F2 seems relatively small, it seems necessary to examine all
individual productions by the informants too, similarly to the front vowels, to verify whether
the contrast between these vowels is constantly maintained (see Figure 7.8).
Figure 7.8. The formant values of the English back vowels by the participants
More notably than with the front vowels, the informants seem to pronounce the back vowels
differently in the repetition and the picture-naming task. Whereas in the repetition task /ʊ/ is
produced more front and lower and the vowel productions are relatively concentrated in the
vowel space, the majority of the results from the picture-naming task of the same vowel are
more back and higher, but are also more spread over the vowel space. This finding does,
however, not hold for /u:/, for which the repetition task results show wide variation, while the
results from the picture-naming task are more clustered. Nevertheless, it should be noted that
there were few useable data for /u:/ from the picture-naming task (see Table 8.12 in the
Appendices), which may influence the findings. Comparing the results of /ʊ/ and /u:/ from the
repetition task, it can be concluded that the participants in some cases seemed able to produce
an accurate contrast between these two vowels, pronouncing /ʊ/ more front and lower. On the
other hand, the wide variation of the /u:/ productions also indicates that this non-native vowel
poses a rather important problem for the informants, in that they are not able to give /u:/ a
fixed place in their vowel system. Comparing the results of /ʊ/ and /u:/ from the picture-
naming task, the opposite could however be concluded, as the productions of /u:/ are quite
0
200
400
600
800
050010001500200025003000
F1
(in
Hz)
F2 (in Hz)
Session 1
Repetition task /ʊ/ Repetition task /u:/
Picture-naming task /ʊ/ Picture-naming task /u:/
80
constant here. Striking here is that the contrast between the two vowels is smaller in the
picture-naming task than in the repetition task, again suggesting that directly repeating a
model improves the L2 learners pronunciation.
The results from the second session, which were presented in Tables 6.5, similarly to
the first session indicate that the informants correctly distinguish /ʊ/ and /u:/ from each other,
as the mean values of the F1 and F2 of /u:/ are again smaller, which implies that on average
this vowel was produced higher and more back than /ʊ/. As the individual productions from
the first session showed, the mean values do not always give a perfect overview of the
contrasts made by the informants. Therefore, the individual results were again placed into a
graph (see Figure 7.9).
Figure 7.9. The formant values of the English back vowels by the participants
In line with the proposed hypothesis that the informants would have difficulties with the non-
native vowel contrast /ʊ/ and /u:/, the results from the second session of the back vowels show
that many of the informants’ productions overlap each other. For both vowels, the productions
in the repetition task are more front, while they are more back in the picture-naming task. An
explanation for this finding could be that in the picture-naming task, the participants rely more
on their knowledge of their L1 vowel system, as it was noted in Section 4.5 that the Dutch
equivalent vowel /u/ is produced less central than English /ʊ/ and higher than English /u:/. In
order to examine this suggestion, the productions of the second session of the English picture-
naming task and the Dutch task can be compared (see Figure 7.10). It should be noted that this
does not necessarily mean that the productions of the repetition task are more accurate. The
0
200
400
600
800
050010001500200025003000
F1
(in
Hz)
F2 (in Hz)
Session 2
Repetition task /ʊ/ Repetition task /u:/
Picture-naming task /ʊ/ Picture-naming task /u:/
81
values of the vowels in the repetition task show much variation and overlap each other,
suggesting that the informants are not able to perceive an acoustic difference between these
vowels.
Figure 7.10. The values from the second session for the Dutch task and the English picture-naming
task, and values of the American English native speaker (data taken from Hillenbrand et al. 1995)
Figure 7.10 supports the claim that Dutch /u/ is in fact produced more back than English /ʊ/
and more closed or higher than English /u:/, as the comparison of the Dutch productions of the
informants and the mean productions of the native speakers shows. Furthermore, the
hypotheses that the informants will not be able to differentiate between English /ʊ/ and
English /u:/, and that their L1 will influence their L2 productions are supported here too, as
the Dutch /u/, English /ʊ/ and English /u:/ values overlap each other.
2. Duration
For the discussion of duration of the back vowels, the average (-Outliers) values which were
presented in Tables 6.3 and 6.4 for the Dutch vowels, and the average (-Outliers) values
which were presented in Table 6.6 for the English vowels will be used.
0
200
400
600
800
0500100015002000
F1
(in
Hz)
F2 (in Hz)
Informants Dutch /u/ Informants English /ʊ/ Informants English /u:/
American English /ʊ/ American English /u:/
82
Figure 7.11. The duration of the L1 Dutch children’s English vowels of the first session compared to
the results of the Dutch vowels (left) and the results of the native speakers (data taken from
Hillenbrand et al. 1995) (right) (in ms)
Figure 7.11 indicates that the informants are capable of distinguishing between the vowels /ʊ/
and /u:/ in terms of duration, which supports the hypothesis that the informants would mostly
have problems with contrasting the formant frequencies of this vowel contrast (cfr. Supra)
rather than with duration. It was hypothesized that the informants would quite easily notice
the difference in length between the checked vowel /ʊ/ and the free vowel /u:/. It may be
suggested that the duration of the English vowels by the informants is influenced by the
duration of the Dutch vowel /u/, as the absolute values of /ʊ/ and /u:/ are shorter for the
informants than for the native speakers, and similar to Dutch /u/. However, it was noted in
Section 4.2 that duration contrasts should be measured relatively, which implies that the
absolute shorter values do not cause a breakdown in understanding, since the relative contrast
is maintained.
As the results of the second session of the front vowels are different from the results of
the first session, the results of the second session of the back vowels will be discussed too.
They are displayed in Figure 7.12.
0 100 200 300
(in ms)
Informants /ʊ/ Informants /u:/
Native speaker /ʊ/ Native speaker /u:/
0 50 100 150 200 250
(in ms)
L1 Dutch /u/ L1 Dutch /y/
L2 English /ʊ/ L2 English /u:/
0 50 100 150 200 250 300
(in ms)
Informants /ʊ/ Informants /u:/
Native speaker /ʊ/ Native speaker /u:/
0 50 100 150 200 250
(in ms)
Dutch /u/ Dutch /y/
English /ʊ/ English /u:/
83
Figure 7.12. The duration of the L1 Dutch children’s English vowels of the second session compared
to the results of the Dutch vowels (left) and the results of the native speakers (data taken from
Hillenbrand et al. 1995) (right) (in ms)
Most striking in the results displayed in Figure 7.12 is that the mean values of Dutch /u/ and
English /ʊ/ are virtually identical, only differing by two ms, which may denote the influence
of the L1 on the L2. However, since this finding did not occur in the first session for these two
vowels, this result can perhaps not be generalized. It was noted in the discussion on the vowel
duration of the first session that the duration contrast between two vowels is measured
relatively. Again, the informants accurately contrast between /ʊ/ and /u:/ by on average
producing /u:/ longer than /ʊ/, similarly to the native speakers. Noteworthy here is that the
contrast between /ʊ/ and /u:/ is nevertheless more outspoken for the informants than for the
native speakers, which may refer to the hypothesis that L2 learners use duration as a means
for distinguishing between vowel contrasts which do not occur in their L1.
3. Summary
In the discussion of the formant frequencies of /ʊ/ and /u:/ it was noted that hardly any
contrast was found between these vowels in the productions of the informants, whereas in
native English these two vowels are contrasted. A comparison between Dutch /u/ and English
/ʊ/ and /u:/ brought to light that the informants’ L1 influenced their English vowel
productions, as the values largely overlapped each other. This influence was further supported
by the finding that the informants showed better results in the repetition tasks, where they
0 50 100 150 200 250 300
(in ms)
Informants /ʊ/ Informants /u:/
Native speaker /ʊ/ Native speaker /u:/
0 50 100 150 200 250
(in ms)
L1 Dutch /u/ L1 Dutch /y/
L2 English /ʊ/ L2 English /u:/
84
could repeat the words after a native speaker, than in the picture-naming tasks. Furthermore, it
can also be concluded that for the production of English /ʊ/ and /u:/, the informants relied
more on their production of Dutch /u/ than /y/, which is produced more front.
Although in absolute values the L1 was also visible in the L2 productions regarding
the duration of the back vowels, this does not cause similar problems as with the formant
frequencies, since relatively the same contrast was made between /ʊ/ and /u:/. The informants
were able to notice the difference in length between these two vowels, which they accordingly
produced accurately. As the contrast in duration made by the informants was larger than by
the native speakers, it may be suggested that the informants made use of duration rather than
place of articulation to distinguish between the vowels in this non-native vowel contrast.
7.2 The influence of age
Many previous studies suggest that young learners should be able to learn an L2 more easily
than older learners (see Chapter 1). One reason for this is that their sound system is more
flexible and can thus more easily adjust themselves to foreign sounds. In the present study, it
was expected that the informants’ productions would not be native-like yet, as was confirmed
in the previous section, but that the fact that the informants are young learners would prevail
in the production variation they showed individually and compared to the other informants,
both in their L1 and L2.
In the Dutch tasks, for instance, the high standard deviations, which were presented in
Tables 6.1 and 6.2, indicate that there is much variation in the L1 vowel productions,
especially concerning the F2 values. This variation may be interspeaker variation, implying
that all informants vary in their vowel pronunciation, but may also be intraspeaker variation,
implying that there is variation in the vowel productions of individual speakers, which is a
typical tendency for young speakers. Although it was confirmed in the previous section that
the informants already show a rather consistent pronunciation for their L1, as they displayed
quite similar results for their L1 in both sessions, the high standard deviation values also
suggest that the informants’ vowel system is not entirely fixed yet. It was proposed in Flege’s
Interaction Hypothesis (see Section 2.2.2.3) that when a speaker’s sound categories are still
adaptable the learning of foreign sounds will be easier too.
However, as the discussion of the previous section demonstrated, the informants were
in general not able to contrast between those vowel contrasts which do not occur in their L1.
85
Concerning the English front vowels, the informants distinguished /ɪ/ and /ε/ similarly as the
native speakers, but this could be explained by the fact that this contrast also occurs in Dutch.
However, the exact place of articulation differed from that of the native speakers, as the Dutch
vowels /ɪ/ and /ε/ are not pronounced exactly the same as their English equivalents. The
influence of the L1 on the production of these vowels became most prevalent in the
examination of the vowels’ duration. For the production of the non-native vowels /æ/, /ʊ/ and
/u:/, the informants relied on the productions of those Dutch vowels which sound similar, but
do in fact differ in pronunciation.
These findings suggest that the young learning age of the informants did not have a
particularly noteworthy advantageous effect on their L2 phonology acquisition, as
hypothesized. It was suggested that besides early acquisition, also experience together with
sufficient use of the L2, and language immersion may positively affect the L2 pronunciation.
Since the informants were beginning learners and were not immersed in an L2-speaking
environment, their L2 production did consequently not directly profit from their young
learning age.
Returning to the possibility of inter- and intraspeaker variation concerning the English
vowels, it was expected that because the informants were at the onset of L2 learning, their
productions would also show greater inter- and intraspeaker variation, which was supported
by the finding of the high standard deviation values for the English vowels, which were
presented in Table 6.5 for the formant frequencies.
Surprisingly, it appears that those vowels which have an equivalent in Dutch (i.e., /ɪ/
and /ε/) have the highest standard deviation values, implying that the informants show high
variation in the productions of the vowels, while the new vowels (i.e., /æ/ and /ʊ/) show
accordingly less variation. It was expected that the informants would have more difficulty
with the new vowels and thus show a less stable pronunciation. The L1 influence may explain
why the inter- and intraspeaker variation is high in the informants’ production of /ɪ/ and /ε/,
but does not explain why the variation is relatively lower for the new vowels. Although, it
should be remarked that the variation in the production of the new vowels is still high in
absolute terms.
In order to discuss whether the variation is mostly between speakers or within
speakers, the individual results for the vowel /ɪ/ are placed into a graph, as the variation for
this vowel is the highest overall. Because the data from the picture-naming task for this vowel
were not complete, only the results from the repetition task will be discussed, as these provide
a more total picture (see Figure 7.13).
86
Figure 7.13. The formant frequencies of /ɪ/ of the repetition tasks
These formant values displayed in Figure 7.13 show that there is both inter- and intraspeaker
variation found in the productions by the informants. Especially in the first session, the
majority of the informants seem to have a quite consistent pronunciation, although some
participants’ productions also vary between the different stimuli (i.e., informant 1, 4 and 8). In
the second session, the high standard deviation values are most likely caused by intraspeaker
variation, as there is almost no interspeaker variation in this case. The fact that the first
session presents more interspeaker variation, while the second session presents more
intraspeaker variation is probably due to coincidence. However, these results do clearly
support the hypothesis that beginning learners’ vowel systems are still variable, as the
informants in this study differ both from each other in their pronunciation and in their own
pronunciations.
7.3 Effects of instruction
As it was argued that L2 learners’ pronunciation generally only benefits from instruction
when the learners receive instruction that focuses also on pronunciation besides vocabulary
and grammar, it was expected that the informants in this study will not show any significant
0
200
400
600
800
0100020003000
F1
(in
Hz)
F2 (in Hz)
Session 1
Informant 1 Informant 2 Informant 3
Informant 4 Informant 5 Informant 6
Informant 7 Informant 8 Informant 9
0
200
400
600
800
0100020003000
F1
(in
Hz)
F2 (in Hz)
Session 2
Informant 1 Informant 2 Informant 3
Informant 4 Informant 5 Informant 6
Informant 7 Informant 8 Informant 10
87
improvement of their vowel productions in the second session. Furthermore, no consensus has
been made yet on which form of instruction leads to the best results. Moreover, the amount of
instruction the informants had received at the time of the second testing was quite limited (see
Section 5.2.1). However, it may be possible that the informants did show some differences in
production, as their language awareness may have evolved between the two sessions by
having received English instruction. To discuss the effects of instruction, the results from the
first session and the second session will be compared, for both the formant frequencies and
duration.
1. Formant frequencies
For the discussion of the difference between the pre-instruction and post-instruction
productions of the informants, the findings from the two previous sections have to be taken
into account. First, it was noted that for the vowels which are similar in Dutch and English the
informants used their L1 vowel pronunciation for the production of the L2 vowels, while the
new vowels were influenced by the production of the closest vowels. Secondly, the
informants’ vowel system is very variable. Note that the standard deviation values presented
in Table 6.5 show that the variation in production by the informants is not lower in the second
session than in the first session. Often it seems as if there is even more variation in the second
session. Furthermore, as a graph containing all individual results would give a rather
confusing presentation, the mean values of the vowels as presented in Chapter 6 will be used.
They are displayed in Figure 7.14.
88
Figure 7. 14. The average (-Outliers) values of the formant frequencies of the English vowels by the
informants (Rep. = repetition task; P.N. = picture-naming task)
As expected, relatively few differences are found in the comparison of the formant values of
the first and second session. Most striking is the difference in place of articulation of /ɪ/ in the
repetition and picture-naming tasks in the first session, whereas this same vowel’s
pronunciation in the second session is almost similar. However, as the English /ɪ/ is usually
produced somewhat more back than Dutch /ɪ/, this finding would suggest that /ɪ/ in the first
repetition task may be produced more accurately than in the second repetition task. This
implication would indicate that instruction did not have any beneficiary effects for the English
pronunciation of the informants. The same finding goes for /ε/, which is usually produced
more front and close in English than in Dutch. Similarly to /ɪ/, it seems as if the results from
the first session are more native-like than those of the second session. The new vowel /æ/ is in
both sessions produced closely to /ε/, and the back vowels’ results do not seem to vary
significantly between the two session, which all suggests that the informants’ pronunciation
did not improve notably through instruction.
2. Duration
As in Chapter 6, the results from the repetition and picture-naming task will be taken together
for the discussion of the vowels’ duration. Furthermore, it was already established in Chapter
6 that the informants’ productions regarding duration were reasonably fixed, not showing
200
400
600
800
5001000150020002500
F1
(in
Hz)
F2 (in Hz)
Session 1
Rep. /ɪ/ Rep. /ε/ Rep. /æ/ Rep. /ʊ/
Rep. /u:/ P.N. /ɪ/ P.N. /ε/ P.N. /æ/
P.N. /ʊ/ P.N. /u:/
200
400
600
800
5001000150020002500
F1
(in
Hz)
F2 (in Hz)
Session 2
Rep. /ɪ/ Rep. /ε/ Rep. /æ/ Rep. /ʊ/
Rep. /u:/ P.N. /ɪ/ P.N. /ε/ P.N. /æ/
P.N. /ʊ/ P.N./u:/
89
much variation (within the same session). Similarly to the formant frequencies, the
conclusions on duration made in the previous sections will be taken into account. For
duration, these are mainly the findings that the informants had most problems with the relative
length of the front vowels, while they seemed able to produce the back vowels more
accurately. In addition, it also appeared that the informants used duration to distinguish
between the non-native vowel contrasts. Figure 7.15 displays the results from Session 1 and 2.
Figure 7. 15. The average (-Outliers) values of the English vowels’ duration by the informants
Again the hypothesis that instruction had no effect on the informants’ production seems to be
confirmed here. In general, the contrasts in duration made by the informants in the first
session were repeated in the second session, with the exception of /ε/-/æ/. This means that for
the front vowels /ɪ/ and /ε/, which were contrasted in a non-nativelike way by the informants
in the first session, no correction in pronunciation is made in the second session. For the
contrast /ε/-/æ/, it appears that the production in the first session was even more native-like
than the one in the second session (see Figure 7.5 for comparison with the results from the
native-speaker). Also for the back vowels, it may be noted that the results from the first
session are more native-like than those from the second session, as it was observed that the
relative contrast in duration in the second session between /ʊ/ and /u:/ is longer than that of
the native speakers (see Figure 7.6 for comparison). Accordingly, it may be argued that L2
instruction did not affect the informants’ pronunciation of the English vowels.
0 50 100 150 200 250
(in ms)
Session 2
/ɪ/ /ε/ /æ/ /ʊ/ /u:/
0 50 100 150 200 250
(in ms)
Session 1
/ɪ/ /ε/ /æ/ /ʊ/ /u:/
90
3. Summary
The findings from the comparison of the first and second session seem to suggest that the
informants have a better pronunciation before the start of instruction than after having
received instruction. It would, however, be more logical that the informants are simply not
aware of how to pronounce the English vowels, and thus rely on their knowledge of their L1.
As the informants’ L1 vowels still show some variation, this could consequently also
influence the productions of the L2, causing the values of the vowels to differ between the
first and second session. In other words, the differences found between the first and second
session are not influenced by the L2 instruction the informants received, but rather by their
varying L1 vowel system. Moreover, there is no immediate explanation for why the
participants showed seemingly more native-like productions in the first session than in the
second session.
91
Conclusion
This research paper aimed to investigate three issues concerning native and non-native vowel
contrasts in the field of early L2 phonology. In particular, the influence of age, of the L1, and
of instruction on the production of the English front vowels /ɪ/-/ε/-/æ/ and back vowels /ʊ/-/u:/
by young L1 Dutch learners of English was examined, both in terms of qualitative and
quantitative vowel aspects.
Firstly, the question whether young learners can easily acquire foreign speech sounds
was addressed. It was hypothesized that, although much research has indicated that early
exposure positively affects L2 acquisition, the participants in the present study would not
show accurate L2 pronunciation yet, as they were studied at the onset of learning, and were
not immersed in an English-speaking environment. However, it was suggested that the
participants would show both intra- and interspeaker variation in their vowel productions,
because of their relatively young age and because they were beginning learners.
Secondly, the influence of the L1 on L2 vowel contrasts was examined. It was
expected that the participants would have few problems with contrasting between /ɪ/ and /ε/,
as this is a native vowel contrast, but would have more difficulties with the contrasts /ε/-/æ/,
and /ʊ/-/u:/, as these vowel contrasts do not occur in Dutch. In addition, it was hypothesized
that all vowels would to a certain degree be influenced by equivalent L1 vowels both
qualitatively and quantitatively.
Thirdly, the issue whether learners’ L2 vowel productions benefit from L2 instruction
was examined. However, since the participants in the present study had received only a small
amount of English instruction at the time of testing, it was not expected that the vowel
productions by the informants would have changed drastically.
These three hypotheses were evaluated using the results obtained from an experiment
on the production of Dutch and English vowels by young learners. The participants were all
Dutch-speaking beginning learners of English, and were tested both before they had received
any instruction and after they had received a small amount of instruction.
With respect to the age of L2 learning, the results revealed that the participants were
not able yet to contrast the English vowels accurately, which suggests that besides early
exposure other factors may be of equal or even more significance in L2 phonology
acquisition. However, the results showed that the participants varied much in both their L1
and L2 vowel productions, although they were in general able to maintain the same contrasts
92
between the vowels, especially in their L1. Surprising for the variation in the L2 productions
was the finding that the participants varied most in their pronunciation of the similar vowels,
rather than the new vowels, which may imply that the participants are able to make a
distinction between, for example, /ε/ and /æ/. This implication is, however, contradicted by
the results, which indicate that the participants in general do not contrast between these
vowels.
In the examination of the results concerning the formant frequencies and duration of
the vowels, it was found that the L1 exerted much influence on the production of the English
vowels by the participants. The participants managed to contrast /ɪ/ and /ε/ accurately in terms
of formant frequencies, but this was mainly due to the fact that they contrasted these vowels
similarly in Dutch. This did, however, not hold for the duration of these vowels, which were
contrasted differently in Dutch and English. The participants were not able to distinguish
accurately between the vowel contrasts /ε/-/æ/ and /ʊ/-/u:/ in terms of formant frequencies.
For duration, /ε/ and /æ/ were contrasted correctly in the first session, but not in the second
session, which suggests that the participants were indeed not aware of a contrast between
these vowels. The back vowels /ʊ/ and /u:/ did, on the other hand, not pose any problems
concerning duration.
Lastly, the results from the first and second session did not reveal any significant
differences, implying that the participants’ English vowel production had not changed much
after having received instruction.
Nevertheless, a number of limitations should be noted regarding the findings in this
research paper. First of all, the number of participants in the present study was quite small,
which may have influenced the interpretations of the results. Furthermore, it should be
remarked that not all data could be used, limiting the number of available results. Finally,
because the participants had received only a small amount of instruction, the third issue
concerning the role of instruction on L2 phonology could not be answered sufficiently.
Therefore, it would be interesting for further research to examine the role of
instruction on the L2 phonology of young learners who have received instruction over a
longer period of time, preferably with additional focus on pronunciation, besides vocabulary
and grammar. Moreover, it would also be interesting to investigate the role of duration in L2
non-native vowel contrasts more deeply.
In summary, the findings in this research paper suggested that when no specific
attention is given to early learners’ L2 phonology, the learners will not be able to accurately
contrast between both native and non-native vowels.
93
References
Adank, Patti, Roeland van Hout & Roel Smits. 2004. An acoustic description of the vowels of
Northern and Southern Standard Dutch. Journal of the Acoustical Society of America
116(3). 1729–1738.
Anderson, Raquel. 2004. Phonological acquisition in preschoolers learning a second language
via immersion: a longitudinal study. Clinical Linguistics & Phonetics 18(3). 183-210.
Baker, Wendy & Pavel Trofimovich. 2005. Interaction of native- and second language vowel
system(s) in early and late bilinguals. Language and Speech 48(1). 1-27.
Baker, Wendy, Pavel Trofimovich, Molly Mack, & James E. Flege. 2002. The effect of
perceived phonetic similarity on non-native sound learning by children and adults. In
S. Fisch, B. Scarabel, & A. H. Do (eds.), Proceedings of the 26th Boston University
Conference on Language Development, 36-47. Somerville, MA: Cascadilla Press.
Baker, Wendy, Pavel Trofimovich, James E. Flege, Molly Mack & Randall Halter. 2008.
Child–adult differences in second-language phonological learning: The role of cross-
language similarity. Language and Speech 51(4). 317-342.
Binghadeer, Nora A. 2010. The Effects of Age and Input Enhancement on L2 Vowel
Production: A Longitudinal Study. TESOL Journal 3. 64-80.
Boersma, Paul & David Weenink. 2012. Praat: doing phonetics by computer [Computer
program]. Version 5.3.13, retrieved 13 April 2012 from http://www.praat.org/
Bohn, Ocke-Schwen & James E. Flege. 1992 The production of new and similar vowels by
adult German learners of English. Studies in Second Language Acquisition 14. 131-158.
Bongaerts, Theo, Chantal van Summeren, Brigitte Planken & Erik Schils. 1997. Age and
ultimate attainment in the pronunciation of a foreign language. Studies in Second
Language Acquisition 19. 447-465.
94
Cebrian, Juli. 2006. Experience and the use of non-native duration in L2 vowel categoriation.
Journal of Phonetics 34. 372-387.
Clark, John & Colin Yallop. 1995. An introduction to phonetics and phonology. 2nd
edn.
Oxford: Blackwell
Collins, Beverley & Inger M. Mees. 2003. The phonetics of English and Dutch. 5th
edn.
Leiden: Brill.
Derwing, Tracey M. & Marian J. Rossiter. 2003. The effects of pronunciation instruction on
the accuracy, fluency, and complexity of L2 accented speech. Applied Language
Learning 13(1). 1-17.
Derwing, Tracy M., Murray J. Munro & Grace Wiebe. 1998. Evidence in favor of a broad
framework for pronunciation instruction. Language Learning 48(3). 393-410.
Eckman, Fred R. 2004. From phonemic differences to constraint rankings: Research on
second language phonology. Studies in Second Language Acquisition 26. 513-549.
Eckman, Fred R., Abdullah Elreyes & Gregory K. Iverson. 2003. Some principles of second
language phonology. Second Language Research 19(3). 169-208.
Escudero, Paola, Rachel Hayes-Harb & Holger Mitterer. 2008. Novel second-language words
and asymmetric lexical access. Journal of Phonetics 36. 345-360.
Flege, James E. 1987. The production of “new” and “similar” phones in a foreign language:
Evidence for the effect of equivalence classification. Journal of Phonetics 15. 47-65.
Flege, James E. 1995. Second-language speech learning: Theory, findings and problems. In
W. Strange (ed.), Speech perception and linguistic experience: Issues in cross-
language research, 229-273. Timomium, MD: York Press.
95
Flege, James E. 1997. English vowel production by Dutch talkers: More evidence for the
“new” vs “similar distinction”. In Allan R. James & Jonathan Leather (eds.), Second-
Language Speech, Structure and Process, 11-52. Berlin: Mouton de Gruyter.
Flege, James E. & James Hillenbrand. 1984. Limits on phonetic accuracy in foreign language
speech production. Journal of the Acoustical Society of America 76(3). 708-721.
Flege, James E., Ocke-Schwen Bohn and Sunyoung Jang. 1997. Effects of experience on non-
native speakers’ production and perception of English vowels. Journal of Phonetics 25.
437-470.
Flege, James E., Grace H. Yeni-Komshian & Serena Liu. 1999. Age constraints on second-
language acquisition. Journal of Memory and Language 41. 78-104.
Hawkins, Sarah & Jonathan Midgley. 2005. Formant frequencies of RP monophthongs in four
age groups of speakers. Journal of the International Phonetic Association 35(2). 183-
199.
Hillenbrand, James, Laura A. Getty, Michael J. Clark, & Kimberlee Wheeler. 1995. Acoustic
characteristics of American English vowels. Journal of the Acoustical Society of
America 97(5). 3099-3111.
Holleman, Esther. 2011. The English front vowels in early second language phonology: A
study on the production of English /ɛ/ and /æ/ by young native speakers of Dutch.
Ghent: University of Ghent Bachelor paper.
Højen, Anders & James E. Flege. 2006. Early learners' discrimination of second-language
vowels. Journal of the Acoustical Society of America 119(5). 3072-3084.
Ioup, Georgette. 2008. Exploring the role of age in the acquisition of a second language
phonology. In Jette G. Hansen Edwards & Mary L. Zampini (eds.), Phonology and
Second Language Acquisition, 41-62. Amsterdam: Benjamins.
96
Isaacs, Talia. 2009. Integrating Form and Meaning in L2 Pronunciation Instruction. TESL
Canada Journal 27(1). 1-12.
Kennedy, Sara & Pavel Trofimovich. 2010. Language awareness and second language
pronunciation: A classroom study. Language Awareness 19(3). 171-185.
Keys, Kevin. 2002. First language influence on the spoken English of Brazilian students. ELT
Journal 56(1). 41-46.
Lord, Gillian. 2010. The combined effects of immersion and instruction on second language
pronunciation. Foreign Language Annals 43(3).488-503.
MacLeod, Andrea A.N. & Carol Stoel-Gammon. 2010. What is the impact of age on the
second language acquisition on the production of consonants and vowels among
childhood bilinguals? International Journal of Bilingualism 14(4). 400-421.
Major, Roy C. 2008. Transfer in second language phonology: A review. In Jette G. Hansen
Edwards & Mary L. Zampini (eds.), Phonology and Second Language Acquisition, 63-
94. Amsterdam: Benjamins.
Marinova-Todd, Stefka H., D. Bradford Marshall & Catherine E. Snow. 2000. Three
misconceptions about age and L2 learning. TESOL Quarterly 34. 9-34.
Munro, Murray J. 2008. Foreign accent and speech intelligibility. In Jette G. Hansen Edwards
& Mary L. Zampini (eds.), Phonology and Second Language Acquisition, 193-218.
Amsterdam: Benjamins.
Munro, Murray J. & Tracey M. Derwing. 2008. Segmental acquisition in adult ESL learners:
A longitudinal study of vowel production. Language Learning 58(3). 479-502.
Munro, Murray J., James E. Flege & Ian R. A. MacKay. 1996. The effects of age of second
language learning on the production of English vowels. Applied Psycholinguistics 17.
313-334.
97
O'Brien, Mary Grantham & Laura Catharine Smith. 2010. Role of first language dialect in the
production of second language German vowels. International Review of Applied
Linguistics in Language Teaching 48(4). 297-330.
Pennington, Martha. 1996. Phonology in English language teaching. New York: Longman.
Perkell, Joseph S., Frank H. Guenther, Harlan Lane, Melanie L. Matthies, Ellen Stockmann,
Mark Tiede & Majid Zandipour. 2004. The distinctness of speakers' productions of
vowel contrasts is related to their discrimination of the contrasts. Journal of the
Acoustical Society of America 116(4). 2338-2344.
Piske, Thorsten, Ian R. A. MacKay & James E. Flege. 2001. Factors affecting degree of
foreign accent in an L2: a review. Journal of Phonetics 29. 191-215.
Saito, Kazuya. 2011. Examining the role of explicit phonetic instruction in native-like and
comprehensible pronunciation development: An instructed SLA approach to L2
phonology. Language Awareness 20(1). 45-59.
Snow, Catherine E., and Marian Hoefnagel-Hohle. 1978. The critical period for language
acquisition: Evidence from second language learning. Child Development 49. 1114-1128.
Tops, Guy A. J., Xavier Dekeyser, Betty Devriendt & Steven Geukens. 2001. Dutch speakers.
In Michael Swan & Bernard Smith (eds.), Learner English, Second Edition: A
Teacher’s Guide to Interference and other Problems, 1-22. Cambridge: Cambridge
University Press.
Trofimovich, Pavel & Elizabeth Gatbonton. 2006. Repetition and focus on form in processing
L2 Spanish words: Implications for pronunciation instruction. The Modern Language
Journal 90(4). 519-535.
Tsukada, Kimiko, David Birdsong, Ellen Bialystok, Molly Mack, Hyekyung Sung & James E.
Flege. 2005. A developmental study of English vowel production and perception by
native Korean adults and children. Journal of Phonetics 33. 263-290.
98
Verhoeven, Jo & Christophe Van Bael. 2002. Akoestische kenmerken van de Nederlandse
klinkers in drie Vlaamse regio’s. Taal en Tongval, Tijdschrift voor Taalvariatie 54(1).
1-23.
Wong, Richard Kwok-Shing and Brian MacWhinney. 2009. Integrating teaching practice with
developmental norms: The case of phonological teaching in L2. International Journal of
Early Years Education 17(1). 17-31.
Zampini, Mary L. 2008. L2 speech production research: Findings, issues, and advances. In
Jette G. Hansen Edwards & Mary L. Zampini (eds.), Phonology and Second Language
Acquisition, 219-249. Amsterdam: Benjamins.
99
Appendices
Dutch picture-matching task stimuli
Table 8.1. Dutch picture-matching task stimuli
Target vowel Stimuli
/ɑ/ kat, zak
/a:/ kaas, haas
/ε/ bed, zes
/ɪ/ kip, vis
/i/ tien, fiets
/ɔ/ kop, bos
/u/ voet, boek
/u/ mug, bus
/y/ muur, vuur
/e/ zeep, beek
/o/ boot, spook
/ø/ neus, reus
English repetition and picture-naming task stimuli
Table 8.2 English repetition and picture-naming task stimuli
Target vowel Stimuli
/ɑː/ bath, heart, card, cars
/ε/ bed, head, pen
/æ/ cat, hat
/ɪ/ pig, six
/i:/ sheep, keys
/ɔː/ ball, fork
/ʊ/ book, foot
/u:/ pool, goose, shoes
/ɜː/ girl, bird
/ʌ/ duck, bus
/ɒ/ dog, sock(s)
100
Dutch picture-matching task results
Table 8.3. The results per participant for /ɪ/ (Duration in ms; F1 and F2 in Hz)
Informant Session 1 Session 2
Duration F1 F1 Duration F1 F2
1 109 528 1560 130 517 2066
1 148 500 2121 112 525 2354
1 84 542 1489 - - -
2 123 595 1805 205 532 1695
2 164 526 1959 145 610 1746
2 105 610 2157 - - -
3 109 509 924 138 477 1764
3 233 509 924 97 665 2193
4 144 544 2143 173 512 1918
4 117 595 2143 168 564 2164
5 91 527 1925 87 600 1311
6 98 581 1746 103 464 1298
6 134 510 612 118 583 2054
6 163 586 635 - - -
7 87 454 2027 133 535 1531
7 144 485 1887 140 552 1979
8 78 507 2153 147 498 832
8 119 555 2344 84 616 2528
8 77 538 2347 - - -
9 102 612 687 - - -
9 73 520 903 - - -
10 - - - 98 578 1741
10 - - - 81 511 1856
101
Table 8.4. The results per participant for /ε/ (Duration in ms; F1 and F2 in Hz)
Informant Session 1 Session 2
Duration F1 F2 Duration F1 F2
1 100 609 1483 165 647 1768
1 131 205 743 - - -
1 106 637 1700 - - -
2 87 535 1779 222 668 1295
2 177 576 1752 164 723 1431
2 158 606 1790 - - -
3 134 682 1321 155 681 1325
3 164 671 1777 166 702 1632
4 150 645 2009 237 712 2102
4 115 637 1876 205 756 1912
4 - - - 189 728 1874
5 183 933 1821 147 767 1097
5 69 658 1815 130 461 772
6 193 710 1654 162 734 1541
6 115 712 1613 174 697 1548
6 161 657 1580 - - -
7 109 679 1745 139 692 1783
7 123 693 1675 115 466 1751
8 69 710 2086 133 288 780
8 93 758 1991 178 824 1291
8 89 733 2079 - - -
9 92 778 873 - - -
9 117 434 917 - - -
9 172 918 2372 - - -
10 - - - 113 675 1827
10 - - - 116 692 1752
102
Table 8.5. The results per participant for /u/ (Duration in ms; F1 and F2 in Hz)
Informant Session 1 Session 2
Duration F1 F2 Duration F1 F2
1 976 335 835 111 440 892
1 134 264 1005 114 435 1293
1 79 338 1598 - - -
2 81 352 911 154 522 1042
2 117 316 777 149 349 844
2 116 309 701 - - -
3 94 456 613 101 347 922
3 126 403 587 121 309 887
3 118 526 1301 - - -
4 76 336 476 155 502 973
4 142 435 599 141 410 847
4 113 412 1472 - - -
5 138 417 792 133 502 1243
5 184 323 910 92 459 715
6 198 413 860 140 398 868
6 136 494 1026 141 472 1183
6 207 316 958 - - -
7 112 313 862 144 321 855
7 111 334 666 - - -
8 72 523 933 130 448 1036
8 74 513 782 - - -
8 67 439 1353 - - -
9 95 472 810 - - -
9 111 466 659 - - -
9 111 439 662 - - -
10 - - - 111 436 1190
10 - - - 97 479 1184
103
Table 8.6. The results per participant for /y/ (Duration in ms; F1 and F2 in Hz)
Informant Session 1 Session 2
Duration F1 F2 Duration F1 F2
1 137 645 1882 223 391 1983
1 216 526 1881 231 419 1994
1 190 462 1949 - - -
2 184 297 1625 253 348 2065
2 198 287 2242 281 313 1805
2 281 319 2124 - - -
3 142 456 2617 233 285 1645
3 292 172 525 262 261 1184
3 258 515 2543 - - -
4 180 492 2144 213 333 924
4 214 521 1444 291 449 1855
4 241 351 2569 - - -
5 161 448 2382 190 365 745
5 225 462 2250 251 424 1315
6 111 337 1158 257 535 1850
6 193 320 923 246 462 2427
7 92 423 2060 184 290 1249
7 252 412 2091 163 291 2007
7 206 444 1746 - - -
8 127 372 2131 219 445 2068
8 71 352 1797 261 507 2335
8 111 304 2153 - - -
9 178 523 620 - - -
9 128 430 1817 - - -
9 201 604 680 - - -
10 - - - 153 333 1976
10 - - - 166 450 2467
104
English repetition and picture-naming task results
Table 8.7. The results per participant for /ɪ/ (Duration in ms; F1 and F2 in Hz) (Dur. = duration)
Repetition task Picture-naming task
Informant Session 1 Session 2 Session 1 Session 2
Dur. F1 F2 Dur. F1 F2 Dur. F1 F2 Dur. F1 F2
1 131 504 1250 164 502 649 109 581 666 - - -
1 227 587 2518 - - - 102 533 570 - - -
2 146 546 2395 186 492 2427 88 624 2240 177 501 2448
2 227 504 2555 - - - 136 561 2365 - - -
3 165 569 942 214 464 4354 111 570 857 234 597 832
3 138 509 1388 151 597 2048 - - - - - -
4 132 529 2282 140 575 2171 230 525 2596 200 579 2385
4 373 484 728 202 505 1123 259 466 2595 - - -
5 101 547 2305 126 484 2463 237 500 2765 137 567 2213
5 239 448 2398 108 149 643 169 511 2432 - - -
6 166 511 1097 135 491 2140 178 635 2082 - - -
6 150 496 946 117 468 2273 - - - - - -
7 133 539 2218 125 504 2437 93 501 2393 116 461 2609
7 85 455 2518 115 536 2190 132 554 2279 - - -
8 93 554 2203 120 548 918 94 549 2342 139 549 554
8 167 553 1388 292 525 3003 205 530 1987 - - -
9 129 601 1074 - - - - - - - - -
9 150 788 1147 - - - - - - - - -
10 - - - 152 516 2466 - - - 142 482 2788
10 - - - 161 539 2492 - - - - - -
105
Table 8.8. The results per participant for /ε/ (Duration in ms; F1 and F2 in Hz) (Dur. = duration)
Repetition task Picture-naming task
Informant Session 1 Session 2 Session 1 Session 2
Dur. F1 F2 Dur. F1 F2 Dur. F1 F2 Dur. F1 F2
1 215 622 1429 235 613 1878 - - - 262 785 1952
1 191 645 2364 193 699 2082 - - - 173 428 916
1 154 769 1329 186 643 2087 - - - 258 523 2014
2 216 658 2058 233 599 2014 63 654 1928 149 543 1650
2 107 554 2243 172 826 2104 72 651 1864 115 805 1774
2 - - - 222 740 2078 - - - - - -
3 200 697 2047 172 720 1812 257 556 1002 116 745 1749
3 145 653 698 205 631 1444 176 749 1975 311 725 1899
3 213 658 2308 286 635 1873 103 714 897 160 739 1783
4 220 808 1910 265 719 910 283 546 2057 294 728 1006
4 139 569 1809 195 331 724 263 587 2097 212 826 2173
4 365 669 2024 208 760 2091 134 530 1600 228 693 2040
5 322 693 2060 237 726 2071 246 517 659 285 697 877
5 234 689 2124 114 329 697 200 719 1919 260 678 1960
5 234 689 2124 266 820 2072 131 768 1844 134 713 1945
6 226 560 983 301 553 1134 140 678 1693 312 681 1968
6 191 809 853 182 676 1816 238 664 725 339 550 118
6 368 554 892 298 686 1704 - - - 162 727 1156
7 171 733 2104 162 762 1888 180 613 2179 207 693 1782
7 149 608 2226 216 732 2077 234 559 1990 225 674 1872
7 226 793 2056 - - - 151 536 2178 130 817 1465
8 200 707 2234 197 785 845 233 774 2113 214 625 694
8 150 731 1915 150 816 833 206 449 1747 203 685 993
8 189 757 2170 214 750 756 - - - 121 770 2353
9 121 817 1128 - - - - - - - - -
9 237 781 1054 - - - - - - - - -
9 149 922 931 - - - - - - - - -
10 - - - 170 745 1950 - - - 183 756 1824
10 - - - 164 874 1932 - - - 208 745 1930
10 - - - 191 729 2039 - - - 118 728 1691
106
Table 8.9. The results per participant for /æ/ (Duration in ms; F1 and F2 in Hz) (Dur. = duration)
Repetition task Picture-naming task
Informant Session 1 Session 2 Session 1 Session 2
Dur. F1 F2 Dur. F1 F2 Dur. F1 F2 Dur. F1 F2
1 149 839 1877 251 7760 2012 - - - 194 709 1800
1 150 678 1167 104 638 1751 - - - - - -
1 226 840 1656 - - - - - - - - -
2 82 875 2007 176 644 1878 - - - 189 772 1764
2 189 708 2127 194 799 1638 - - - 193 649 1975
3 270 699 1964 213 745 1737 196 509 960 159 776 1794
3 147 719 1172 155 683 1738 171 728 939 288 684 1683
4 396 706 2014 221 756 1925 307 575 1958 226 785 2141
4 306 623 1967 154 808 2081 316 596 1943 262 714 1959
5 285 751 2040 233 961 2011 397 658 1913 218 782 796
5 174 632 2176 145 851 2079 285 594 970 305 702 1958
6 212 795 1731 279 665 1726 - - - 319 598 1080
6 272 536 1744 203 684 1933 - - - 287 594 864
7 177 760 2008 164 835 1926 111 838 1898 167 619 1720
7 132 749 2003 161 751 2087 141 712 2021 169 735 1877
8 214 1003 1490 204 747 865 135 752 2152 294 417 735
8 124 788 1961 140 1090 1821 - - - 228 479 499
9 228 953 1311 204 832 2014 - - - - - -
9 128 1030 1233 154 728 2044 - - - - - -
10 - - - - - - - - - 176 762 1972
10 - - - - - - - - - 214 758 1885
107
Table 8.10. The results per participant for /ʊ/ (Duration in ms; F1 and F2 in Hz) (Dur. = duration)
Repetition task Picture-naming task
Informant Session 1 Session 2 Session 1 Session 2
Dur. F1 F2 Dur. F1 F2 Dur. F1 F2 Dur. F1 F2
1 92 477 1357 121 609 1623 126 523 1284 164 537 1623
1 147 515 970 97 541 1309 - - - - - -
2 151 549 1033 137 529 1020 189 351 841 122 396 934
2 216 503 1046 136 486 860 152 330 830 132 391 931
3 108 656 1175 73 411 1082 169 339 856 100 319 877
3 142 591 1220 130 466 1270 150 420 738 138 462 792
4 254 427 1071 146 579 1226 251 346 1130 235 482 1034
4 307 591 1433 180 584 1770 372 354 1621 210 650 1206
5 163 363 1121 124 542 1250 106 398 485 236 470 669
5 175 555 2110 140 558 1525 - - - 240 392 919
6 97 526 1143 111 403 1013 240 612 1339 150 491 1058
6 198 550 1685 142 485 736 - - - 233 479 1042
7 128 482 1328 130 680 1304 104 439 758 125 570 1169
7 129 467 1081 92 449 1106 - - - - - -
8 116 698 929 76 532 2122 135 551 887 177 496 521
8 124 544 1025 147 590 1248 - - - - - -
9 88 593 922 - - - - - - -
9 100 550 742 - - - - - - -
10 - - - 125 488 1200 - - - 134 515 1167
10 - - - 122 593 1509 - - - 127 487 1349
108
Table 8.11. The results per participant for /u:/ (Duration in ms; F1 and F2 in Hz) (Dur. = duration)
Repetition task Picture-naming task
Informant Session 1 Session 2 Session 1 Session 2
Dur. F1 F2 Dur. F1 F2 Dur. F1 F2 Dur. F1 F2
1 234 492 1015 302 428 921 - - - 166 500 1450
1 225 498 1390 213 400 1105 - - - 197 522 1208
2 212 333 769 228 432 849 - - - 172 326 668
2 190 368 912 228 393 889 - - - - - -
3 252 498 796 179 493 1006 235 465 688 160 476 612
3 272 488 802 293 386 959 - 268 485 1021
4 236 508 513 278 479 991 131 395 1142 237 393 781
4 240 474 1281 233 495 1236 248 354 832 -
5 165 491 563 231 462 730 241 390 832 120 406 640
5 198 428 1376 248 418 1450 - - - 289 479 1292
6 216 516 1179 283 434 1067 - - - 188 464 913
6 196 543 2679 318 421 1216 - - - - - -
7 153 369 714 193 289 1170 264 355 701 112 465 490
7 175 605 1483 - - - 239 427 828 218 358 916
8 148 488 774 160 497 585 - - - 137 217 538
8 121 351 740 198 545 1343 - - - - - -
9 167 625 985 - - - - - - - - -
9 145 606 2506 - - - - - - - - -
10 - - - 185 470 883 - - - 167 418 963
10 - - - 186 515 1422 - - - 285 371 1215
109
Number of tokens measured per vowel
Table 8.12. Total number of tokens measured per vowel (n)
Session 1 Session
Dutch_PM task /ɪ/ 22 17
Dutch_PM task /ε/ 23 18
Dutch_PM task /u/ 25 16
Dutch_PM task /y/ 25 18
English_Rep task /ɪ/
18
16
English_PN task /ɪ/ 14 7
English_Rep task /ε/ 27 26
English_PN task /ε/ 18 26
English_Rep task /æ/ 19 18
English_PN task /æ/ 11 17
English_Rep task /ʊ/ 18 18
English_PN task /ʊ/ 11 15
English_Rep task /u:/ 18 17
English_PN task /u:/ 6 14