variation and change in the use of hesitation markers in germanic...
TRANSCRIPT
Variation and change in the use of hesitation markers in Germanic languages
Martijn Wieling1*, Jack Grieve2, Gosse Bouma1, Josef Fruehwald3, John Coleman4, and Mark
Liberman5
1University of Groningen, 2Aston University, 3University of Edinburgh, 4University of Oxford, 5University of Pennsylvania
In this study, we investigate cross-linguistic patterns in the alternation between UM, a
hesitation marker consisting of a neutral vowel followed by a final labial nasal, and UH, a
hesitation marker consisting of a neutral vowel in an open syllable. Based on a quantitative
analysis of a range of spoken and written corpora, we identify clear and consistent patterns of
change in the use of these forms in various Germanic languages (English, Dutch, German,
Norwegian, Danish, Faroese) and dialects (American English, British English), with the use of
UM increasing over time relative to the use of UH. We also find that this pattern of change is
generally led by women and more educated speakers. Finally, we propose a series of possible
explanations for this surprising change in hesitation marker usage that is currently taking
place across Germanic languages.
1. Introduction
Two basic hesitation markers (also referred to as fillers or filled pauses) are common in
modern Germanic languages: the UM form, which consists of a neutral vowel followed by a
final labial nasal, and the UH form, which consists of a neutral vowel in an open syllable. For
example, in the English language these forms are generally written as um and uh in American
English and as erm and er in British English. Similarly, in German a distinction is made
between in ähm or öhm and äh or öh, whereas in Dutch a distinction is made between ehm or
uhm and eh or uh. Similar forms appear to exist in all other Germanic languages. Hesitation markers, including UM and UH, have long been studied in linguistics,
primarily because their use has been seen as being directly related to the cognitive processes
responsible for the production of speech, specifically marking disfluencies (e.g., Maclay and
Osgood, 1959; Goldman-Eisler, 1968; Rochester, 1973; Crystal 1982; Levelt, 1983; Levelt &
Cutler, 1983; Schachter et al., 1991). For example, Schachter et al. (1991) found that lecturers
in the humanities used more hesitation markers than lecturers in the natural sciences when
teaching, but not when being interviewed. They argued that this difference is due to the larger
number of words from which a lecturer in the humanities must choose compared to a lecturer
in the natural sciences, where technical vocabulary is more strictly defined. Because
humanities lecturers have to make more decisions during speech production, they tend to use
more hesitation markers. In other words, hesitation markers are seen as marking disfluency
during language production. This general explanation for the use of hesitation markers has
been referred to as the symptom hypothesis (De Leeuw, 2007). Although disfluencies during language production would appear to explain many
occurrences of hesitation markers in spoken language, other explanations for the use of UM
and UH have been identified. For example, in a series of reaction time experiments, Brennan
and Schober (2001) found that hesitation markers were beneficial to comprehension, as
listeners were faster to select a target object after a filler was used in the stimulus sentence. Indeed, listeners appear to show a disfluency bias when encountering a hesitation marker. For
example, Arnold et al. (2013) found that when encountering disfluent speech, listeners were
more likely to expect a discourse-new referent. In line with this, Bosker et al. (2014) showed
that listeners were more likely to expect a low frequency as opposed to a high frequency word
after a disfluency marker, though listeners adapted this expectation on the basis of the speaker
(i.e. there was no higher expectation for a low-frequency word when listening to non-native
speakers). Similarly, Fox Tree (2001) showed that UH (but not UM) facilitated the speed with
which listeners were able to recognize upcoming words. Fraundorf and Watson (2011)
showed that hesitation markers improve recall whether or not they predict upcoming discourse
boundaries and that no such effect results from coughs of equal duration, ruling out a
processing time effect. In contrast to the symptom hypothesis, this type of explanation for the
usage of hesitation markers has been referred to as the signal hypothesis (De Leeuw, 2007).
Still other researchers have pointed out that UM and UH can be used to fulfill various
discursive functions (e.g. Swerts, 1998; Rendle-Short, 2004; Tottie, 2014). For example,
Swerts (1998) showed that hesitation markers can be used as markers of discourse structure,
with hesitation markers occurring more often with stronger discourse breaks than with weaker
discourse breaks. Similarly, Tottie (2014) argued that UM and UH can be used as discourse
markers, with a similar meaning as the discourse markers well and you know. Linguists have also directly compared the usage of UM and UH. For example, as noted
above, Fox Tree (2001) found that UH but not UM facilitated word recognition by listeners.
Alternatively, Shriberg (1994) reported that UM was more frequently found in sentence-initial
position than UH in American English, a result that Swerts (1998) replicated based on the
analysis of Dutch data. Similarly both Swerts (1998) and Clark and Fox Tree (2002) found
that UH tends to be used by speakers to mark minor delays, whereas UM tended to be used to
mark major delays. Such findings, however, have not been replicated by all researchers. For
example, O’Connell and Kowal (2005) argued that there are no functional differences in the
usage of UM and UH based on their analysis of six media interviews of Hillary Clinton.
Furthermore, based on a review of previous research, Corley and Stewart (2008) concluded
that there is no evidence that speakers have intentional control over the production of UM or
UH (see also Finlayson and Corley, 2012). Differences have also been found in the use of UM
and UH across Germanic languages. For example, De Leeuw (2007) reported that whereas
English and German speakers had a higher frequency of use of UM, Dutch speakers generally
had a higher frequency of use of UH.
The aforementioned studies have all focused on the different functions of UH and UM
from a structural perspective. However, researchers have also analyzed the effect of various
social factors on the choice between these two forms. For example, Rayson et al. (1997)
showed on the basis of a corpus analysis of the British National Corpus (BNC) that er (i.e.
UH) was the second-most characteristic word for male speech and the fourth-most
characteristic word for the speech of older (35+) speakers, whereas erm (i.e. UM) was the
ninth-most characteristic word for people from the upper social class, although they did not
directly contrast social patterns in the use of UM and UH. Liberman (2005), however, found
clear gender- and age-related patterns in the use of UH versus UM in corpora of transcribed
English-language telephone conversations (i.e. the Switchboard, Fisher Part 1 and Fisher Part
2 collections; Godfrey & Holliman, 1993; Cieri et al., 2004; Cieri et al., 2005). He observed
that the use of UH was higher for men than for women and for older speakers than for
younger speakers, whereas the use of UM was higher for women and younger speakers. In
other words, the frequency of UM relative to UH (i.e. the UM/UH ratio) was greater for
younger speakers and women. More recently, various other corpus-based studies have analyzed the use of hesitation
markers in English and have obtained similar results (see Tottie, 2011 for an overview). For
example, on the basis of two sub-corpora of the BNC (i.e. BNC-Demographic and BNC-
Context Governed), Tottie (2011) showed that women, younger people, and people from
higher socio-economic classes had a higher UM/UH ratio than men, older people and people
from lower socio-economic classes—a result that once again suggests that UM usage is rising
over time, led by women and speakers from higher classes. Similarly, Acton (2011) analyzed
the UM/UH ratio in American English based on the relatively recent Speed Dating Corpus
(SDC; Jurafsky et al., 2009) and the older Switchboard corpus (Godfrey and Holliman, 1993)
and obtained similar results, with women showing a greater UM/UH ratio than men in both
corpora. Based on the Switchboard corpus, Acton (2011) also showed that this pattern
persisted at the dialect-region level and when the gender of the hearer was taken into account
(i.e. same-gender dyads appeared to show a greater UM/UH ratio than different-gender
dyads). He also found that younger speakers had a greater UM/UH ratio than older speakers
and that the UM/UH ratio was greater for the more recent SDC than the Switchboard corpus
and therefore suggested that these results (together with the gender difference) might indicate
that a linguistic change is in progress. Similarly, Laserna et al. (2014) analyzed transcripts of
conversations collected by 263 American participants from five different studies (Mehl &
Pennebaker, 2003a, 2003b; Mehl, Gosling & Pennebaker, 2006; Fellows, 2009; Baddeley,
Pennebaker & Beevers, 2013), which were collected via electronically activated recorders
carried by the participants for two to three days, allowing for truly spontaneous conversations
to be obtained. Laserna et al. (2014) did not explicitly contrast the use of UM and UH in their
study, but they reported a significant correlation between gender (male: 1, female: 2) of r = -
.15 (p < .05) for UH, and r = -.09 (p > .05) for UM. Consequently, they concluded that
women showed a lower frequency of use for both UH and UM than men (since the correlation
coefficients are negative) (see also Bortfeld et al., 2001). However, as the reduction appears to
be greater for UH than UM, this result suggests that women in this study are characterized by
a greater UM/UH ratio than men. In addition, Laserna et al. (2014) reported a negative
correlation between age and UM use (r = -.21, p < .001), but not between age and UH use (r =
-.01, p > .05). As the use of UM (but not UH) decreases for older people, this implies that the
UM/UH ratio also decreases for older people, which once again implies that a change in
English hesitation marker usage is currently underway. Previous research on social variation in the use of UM and UH in British and American
English has thus repeatedly identified the same basic patterns: younger speakers and women
use relatively more UM than UH compared to older speakers and men (irrespective of the
potential categorical functional differences between the two alternatives). This type of pattern
is commonly identified in apparent-time sociolinguistic research and is seen as being
indicative of a linguistic change in progress (Labov, 1994) with the use of UM relative to UH
increasing over time. The apparent-time hypothesis assumes that most language is acquired
during childhood and remains relatively stable afterwards. Correspondingly, the speech of
older people is assumed to reflect the linguistic situation when these speakers were young.
Furthermore, variationist sociolinguistics studies have repeatedly found that language change
is led by women (e.g., see Labov, 2001). The first goal of this paper is therefore to assess
whether a change in hesitation markers usage is truly underway in the English language based
on detailed quantitative analyses of both longitudinal and apparent-time data. Furthermore,
because other Germanic languages have comparable hesitation markers, the second goal of
this paper is to investigate whether similar patterns of variation and change in the use of UM
and UH can be found in other Germanic languages, including Dutch, German, Norwegian,
Danish and Faroese.1
2. Data: Spoken language corpora
To compare patterns of linguistic variation and change in the use of the hesitation markers
UM and UH in Germanic languages, we analyzed a range of spoken language corpora
representing the English, Dutch, German, Norwegian, Danish and Faroese languages. For
1 While we focus on Germanic languages in this study, note that a similar gender-related pattern has been
recently observed in Mandarin speech (Yuan et al., submitted).
each of these corpora we generated a primary data set by extracting information about the
usage of UM and UH2 in the corpus as well as a range of social information about each
speaker.3 Most notably, we included gender and age. The age of the speakers may be used as a
way to assess linguistic change. This type of apparent time analysis is a common technique in
sociolinguistic research (see Labov, 1994) and is based on the assumption that if a change in
progress is taking place, younger speakers will tend to use the more modern form, whereas
older speakers tend to use the original form.
2.1. English For the English language, we analyzed five spoken language corpora, including three corpora
of American English, one corpus covering a wide range of British English dialects, and one
corpus of Scottish English. First, we analyzed the Switchboard Corpus of American English (Godfrey & Holliman,
1993), which contains data from approximately 2,400 two-sided telephone conversations
collected in 1990. We extracted all 91,001 tokens of UM (i.e. um) and UH (i.e. uh) from the
corpus, which were produced by a total of 520 different speakers. In addition, we recorded the
position (counted from the start of the utterance) and duration of the hesitation marker and the
duration of preceding and following pauses, as well as the age and gender of each speaker
(education level was not included), and the total number of words that they contributed to the
corpus. Second, we analyzed the Fisher Corpus of American English (Part 1 and Part 2) (Cieri
et al., 2004; Cieri et al., 2005), which contains transcripts of almost 12,000 telephone
conversations collected from 2002 to 2003. We extracted all 19,753 tokens of UM (i.e. um)
and UH (i.e. uh) from the corpus, which were produced by a total of 10,313 different
speakers. In addition, we obtained the age, gender and amount of education (in years) of each
speaker, and the total number of words that they contributed to the corpus.
Third, we analyzed the Philadelphia Neighborhood Corpus (PNC; Labov et al., 2013),
which contains transcripts of interviews with speakers from the Philadelphia area conducted
from 1973 to 2013. We extracted all 25,514 tokens of UM (i.e. um) and UH (i.e. uh) from the
corpus, which were produced by a total of 395 different speakers. In addition, we recorded the
duration of the hesitation marker, whether a pause occurred before or after the hesitation
marker, the year of recording, the age, gender and number of years of schooling of each
speaker, and the total number of words that they contributed to the corpus. Fourth, we analyzed the spoken component of the British National Corpus (BNC;
Coleman et al., 2012), which contains approximately seven million words recorded in 1993.
We extracted all 25,498 tokens of UM (i.e erm) and UH (i.e. er) from the corpus, which were
produced by a total of 960 different speakers. In addition, we recorded the duration of the
hesitation marker and the duration of the pause following the hesitation marker, as well as the
age and gender of each speaker, and the total number of words that they contributed to the
corpus. Fifth, we analyzed the HCRC Map Task Corpus of Scottish English (HCRC Map Task
Corpus, 1993), which contains transcribed speech collected from undergraduates at the
University of Glasgow in 1990, who were participating in a map task in which a guide had to
explain a route drawn on a paper map to a follower who only had a map without the route. We
2 Of course, transcribers may have made errors in assigning the label of the hesitation marker. However, it is
unlikely that these errors are specific to the gender and age of the speakers. 3 The data, methods and results associated with this analysis are available for download as supplementary
materials at the first author’s website (http://www.martijnwieling.nl) and at the Mind Research Repository
(http://openscience.uni-leipzig.de).
extracted all 1,987 tokens of UM (i.e. ehm, erm, mm4, um) and UH (i.e. eh, er, uh), which
were produced by a total of 64 different speakers (of which 61 subjects were Scottish). In
addition, we recorded the position of the hesitation marker in each utterance, as well as the
age, gender, and role (i.e. follower or guide) of each speaker, and the total number of words
that they contributed to the corpus.
2.2. Dutch For the Dutch language, we analyzed the Corpus Gesproken Nederlands (version 2.0) (CGN,
2006), which contains spoken transcribed speech (about 9 million words) from various
sources (e.g., spontaneous conversations, interviews, telephone dialogues) recorded from
1998 to 2004. We extracted all 228,619 tokens of UM (i.e. ehm, uhm) and UH (i.e. eh, uh)
from the corpus, which were produced by a total of 3,433 different speakers. In addition, we
recorded the position and duration of the hesitation marker, the duration of preceding and
following pauses, the preceding and following word, the part-of-speech tag of the preceding
and following word, as well as the age, gender, education level, nationality (Dutch, Belgian),
and level of preparedness (i.e. low for spontaneous speech, high for a televised speech) of
each speaker. Furthermore, we also extracted the total number of words that each speaker
contributed to the corpus.
2.3. German For the German language, we analyzed the Forschungs- und Lehrkorpus Gesprochenes
Deutsch (FLGD; Depperman, 2014), which contains about 100 hours of recorded speech
(about 1 million words) collected from 2005 to 2014. We extracted all 16,221 tokens of UM
(i.e. ähm, öhm) and UH (i.e. äh, öh), which were produced by a total of 238 different
speakers. In addition, we recorded the age and gender of each speaker.
2.4. Norwegian For the Norwegian language, we analyzed the Nordic Dialect Corpus and Syntax Database
(NDCSD; Johannessen et al., 2009), which contains approximately 2.8 million words from
conversations and interviews collected between 1951 and 2012. We extracted all 47,604
tokens of UM (i.e. em, EM, m, M, m-m, m_m) and UH (i.e. e, E, h-e) from the corpus that
were tagged as hesitation markers, which were produced by a total of 554 different speakers.
In addition, we recorded the year of recording, the age group (old: aged 50+, young: aged
between 18 and 30) and gender of each speaker, and the total number of words that they
contributed to the corpus.
2.5. Danish and Faroese Finally, for the Danish and Faroese languages, we analyzed the Faroese Danish Corpus
Hamburg (FADAC; Braunmüller, 2011), which contains 440,000 words collected on the
Faroe Islands from 2005 to 2009. We extracted all 4,504 tokens of UM (i.e. ehm, ehhm, eehm,
æhm, ææhm, øøhm, etc.) and UH (i.e. eh, ehh, eeh, æh, ææh, øøh, etc.) from the corpus,
which were produced by a total of 57 different speakers. In addition, we recorded the
language in which the interview was conducted (Danish, Faroese), the age and gender of each
speaker, and the total number of words that they contributed to the corpus.
4 We included mm, which accounts for 7.7% of all hesitation markers in this dataset, as it generally appears to be
used to mark hesitations, rather than for indicating assent (as opposed to mhm).
3. Data: Twitter corpora
In addition to analyzing various spoken language corpora, we also analyzed the use of UM
and UH in both American and Dutch Tweets, a written register that is especially informal and
shares several features with spontaneous speech. Notably, in instant message conversation, a
similar register of computer-mediated communication, Tagliamonte & Denis (2008) found
that the usage rates of discourse-pragmatic variables were broadly comparable to spoken
language corpora. Of course, the function of UH and UM will often be different in writing
than in speech, in large part because the use of UM or UH in written language is generally a
conscious process (i.e. it has to be typed) and Twitter is less interactional than spoken
language. This results in these forms being used primarily as discourse markers as opposed to
hesitation markers. For example, the following Twitter conversation shows that UM in
Twitter can be used to indicate irony, which appears to be far less common in spoken
language:
A: “Make fun of Jeb Bush's brother all you want, but he would've been dropping bombs
months ago.”
B: “um that's why everyone hates him”
Nevertheless, it is informative to test if patterns in the use of UM and UH can also be
observed in written language.
3.1. English For English Twitter, we analyzed a corpus of 6 billion words of American Tweets collected
by Diansheng Guo of the University of South Carolina in 2013, which only contains tweets
where the longitude and latitude of the user at the time of posting is known, as it was designed
for the analysis of geolinguistic variation. We extracted the 69,075 tokens of UM (i.e. um) and
UH (i.e. uh) from the corpus that were produced by the 25,852 users who contributed at least
1,000 total words to the corpus and whose username contained an unambiguous male or
female name (e.g. John2002 was designated as male, whereas Kate_1234 was designated as
female). Although this approach to identifying gender is not perfect, as some names will be
misclassified, we assume that the chances of misclassifications are relatively modest. In
addition, we recorded the gender of each user and the total number of words that they
contributed to the corpus. It would have also have been possible to use the username to
determine the year of birth, but very few users had a username containing a potential year of
birth (i.e. less than 1% of the 25,852 users).
3.2. Dutch For Dutch Twitter, we analyzed a corpus of 28.9 billion words of Dutch Tweets collected by
the Department of Information Science at the University of Groningen between 2011 and
2014. We extracted the 68,089 tokens of UM (i.e. uhm, um, euhm, ehm, etc.) and UH (i.e. uh,
uuh, eh, eeh, euh, etc.) from the corpus that were produced by the 38,651 users who
contributed at least 1,000 total words to the corpus and whose username contained an
unambiguous male or female name (as described above) and/or a four digit number ranging
between 1930 and 2009, which we used to estimate that user’s year of birth. In contrast to the
English dataset, the (much larger) Dutch Twitter dataset contained this four digit number
frequently in the usernames of Dutch Twitter users. This approach to identifying age also is
not perfect, as some names will be misclassified, but we assume that the chances of
misclassifications are relatively modest.
4. Analysis5
Because the dependent variable for each of the primary data sets is binary (i.e. the use of UM
versus UH or the number of tokens of UM versus the number of tokens of UH), we assessed
the effect of each of our predictor variables (e.g., age, gender, hesitation marker duration) on
the use of UM and UH using mixed-effects logistic regression (Agresti, 2007). By using
mixed-effects regression we are taking the structural variability associated with speakers into
account (see Baayen, 2008). This is important because some speakers may be more likely to
use UM (relative to UH) than others (i.e. modeled via a random intercept for speaker).
Similarly, the effect of each predictor may vary across speakers. For example, for some
speakers a longer duration of the pause following a hesitation marker may be more predictive
of the usage of UM than for other speakers. This would be modeled with a by-speaker random
slope for the duration of a following pause. Since we are using logistic regression, the
estimates need to be interpreted with respect to the logit scale (i.e. the logarithm of the odds of
observing UM rather than UH). Positive estimates indicate an increased probability of
observing UM together with increasing values of the predictor, whereas negative estimates
signal the opposite. An estimate of zero indicates that it has no effect on the probability of
observing UM.
For all of the primary data sets except one, we obtained the best-fitting model including
only significant predictors and supported random intercepts and random slopes. Predictors and
random intercepts and slopes were included if they reduced the Akaike Information Criterion
(AIC; Akaike, 1974) by at least 2, compared to the model without the random intercept or
slope (see also Wieling et al., 2014 for a similar approach). A reduced AIC indicates that the
additional complexity of the model is warranted given the increase in goodness of fit. Due to
the large number of predictors in the Dutch data set, however, we did not fit the best model
but rather fitted a random-intercepts-only model and assessed if the inclusion of individual
random slopes affected the significance of the predictors. We only included predictors that
remained significant in all cases in the final model. Given the large number of predictors in
this model, we also did not evaluate all possible interactions.
We assessed the goodness of fit of these models (including the random-effects structure)
by calculating the index of concordance C, which is known as the receiver operating
characteristic curve area ‘C’ (Harrell, 2001). Values of C greater than 0.8 indicate a successful
classifier, whereas a value of 0.5 indicates the classifier has no predictive power at all. All
models had C values close to or over 0.8 (see supplementary materials for exact values).
5. Results: Spoken language
Table 1 presents the effects (including associated estimations of effect size: the increase in
logits of the dependent variable for the categorical predictors, or per 1 standard deviation
increase of the numerical predictors) of the speaker-related predictors that were present in at
least two data sets (i.e. gender, age, education level, and year of recording) on the use of UM
over UH. Table 1 clearly shows that women are more likely than men to use UM as opposed
to UH across all data sets. Similarly, Table 1 shows that younger speakers are generally more
likely than older speakers to use UM as opposed to UH; only in the case of the relatively
small HCRC Corpus, does the effect of age not reach significance (p = 0.07). Table 1 also
5 Given that we analyzed nine independent data sets, we provide a simplified summary of the results for all
models together in this section, rather than reporting each individual model. The full details for each model can
be found in the supplementary materials (available at the Mind Research Repository: http://openscience.uni-
leipzig.de), which contains all data, all R commands used to generate the models, and all results for each
individual model, as well as detailed instructions on how to conduct the analysis.
shows that more or longer educated people are more likely to use UM as opposed to UH in the
Fisher corpus and the Dutch corpus, but that the effect of education in the PNC was non-
significant. In addition, the effect of education is much smaller than that of age. Finally, Table
1 shows that the use of UM over UH has increased over real-time in the PNC, the Norwegian
Corpus, and in the Dutch corpus. Figure 1 visualizes this result for the three data sets. For
each data set, the graph shows the proportion of UM over UH (i.e. UM/[UM+UH]) by year of
recording (divided into four groups containing roughly the same number of speakers) and
gender. The error bars indicate the 95% confidence interval (i.e. 1.96 standard errors below
and above the mean). It should be noted, however, that whereas the PNC (1973-2013) and the
Norwegian corpus (1951-2012) each span at least 40 years, the Dutch Corpus only spans 13
years and 90% of the data was recorded between 1999 and 2003. The effect of year of
recording is significant even while controlling for the age of the speaker (i.e. it is not an effect
of age grading; see Table 1). When year of recording is excluded from the analysis for the
PNC and instead only year of birth and age are taken into account, the most important
predictor clearly is year of birth; the effect of increasing age (i.e. older people are still more
likely to use UH) is only minimal (p = .04).
Significant interactions (e.g., between age and gender) were identified in some models;
however, because these interactions did not change the direction of the general effect (e.g., the
age effect was negative for both men and women, but less so for men than for women), we did
not explicitly include these interactions in Table 1 (see, however, supplemental materials for
the precise model specifications). Most important, these effects were found to be significant,
while controlling for the effect of other potential important predictors, such as the duration of
the pause before and after the hesitation marker (see Table 3, discussed below). Also note that
for the PNC (and for the Switchboard corpus, but not for the HCRC, nor the BNC), the
predictive value of the duration of the pause after the hesitation marker has diminished for
people born in more recent years (i.e. a longer pause is more likely to predict the occurrence
of UM over UH for older people than for younger people; see supplementary material). This
suggests, for these datasets, that younger people are using UM more across the board, and are
not simply more frequently signaling longer pauses.
Table 1. Effects of subject-related predictors on the choice of UM over UH for all data sets
Gender: Male
vs. Female Age:
Old vs. Young Education:
High/More vs.
Low/Less
Year of
Recording:
Increase vs.
Decrease
Switchboard F (1.03) Y (0.6z - 0.7z)
Fisher F (1.37) Y (0.39z) More (0.11z)
PNC F (1.31) Y (1.2z -1.7z) (More) (0.03z) Increase (0.54z)
BNC F (0.45) Y (0.45z)
HCRC F (2.30) (Y) (0.35z)
German F (0.43) Y (0.94z)
Norwegian F (0.23) Y (0.65) Increase (0.35z) Danish/Faroese F (0.59) Y (0.4z - 0.6z)
Dutch F (0.5 - 0.9) Y (0.3z - 0.6z) High (0.15z) Increase (0.09z) Significant (p < 0.05) and non-significant (category name put between parentheses) effects are listed; an empty cell indicates
the absence of that predictor in that data set. The values between parentheses indicate the effect size (in terms of logits: the
increase in probability of observing UM rather than UH) when the category changes to the one indicated or (when a
subscripted z is shown) when the value of the numerical predictor increases with 1 standard deviation. A range of values
indicates the predictor is involved in an interaction. In other words, the effect of age in the Switchboard corpus varies based
on the hesitation marker being phrase final (smaller effect) or not (larger effect), while the effect of gender and age varies per
country for the Dutch data set (larger for Belgium than for the Netherlands), and the effect of age varies per language in the
Danish/Faroese data set (larger for Faroese), and for gender in the PNC (larger for men).
Figure 2 presents four graphs for the American English Switchboard data set, which
visualize the relationship between age, gender and the use of UM and UH. The first graph
(top-left) plots the proportion of UM over UH (i.e. UM/[UM+UH]) by age (divided into four
age groups containing roughly the same number of speakers) and gender. Similarly as before,
the error bars indicate the 95% confidence interval (i.e. 1.96 standard errors below and above
the mean). This graph shows a clear increase in the proportion of UM over UH across age
groups for both men and women, with women consistently showing a higher rate of UM
usage than men. Note that all speakers in this corpus mostly use UH, with only women in the
two youngest age groups approaching 50% UM usage. The second graph (top-right) plots the
relative frequency of UM and UH taken together (i.e. total hesitation marker frequency
relative to all words in the corpus) by age and gender. This graph shows a clear decline in
hesitation marker usage across age groups for both men and women, with men consistently
using more hesitation markers than women (but note that this pattern is not observed in the
smaller HCRC and Danish/Faroese datasets, likely due to the large individual differences in
hesitation marker frequency; also Bell et al. (2000) found no gender differences in hesitation
marker usage for Swedish speakers). The third graph (bottom-left) charts the frequency of
UM relative to all words in the corpus by age and gender. This graph shows a clear increase in
UM use over age groups with women consistently using UM more frequently than men, even
though this gap appears to be closing in the youngest age group. Finally, the fourth graph
(bottom-right) plots the frequency of UH relative to all words in the corpus by age and
gender. This graph shows a clear decrease in UH usage across age groups with men
consistently using UH more frequently than women.
Figure 3 presents the same four graphs for the Dutch data set. Overall, the Dutch results
are similar to the American English results presented in Fig. 2. In particular, the first graph
(top-left) also shows a clear increase in the usage of UM over UH across age groups with
women showing a higher proportion of UM over UH than men, while the third graph (bottom-
left) shows a clear increase in the relative frequency of UM across age groups with women
using UM more often than men. Finally, the fourth graph (bottom-right) shows a decrease in
the relative frequency of UH across age groups, especially for women. Despite these
similarities, differences between the American English Switchboard data and the Dutch data
are apparent. Whereas hesitation markers in English have been showing a clear decrease in
frequency across age groups, the second graph (top-right) shows that there is no clear trend in
the overall usage of hesitation markers in Dutch (though the distinction between men and
women is similar). The visualizations for the other data sets, which can be found in the supplemental
material, all show relatively similar patterns. Most important, all data sets show an increase
across age groups in the use of UM over UH (with women having the highest proportion of
UM use) and an increase across age groups in the relative frequency of UM. In addition, most
data sets show a decrease across age groups in the use of UH. There are, however, differences
between the nine data sets. In particular, the relative frequency of hesitation markers across
age groups (i.e. the second graph in Figs 2 and 3) varies considerably across the nine data sets.
Despite generally following the same basic trends, there are also considerable
differences in the average overall proportions of UM over UH and the relative frequencies of
UM and UH across the nine data sets. These results are summarized in Table 2. For example,
the average proportion of UM over UH ranges from 27% to 64% for the five English corpora,
compared to 50% in the German corpus, 17% in the Danish corpus, 13% in the Norwegian
corpus, and 11% in the Dutch corpus.
Figure 1. Proportion of UM over UH for three data sets: PNC (top), Norwegian (middle) and
Dutch (bottom) by year of recording and gender.
Finally, Table 3 presents the effects (again including estimations of effect size) of the
hesitation marker-related predictors that were present in at least two data sets (i.e. the duration
of the hesitation marker, the duration or presence (for the PNC) of a pause before the
hesitation marker, the duration or presence (for the PNC) of a pause after the hesitation
marker, the presence of the hesitation marker at the start of the utterance, and the presence of
the hesitation marker at the end of the utterance) on the use of UM over UH. Table 3 only
presents results for the five data sets for which we were able to include information about the
duration and position of hesitation markers and pauses.
Figure 2. American English Switchboard data: proportion of UM over UH (top-left), relative
frequency of hesitation markers (top-right), relative frequency of UM (bottom-left), and
relative frequency of UH (bottom-right) by age and gender.
Table 2. Proportion of UM over UH and relative frequency of UM and UH for all data sets UM Proportion UM Relative
Frequency UH Relative
Frequency
Switchboard 0.2825 0.0075 0.0221
Fisher 0.6408 0.0099 0.0068 PNC 0.2765 0.0045 0.0132 BNC 0.4612 0.0043 0.0045
HCRC 0.5717 0.0081 0.0058 German 0.5017 (no word counts) (no word counts) Norwegian 0.1285 0.0026 0.0189 Danish/Faroese 0.1653 0.0020 0.0079 Dutch 0.1086 0.0037 0.0315
Figure 3. Dutch Spoken data: proportion of UM over UH (top-left), relative frequency of
hesitation (top-right), relative frequency of UM (bottom-left), and relative frequency of UH
(bottom-right) by age and gender.
Table 3. Effects of hesitation marker-related predictors on the choice of UM over UH
Duratio
n of
Marker
Duration/Presence
of pause before
Marker
Duration/Presence
of pause after
Marker
Initial Position
Final Position
Switchboard Longer
(0.87z) Longer (0.12z)
Longer (0.11z)
Initial
(0.67) Final
(1.06)
PNC Longer
(1.25z) (Absent) (-0.08)
Present / Longer (0.59) / (0.55z)
BNC Longer
(1.06z)
Longer (0.44z)
HCRC Initial
(0.83)
Final
(1.07)
Dutch Longer
(1.15z) Longer (0.17z)
Longer (0.47z)
Initial
(0.51) Final
(0.96) Significant (p < 0.05) and non-significant (category name put between parentheses) effects are listed; an empty cell indicates the absence of
that predictor in that data set. The values between parentheses indicate the effect size (in terms of logits: the increase in probability of observing UM rather than UH) when the category changes to the one indicated or (when a subscripted z is shown) when the value of the
numerical predictor increases with 1 standard deviation.
In general, all predictors showed positive estimates, indicating that higher values of the
predictors are associated with a greater likelihood of observing UM as opposed to UH.
Specifically, a longer duration (of the hesitation marker or the pause before or after the
hesitation marker) is associated with a greater likelihood of the hesitation marker being UM
rather than UH, while the occurrence of the hesitation marker in utterance-initial or utterance-
final position is also associated with a greater likelihood of the hesitation marker being UM
rather than UH. Note that in the case of the Philadelphia Neighborhood Corpus, the presence
of a pause before or after the hesitation marker is similar to the hesitation marker being
utterance initial or final as utterances were identified on the basis of the pauses (a pause of
200 ms. or more indicated the break between two utterances).
6. Results: Twitter
Figure 4 presents four graphs for the American English Twitter data set, which visualize the
relationship between gender and the use of UM and UH. The first graph (top-left) plots the
proportion of UM over UH and shows that women are more likely to use UM over UH than
men. The logistic mixed-effects regression model indicates this effect was significant (p <
.001). The second graph (top-right) plots the frequency of UM and UH taken together relative
to all words in the corpus and shows that women are more likely to use hesitation markers
than men. The third graph (bottom-left) plots the frequency of UM relative to all words in the
corpus and shows that women are more likely to use UM overall than men. The fourth graph
(bottom-right) plots the frequency of UH relative to all words in the corpus and shows that
women are more likely to use UH overall than men. These results for the proportion of UM
over UH and the relative frequency of UM agree with the results of the analysis of the
American English spoken language data sets (e.g., see Fig. 2); however, unlike the results of
the spoken analyses, women were found to have higher relative frequencies for UH and for
hesitation markers in general, likely reflecting functional differences in the use of UM and UH
in written language.
Figure 5 presents four graphs for the Dutch Twitter data set, which visualize the
relationship between age, gender and the use of UM and UH. The first graph plots the
proportion of UM over UH and shows that women and younger Twitter users are more likely
to use UM than men and older Twitter users, although in this case the youngest users were
found to reduce their use of UM compared to users from the second youngest group. The
logistic mixed-effects regression model indicates that the age effect was significant (p < .001)
but the gender effect was not (p = .13). However, note that a curvilinear pattern might be a
better fit to the data, with a decrease in proportion of UM over UH for the youngest users. The
second graph plots the frequency of UM and UH taken together relative to all words in the
corpus and shows that women and younger Twitter users are more likely to use hesitation
markers than men. Also in this case, the youngest users were found to reduce their use of
hesitation markers compared to users from the second youngest group. The third graph plots
the frequency of UM relative to all words in the corpus and shows that women and younger
Twitter users are more likely to use UM than men, although once again the youngest users
were found to reduce their use of UM compared to users from the second youngest group. The
fourth graph plots the frequency of UH relative to all words in the corpus and shows that
women and younger Twitter users are more likely to use UH than men, with a similar
deviating pattern for the youngest users. In terms of gender, these results (though not
significant) are in line with the results of the analysis of the American Twitter data. Although the results of the analysis of both the American and Dutch Twitter data
correspond reasonably well overall with the results of the analysis of the spoken language data
sets, the relative frequency of the hesitation markers in the Twitter data is an order of
magnitude lower than in the spoken language data, which likely reflects clear register
differences between speech and writing. Table 4 lists these values, for comparison with the
corresponding values for the spoken data sets presented in Table 2. Note that the proportion of
UM versus UH for the Dutch Twitter data is much larger than for the Dutch spoken data.
Again this is likely indicative of register differences between speech and writing.
Figure 4. American Twitter data: proportion of UM over UH (top-left), relative frequency of
UM and UH (top-right), relative frequency of UM (bottom-left), and relative frequency of UH
(bottom-right) by gender.
Table 4. Relative proportion of UM vs. UH and versus all words for the Twitter data sets
UM
Proportion UM Relative
Frequency UH Relative
Frequency
American English 0.5334 0.00025 0.00019 Dutch 0.6518 0.00011 0.00006
Figure 5. Dutch Twitter data: proportion of UM over UH (top-left), relative frequency of UM
and UH (top-right), relative frequency of UM (bottom-left), and relative frequency of UH
(bottom-right) by age and gender.
7. Discussion
The results of our analyses have shown that there is a consistent pattern of sociolinguistic
variation in the use of the hesitation markers UM and UH across many modern Germanic
languages. In English, Dutch, German, Norwegian, Danish and Faroese, UM is relatively
more common than UH in the language of women and younger speakers when compared to
the language of men and older speakers. Although gender and age patterns in the use of UM
and UH have been identified in previous research on the English language, this paper has
shown that this pattern holds across a wide variety of Germanic languages, as well as several
varieties of English (American, Scottish and other British dialects). Furthermore, because we
analyzed a wide variety of different corpora, this paper has also shown that this pattern is even
more pervasive, existing across a range of time periods and registers, including both speech
and writing. In addition to identifying a cross-linguistic pattern of language variation, the results of
our study strongly suggest that what has actually been identified is a cross-linguistic pattern of
language change. Because variation in the use of UM and UH shows a clear trend across age
groups, with younger speakers using UM rather than UH more often than older speakers, it
appears that there is a change in hesitation marker usage currently taking place across various
Germanic languages, with the use of UM rising over time. This type of apparent-time
evidence, which is common in sociolinguistic research (see Labov, 1994), is based on the
assumption that if a change is taking place, then younger speakers will generally be more
likely than older speakers to prefer the linguistic form that is on the rise. This interpretation of
our age-based results is strongly supported by our longitudinal analyses of the Philadelphian
English, the Norwegian, and (to a lesser extent) the Dutch corpora, which show that the use of
UM is rising in real time. Finally, our finding that women consistently use UM more often
than men is also consistent with this interpretation, as women have frequently been found to
lead linguistic change (see Labov, 1990). This study has therefore uncovered clear evidence
that a similar change is taking place in the use of the hesitation markers UM and UH
(irrespective of whether one accepts a categorical distinction between the two variants or not)
across a range of Germanic languages, with the use of UM as opposed to UH becoming more
frequent over time. This change in the use of hesitation markers is surprising because it is occurring
simultaneously across a relatively large and mostly mutually unintelligible set of Germanic
languages. Examples of cross-linguistic change are not well attested in the literature and it is
unclear how this type of change could have developed or could be maintained. Perhaps the
most basic question is whether this cross-linguistic change began in one language and then
spread to other languages, or whether it developed in all languages simultaneously. This is a
complex puzzle, one for which we cannot provide a definitive answer. In the remainder of this
paper, we therefore present a number of possible explanations for this cross-linguistic change,
discuss the strengths and weaknesses of each of these explanation, and consider how these
competing theories could be tested in future research.
One possible explanation for this cross-linguistic change is that there are independent
patterns of change in use of UM and UH occurring in all six of the Germanic languages for
various different reasons, which are coincidentally all moving in the same direction. Although
such an explanation is possible, completely independent changes progressing in unison across
six different languages is highly improbable. It is therefore necessary to consider other
hypotheses that directly explain why the same basic change is taking place across so many
Germanic languages. There would appear to be two general types of non-coincidental
explanations that could account for these results: the change may have spread through contact
from one of the languages to the others or a true parallel change may be taking place caused
by some factor that affects the use of these related hesitation markers across all the languages. Language contact is one possible explanation for this cross-linguistic change in
hesitation marker use. For example, lexical items in one language that refer to new concepts
are often borrowed into other languages that do not have words to refer to those concepts,
such as the English word ‘computer’. This word was borrowed into Dutch, German and
Danish, although not into Norwegian (datamaskin) or Faroese (telda). English forms, in
particular, would appear to be especially likely to spread through contact, because it is one of
the primary languages of mass media and the Internet, as well as being commonly used as a
second language by many speakers of other Germanic languages. Unfortunately, as all
datasets show an increase in UM use for younger speakers, and the age range covered in each
data set varies considerably, we are not able to identify when this potential development
would have started.
Even though it is well known that linguistic forms can spread through language contact,
which furthermore is often led by women (Van Ness, 1995), it is unclear if language contact
could explain the type of cross-linguistic change in hesitation marker usage identified in this
study. On the one hand, hesitation markers are relatively frequent in the English language,
ensuring that they would be present in the language to which non-native English speakers are
exposed. The proportion of UM (over UH) is also higher on average in the English language
corpora compared to the corpora for other Germanic languages, which is what we would
expect if the change originated in the English language. On the other hand, there is a
considerable range in the average usage of UM over UH in the English corpora (see Table 2),
which in some cases dips below the levels for German speakers in particular. The use of
hesitation markers would also generally appear to be a highly subconscious process and the
shift in usage of UM versus UH in the English language is a subtle change, only having been
identified here through a careful statistical analysis of large amounts of language data.
Furthermore, unlike the examples of language contact presented above, both forms involved
in this change already existed in all the Germanic languages under analysis, so that it is not
the specific form UM that would have spread but a pattern of change that affects a pre-
existing alternation. All of these factors presumably make it more difficult for variation in hesitation markers
to spread through contact than, for example, a new word that refers to a new concept.
However, perhaps that is what is happening here: UM might have taken on a new meaning or
function in English, and it is this meaning or function has spread through contact to other
Germanic languages, which already have a comparable form, To some extent we did control
for functional differences in the use of UM and UH by including various linguistic predictors
in our analyses. For example, UM tended to have a longer duration, was preceded and
followed by longer pauses, and was more frequently found at the beginning or end of an
utterance than UH. These results are in line with earlier studies (e.g., Clark and Fox Tree,
2002, Shriberg, 1994, Swerts, 1998), which found that UM is more likely to signal a major
delay (but see O’Connell and Kowal, 2005). Of course, a longer duration of UM is not
surprising, given that UM is essentially UH plus the labial nasal, but the gender and age-
related patterns still hold when these potential linguistic differences between the two
hesitation markers are controlled for. In addition, for most of the corpora analyzed here, the
overall relative frequency of UM and UH combined was found either to be decreasing or have
remained relatively steady over real or apparent time, which suggests that there has not been a
substantial increase in the use of UM or UH as discourse markers over this period of time. We
did not, however, analyze different linguistic functions of UM or UH. Most notably, as
discussed in the introduction, it is clear that hesitation markers can be used as discourse
markers, for instance to manage turn taking during a conversation, or to signal indecision,
disagreement, focus, or confusion. If UM, for example, is becoming more common as a
discourse marker over time compared to UH in English, then this change could explain the
rise of UM in English and could have been passed on to other Germanic languages through
contact. It should be noted, however, that relatively comparable age- and gender-related
patterns were found in the Twitter data, where UM and UH generally have different functions
than in spoken language. In addition to language contact, a cross-linguistic change could also be the result of
some linguistic or extra-linguistic process that causes each of the languages to change
independently but in parallel. For example, parallel changes can be a result of general
processes of sound change, such as elision, which involves the deletion of segments during
speech to facilitate articulation. There does not appear, however, to be any phonological
processes that would explain the rise in usage of UM compared to UH over time cross-
linguistically, such as a tendency for open syllables to close. In fact, the opposite is true: open
syllables are generally more common than closed syllables in languages of the world, and
furthermore syllables consisting solely of a vowel, such as UH, tend to develop onsets as
opposed to codas over time (Hyman, 2008). It also seems possible that UM could be reduced
to UH through elision in natural speech so as to accelerate language production. General
processes of phonological change therefore do not appear to explain the results of this study. Alternatively, a general extra-linguistic force could be responsible for a parallel change
in the usage of UM and UH across the six Germanic languages. For example, Biber et al.
(2010) found that noun phrase modification in English newspaper writing has become
syntactically more complex and compressed over time, and argue that this is due to the
increasing amount of information incorporated into newspapers in modern times and the
increasing use of word processing technology that has allowed reporters to devote more time
to carefully preparing and editing their texts. Similar societal changes could be affecting the
usage UM and UH cross-linguistically. For example, although there is general prescription
against using both forms in the English language (Erard, 2007), UM is arguably more polite
than UH (e.g., “polite yawning” is used to refer to yawning with the mouth closed; Hilgers et
al., 2000), given that UH leaves the mouth in an open position and that the UH sound is also
common reaction to physical pain, fatigue, sadness, and anger. Given the rise of living
standards, education level, mass media, and the service economy in the Western World over
the course of the 20th century, it is possible that people have become more self-conscious of
their language use, resulting in the rise of UM over UH across Germanic languages.
Unfortunately, the datasets analyzed in this study are not suitable for a more detailed
diachronic analysis of a potential shift towards more self-conscious language use.
Furthermore, to our knowledge there are no scientific studies describing a potential shift
towards (or away from) more self-conscious speech during the 20th century.
In conclusion, this study has shown that there is a clear change taking place across
modern Germanic languages, with UM rising in frequency relative to UH. Furthermore, we
have considered some possible explanations for this surprising cross-linguistic change, with
two hypotheses standing out as being most likely. The first explanation is that the change
originated in English and spread through contact with other Germanic languages, which have
similar forms, possibly reflecting semantic change in the use of UM, i.e. as a discourse
marker. The second explanation is that a parallel change is underway due to general societal
changes in communication in the Western World, for example with UM increasing in usage
because it is more self-conscious than UH. To assess these hypotheses both individually and
in conjunction, as well as potentially generating other explanations for the findings of this
study, it is necessary to conduct more detailed functional and social analyses of UM and UH
usage over time both within and across Germanic languages.
Acknowledgements
We thank Deborah Cameron, Michael Cysouw, Mark Dingemanse, Diansheng Guo, Daniel
Ezra Johnson, Alice Kasakoff, John Nerbonne, Andrea Nini, Emily Waibel, and three
anonymous reviewers whose comments have helped to improve this manuscript. Furthermore,
we thank Thomas Schmidt in helping us to extract the relevant information from the
Forschungs- und Lehrkorpus Gesprochenes Deutsch.
References
Acton, E. K. 2011. On gender differences in the distribution of um and uh. University of
Pennsylvania Working Papers in Linguistics 17(2), article 2.
Akaike, H. 1974. A new look at the statistical identification model. IEEE Transactions on
Automatic Control 19(6): 716-723.
Agresti, A. 2007. An Introduction to Categorical Data Analysis. John Wiley & Sons,
Hoboken, NJ, 2nd edition. Arnold, J. E., Fagnano, M., & Tanenhaus, M. K. 2003. Disfluencies signal theee, um, new
information. Journal of Psycholinguistic Research 32: 25-36. Baayen, R. H. 2008. Analyzing Linguistic Data: A Practical Introduction to Statistics using R.
Cambridge University Press.
Baddeley, J. L., Pennebaker, J. W., & Beevers, C. G. 2013. Everyday social behavior during a
major depressive episode. Social Psychological and Personality Science, 4: 445-452. Bamman, D., Eisenstein, J., & Schnoebelen, T. 2014. Gender identity and lexical variation in
social media. Journal of Sociolinguistics, 18(2): 135-160.
Biber D., Grieve J., & Iberri-Shea H. 2010. Noun phrase modification. In Rohdenburg G. &
Schlüter J. (eds.) One Language, Two Grammars? Differences between British and
American English, 182-193. Cambridge University Press.
Bell L., Eklund R., & Gustafson J. 2000. A Comparison of Disfluency Distribution in a
Unimodal and a Multimodal Human–Machine Interface. Proceedings of the International
Conference on Spoken Language Processing (ICSLP) 2000, 16–20 October 2000, Beijing,
China, vol. 3, 626-629.
Bortfeld, H., Leon, S.D., Bloom, J.E., Schober, M.F., & Brennan, S.E. 2001. Disfluency rates
in conversation: Effects of age, relationship, topic, role, and gender. Language and
Speech, 44: 123-147. Bosker, H. R., Quené, H., Sanders, T. J. M., & de Jong, N. H. 2014. Native 'um's elicit
prediction of low-frequency referents, but non-native 'um's do not. Journal of Memory
and Language, 75: 104-116.
Braunmüller, Kurt 2011. Faroese Danish Corpus Hamburg (FADAC Hamburg),
http://corpora.exmaralda.org/sfb_k8.html. Brennan, S. E., & Schober, M. F. 2001. How listeners compensate for disfluencies in
spontaneous speech. Journal of Memory and Language, 44(2): 274-296.
Cieri, C., Graff, D., Kimball, O., Miller, D., & Walker, K. 2004. Fisher English Training
Speech Part 1 Transcripts LDC2004T19. Web Download. Philadelphia: Linguistic Data
Consortium. Cieri, C., Graff, D., Kimball, O., Miller, D., & Walker, K. 2005. Fisher English Training Part
2, Transcripts LDC2005T19. Web Download. Philadelphia: Linguistic Data Consortium.
Clark, H. H., & Fox Tree, J.E. 2002. Using uh and um in spontaneous speaking. Cognition,
84: 73-111.
Coleman, J., Baghai-Ravary, L., Pybus, J., & Grau, S. 2012 Audio BNC: the audio edition of
the Spoken British National Corpus. Phonetics Laboratory, University of
Oxford, http://www.phon.ox.ac.uk/AudioBNC. Corley, M., & Stewart, O. W. 2008. Hesitation disfluencies in spontaneous speech: The
meaning of um. Language and Linguistics Compass, 2(4): 589-602. CGN. 2006. Corpus Gesproken Nederlands. Version 2.0.
http://tst-centrale.org/nl/producten/corpora/corpus-gesproken-nederlands/6-17 Crystal, D. 1982. Profiling linguistic disability. London: Arnold.
De Leeuw, E. 2007. Hesitation markers in English, German, and Dutch. Journal of Germanic
Linguistics 19(2): 85-114. Deppermann, A., & Schmidt, T. 2014. Gesprächsdatenbanken als methodisches Instrument
der Interaktionalen Linguistik - Eine exemplarische Untersuchung auf Basis des Korpus
FOLK in der Datenbank für Gesprochenes Deutsch (DGD2). In: Domke, Christine &
Gansel, Christa (eds.) Korpora in der Linguistik - Perspektiven und Positionen zu Daten
und Datenerhebung [=Mitteilungen des Deutschen Germanistenverbandes 1/2014], 4-17.
Erard, M. 2007. Um...Slips, Stumbles, and Verbal Blunders, and What They Mean. New York:
Pantheon Books.
Fellows, M. D. 2009. An exploration of emotion language use by preschool-aged children and
their parents: Naturalistic and lab settings. PhD thesis. The University of Texas at Austin Finlayson, I. R., & Corley, M. 2012. Disfluency in dialogue: an intentional signal from the
speaker? Psychonomic bulletin & review 19(5): 921-928.
Fox Tree, J. E. 2001. Listener’s uses of um and uh in speech comprehension. Memory and
Cognition 29: 320-326. Fraundorf, S.H., & Watson, D.G. 2011. The disfluent discourse: Effects of filled pauses on
recall. Journal of Memory and Language 65(2): 161-175. Godfrey, J., & Holliman, E. 1993. Switchboard-1 Release 2 LDC97S62. DVD. Philadelphia:
Linguistic Data Consortium. Goldman-Eisler, F. 1968. Psycholinguistics: Experiments in spontaneous speech. London:
Academic Press. HCRC Map Task Corpus 1993. LDC93S12. Web Download. Philadelphia: Linguistic Data
Consortium.
Hilgers F. J., van Dam F. S., Keyzers S., Koster M. N., van As C. J., & Muller M. J. 2000.
Rehabilitation of olfaction after laryngectomy by means of a nasal airflow-inducing
maneuver: the polite yawning technique. Archives of Otolaryngology–Head & Neck
Surgery 126(6): 726-732.
Hyman, L. M. 2008. Universals in phonology. The linguistic review 25(1-2): 83-137.
Johannessen, J.B., Priestley, J., Hagen, K., Åfarli, T. A., & Vangsnes, Ø. A. 2009. The Nordic
Dialect Corpus - an advanced research tool. In: Jokinen, Kristiina and Eckhard Bick
(eds.) Proceedings of the 17th Nordic Conference of Computational Linguistics
NODALIDA. NEALT Proceedings Series Volume 4, 73-80. Jurafsky, D., Ranganath, R., & McFarland, D. 2009. Extracting social meaning: Identifying
interactional style in spoken conversation. In: Proceedings of Human Language
Technologies: The 2009 Annual Conference of the North American Chapter of the
Association for Computational Linguistics. ACL, 638–646.
Labov, W. 1990. The intersection of sex and social class in the course of linguistic change.
Language Variation and Change 2(2): 205-254. Labov, W. 1994. Principles of Language Change. Volume 1: Internal Factors. Oxford:
Wiley-Blackwell. Labov, W. 2001. Principles of Language Change. Volume 2: Social Factors. Oxford: Wiley-
Blackwell. Labov, W., Rosenfelder, I., & Fruehwald, J. 2013. One hundred years of sound change in
Philadelphia: Linear incrementation, reversal, and reanalysis. Language 89(1): 30-65. Laserna, C. M., Seih, Y. T., & Pennebaker, J. W. 2014. Um... who like says you know. Filler
word use as a function of age, gender, and personality. Journal of Language and Social
Psychology 33(3): 328-338.
Levelt, W. J. M. 1983. Monitoring and self-repair in speech. Cognition 14: 41-104. Levelt, W. J. M., & Cutler, A. 1983. Prosodic marking in speech repair. Journal of Semantics
2: 205-217. Liberman, M.(2005. Young men talk like old women.
http://itre.cis.upenn.edu/~myl/languagelog/archives/002629.html (first accessed 6
November 2005). Maclay, H., & Osgood, C. E. 1959. Hesitation phenomena in spontaneous English speech.
Word 15: 19-44.
Mehl, M. R., Gosling, S. D., & Pennebaker, J. W. 2006. Personality in its natural habitat:
Manifestations and implicit folk theories of personality in daily life. Journal of
Personality and Social Psychology 90: 862-877. Mehl, M. R., & Pennebaker, J. W. 2003a. The social dynamics of a cultural upheaval: Social
interactions surrounding September 11, 2001. Psychological Science 14: 579-585. Mehl, M. R., & Pennebaker, J. W. 2003b. The sounds of social life: A psychometric analysis
of students’ daily social environments and natural conversations. Journal of Personality
and Social Psychology 84: 857-870.
O’Connell, D. C., & Kowal, S. 2005. Uh and um revisited: Are they interjections for signaling
delay? Journal of Psycholinguistic Research 34: 555-576. Rayson, P., Leech, G., & Hodges, M. 1997. Social differentiation in the use of English
vocabulary: Some analyses of the conversational component of the British National
Corpus. International Journal of Corpus Linguistics 2: 133-152.
Rendle-Short, J. 2004. Showing structure: Using um in the academic seminar.
Pragmatics, 14(4), 479-498. Rochester, S. R. 1973. The significance of pauses in spontaneous speech. Journal of
Psycholinguistic Research 2(1): 51-81. Schachter, S., Christenfeld, N., Ravina, B., & Bilous, F. 1991. Speech disfluency and the
structure of knowledge. Journal of Personality and Social Psychology 60(3): 362-367. Shriberg, E. E. (1994). Preliminaries to a theory of speech disfluencies. PhD dissertation,
University of California, Berkeley. Swerts, M. 1998. Filled pauses as markers of discourse structure. Journal of pragmatics
30(4): 485-496.
Tagliamonte, S. A., & Denis, D. 2008. Linguistic Ruin? Lol! Instant Messaging and Teen
Language. American Speech 83(1): 3-34. doi:10.1215/00031283-2008-001
Tottie, G. 2011. Uh and um as sociolinguistic markers in British English. International
Journal of Corpus Linguistics 16(2): 173-197. Tottie, G. 2014. On the use of uh and um in American English. Functions of Language 21(1):
6-29.
Van Ness, S. 1995. Ohio Amish women in the vanguard of a language change: Pennsylvania
German in Ohio. American speech 70(1): 69-80.
Yuan J., Xiaoying X., Lai W., & Liberman M. Submitted. Pauses and pause fillers in
Mandarin monologue speech: The effects of sex and proficiency.
Wieling, M., Montemagni, S., Nerbonne, J., & Baayen, R. H. 2014. Lexical differences
between Tuscan dialects and standard Italian: Accounting for geographic and socio-
demographic variation using generalized additive mixed modeling. Language 90(3): 669-
692.