epubs.surrey.ac.ukepubs.surrey.ac.uk/844682/1/ethesis 2017_kirsty sheph…  · web viewpredictors...

339
1 Predictors of response to Systems Training for Emotional Predictability and Problem Solving (STEPPS) for the treatment of Borderline Personality Disorder Kirsty Shepherd Submitted for the Degree of Doctor of Psychology (Clinical Psychology)

Upload: vuongdien

Post on 05-Mar-2018

216 views

Category:

Documents


1 download

TRANSCRIPT

1

Predictors of response to Systems Training for Emotional Predictability and Problem Solving (STEPPS) for the treatment

of Borderline Personality Disorder

Kirsty Shepherd

Submitted for the Degree of

Doctor of Psychology(Clinical Psychology)

School of PsychologyFaculty of Health and Medical Sciences

University of SurreyGuildford, SurreyUnited KingdomSeptember 2017

2

Statement of Originality

This thesis and the work to which it refers are the results of my own efforts. Any ideas, data, images, or text resulting from the work of others (whether published or unpublished) are fully identified as such within the work and attributed to their originator in the text. This thesis has not been submitted in whole or in part for any other academic degree or professional qualification.

Name: Kirsty Shepherd

3

Overview

Borderline Personality Disorder (BPD) is a heterogeneous diagnosis which encapsulates a

wide range of symptom presentations. There are now a number of effective treatments for

BPD but individuals vary in their response to these different treatments. Understanding

people’s suitability to treatment would be beneficial in targeting and individualising BPD

treatment efforts. This thesis aimed to explore the BPD related predictors of treatment

response. Part one of this portfolio presents a narrative review of predictors of BPD

treatment outcomes. BPD severity, symptoms and subgroups predictors were included in this

review. Findings support previous reports that people with more severe BPD show larger

improvements following treatment, although there was a tendency for this to occur in more

skills-based treatments compared with reflective treatments. The findings on symptom level

predictors were varied and would benefit from replication. Subgroup research is still in its

early stages but findings suggest an important role of internal and external coping styles. Part

two presents an empirical paper investigating predictors of response to Systems Training for

Emotional Predictability and Problem Solving (STEPPS). Severity of BPD was examined as

a predictor of outcomes alongside each of the 9 DSM symptom criteria for BPD. Findings

are discussed in relation to the different methods of managing missing data. Severity was

found to be predictive of response. When severity was broken down into its symptom level

sub-parts, identity disturbance and dissociation were the most reliable predictors of response

to STEPPS.

A summary of my clinical experience during training and a list of assignments completed are

also documented.

4

Table of ContentsContents

Acknowledgements………………………………………………… 5

Research Part 1: Literature Review………………………………….6

Appendix to the Literature Review………………….………………102

Research Part 2: Empirical Paper……………………………………103

Appendices to the Empirical Paper………………………………….163

Part 3: Summary of Clinical Experience…………………………… 229

Part 4: Table of Assessments Completed During Training………… 231

5

Acknowledgements

I would like to thank my fellow trainee psychologists involved in the development and data

collection on the wider STEPPS research project, who have helped me in so many ways. The

advice and guidance from both my internal and external supervisor have been invaluable

throughout this entire project. I also want to thank all the people who participated in the

research and took time to share their thoughts and experiences of their time in STEPPS,

along with the facilitators and STEPPS coordinators who agreed to take part in the research

and made the project possible. I would like to thank Dr Mary Zanarini for allowing me to use

her self-report version of the Zanarini Rating Scale of Borderline Personality Disorder. My

cohort, members of my PPD group and course staff have always been there for advice and

encouragement and I will be ever grateful. Finally, I would like to thank my husband, who

has been patient, understanding and supportive over the past three years and who never

doubted my abilities even when I did.

6

Research Part 1

Literature Review

A systematic review evaluating severity, symptoms and subgroups of

borderline personality disorder as predictors of psychological therapy

outcomes.

Word Count: 7,980

7

Abstract

The diagnosis of Borderline Personality Disorder (BPD) includes a heterogeneous

array of clinical presentations, varying on demographics, comorbidities, symptom

profiles and severity of symptoms. Individuals with BPD show similar heterogeneity

when it comes to treatment response. Research into predictors of treatment response

has examined how these individual differences relate to treatment outcomes.

The aim of this review was to evaluate the current research into predictors of

outcome following psychological treatment, specifically looking at predictors related

to the BPD diagnosis and symptoms. A literature search was conducted using

EBSCO and WebofScience which identified 22 papers. The predictors in these

papers could be categorised under three headings: severity of BPD, symptoms of

BPD and subgroups of BPD. Results confirm that those with more severe BPD show

greater improvements to treatment. This pattern was more consistently observed in

skills based and CBT groups compared to more reflective therapies. Each DSM-IV-

TR symptom criteria for BPD was examined separately. Impulsivity and suicidal

tendencies failed to predict treatment outcomes. There was little consistency between

the other symptoms ability to predict treatment outcomes. By examining subgroups

as predictors of treatment outcome there emerged some preliminary evidence for the

importance of internalising and externalising styles in subgroups of BPD.

Limitations included varied treatment approaches, length of treatment, measures used

and treatment setting which makes direct comparisons difficult. Difficulties with the

exploratory nature and frequent post hoc analysis are also discussed. In terms of

future research there was a general need for replication studies in this area of

research.

8

Introduction

Borderline Personality Disorder (BPD) is characterised by dysregulation of emotion

and unstable interpersonal patterns along with impulsive and self-destructive

behaviours. Despite being relatively uncommon (estimates of 1.16%; Stone, 2006)

there is an incongruently high body of research into BPD which likely represents the

significant challenges associated with its treatment and high prevalence in mental

health settings (Widiger & Weissman, 1991). Complexity of treating BPD is

confounded by high rates of attempted (60-70%) and completed suicide (8-10%)

(Oldman, 2006). The substantial rates of comorbidity in people with BPD also

contribute to its complexity, with multiple diagnoses being the rule rather than the

exception (Zimmerman & Mattia, 1999).

BPD is a heterogeneous condition encompassing a variety of presentations (APA,

2001). The Diagnostic and Statistical Manual (DSM-IV-TR; APA, 2000) outlines

nine criteria for BPD, to reach diagnosis a person is required to meet five or more of

these criteria, allowing for over 150 different symptom profiles (Sanislow et al.,

2002). The World Health Organization's (WHO) International Classification of

Diseases (ICD-10) takes a different approach and refers to “Emotionally Unstable

Personality Disorder”. This review will use the DSM-IV-TR criteria as this is used

by the majority of the research reviewed. Diagnosis of BPD in DSM-5 (APA, 2013)

has not radically changed meaning findings from these papers can largely be applied

to DSM-5.

While some have argued that the heterogeneity of symptomatology in people with

BPD makes the diagnosis invalid (Tyrer, 1999) others have sought to understand the

origin of this diversity. Many indices and characteristics, including severity

9

dimensions (Asnaani, Chelminski, Young, & Zimmerman, 2007), interpersonal

styles (Wright, Hallquist, Beeney, & Pilkonis, 2013), affective reactivity (Kopala-

Sibley, Zuroff, Russell, Moskowitz, & Paris, 2012) and psychopathy (Newhill, Eack,

& Mulvey, 2009), have been examined with the aim of understanding the

heterogeneous presentation found within BPD. No sufficient explanation has been

identified for the assortment of presentations categorised under this single diagnosis.

Some researchers therefore suggest that the varied presentations in BPD could be

more accurately represented with subcategorises rather than a single diagnosis

(Oldham, 2006; Sanislow et al., 2002, Digre, Reece, Johnson, & Thomas, 2009;

Andión et al., 2011). These researchers are investigating the possibility that the

observed heterogeneity in BPD follows patterns and can be organised into more

reliable categorises than a unidimensional diagnosis.

Although once assumed untreatable, since the 1990’s many effective treatments have

emerged. Linehan (1993) published Dialectic Behavioural Therapy for BPD and

since then the number of therapies for BPD has expanded rapidly (Bateman &

Fonagy, 2004; Blum, Pfohl, John, Monahan, & Black, 2002; Clarkin, Yeomans, &

Kernberg, 2006; Kellogg & Young, 2004). Psychological therapy remains the front

line treatment for BPD (Stoffers et al., 2012).

Despite significant improvements in treatments available, response to treatment also

shows heterogeneity, for example Bohus et al. (2004) found that while 50% of

participants made clinically significant improvements following DBT, there were

50% who did not. It has been suggested 40% of people diagnosed with BPD who

request therapy may not benefit from it (Lana & Fernandez-San Martin, 2013).

10

Predicting who is likely to respond to the various treatment options would help

clinicians tailor and target treatments.

Research has examined how individual differences in people with BPD relate to

treatment outcomes, however there are a large number of individual differences

found in the BPD population. Individuals naturally vary in demographics and the

number and type of comorbid diagnoses. Research into the predictive power of

demographic variables and comorbidities has been reviewed (Barnicott et al., 2012).

Thirty three studies were identified, almost all failed to find a significant association

between treatment outcomes and demographics or comorbidities. The same review

(Barnicott et al., 2012) evaluated papers examining severity of BPD as a predictor of

outcome. This variable was found to be related to outcomes with higher baseline

severity predicting larger improvements.

While severity appears to be associated with some of the variance in treatment

outcomes (Barnicott et al., 2012), we know that looking at BPD as a homogeneous

group is likely to mask some key differences within this group. A sufficient body of

research now exists which looks at the separate BPD symptoms and their association

with treatment outcomes (Yen, Johnson, Costello, & Simpson, 2009). Research into

differential treatment response from subgroups of BPD is also starting to emerge

(Nesci, 2009; Eurelings-Bontekoe, Peen, Noteboom, Alkema & Dekker, 2012;

Altieri, 2011; Digre et al., 2009).

Aims and Rationale

Severity of BPD has been identified as a useful variable in predicting treatment

outcome (Barnicott et al., 2012). This finding would benefit from replication. Five

11

additional papers on severity and outcomes which have emerged since 2012 and

warrant review. Research that breaks down BPD severity into separate symptoms has

reached a degree which allows for a review of preliminary findings. The research

into the subgroups of BPD may also provide useful means of breaking down the

severity variable to provide information about predictors of treatment response and

help identify important areas for future research. The aims of this review are:

To review research into severity of BPD as a predictor of outcome.

To review research examining symptoms of BPD, as defined by the DSM-IV-

TR, as predictors of outcome.

To review research examining differential treatment response of identified

subgroups of BPD.

Method

Paper Identification

A literature search was performed and all papers examining predictors of outcome

were included then later screened for the predictors of interest. Searches were

performed on EBSCO host which includes PsycINFO, PsycARTICLES, Psychology

and Behavioral Sciences Collection, PsycBOOKS and MEDLINE. Search terms

were also entered into WebofScience. Search terms in section 1 of table 1 were

combined with each of the sets of search terms in section 2 in turn. Papers were

searched with publication dates between 1990-2016 and only papers available in

English were included. Grey literature was searched by exploring online unpublished

studies and theses, along with a fingertip search of the identified papers’ references.

12

Effective therapies for BPD began to emerge in the 1990’s (Gunderson, 2009), so for

this reason research before this time would less usefully contribute to this review.

Table 1

Search term combinationsSection 1 “((Borderline*) OR (BPD) OR

(emotionally unstable personality) OR (emotional intensity disorder) OR (impulsive type))”

Section 2 “AND ((treatment OR therapy) AND (response OR predict*))”

“AND (precit*) AND (outcome*)”

“AND ((precit*) AND (treatment outcome OR response))”

“AND ((treatment OR therapy) AND (effect*))”

“AND ((treatment OR therapy) AND (modulators))”

The search process identified 747 articles, 171 duplicates were excluded leaving 576

articles for screening. The titles of all 576 papers were screened. 279 were excluded

for not being relevant to the topic under review on the basis of the title (for example

when the focused on a diagnosis other than BPD or articles reporting medication

trials); if the suitability of the paper could not be judge from the title then it was

included at this stage. 297 abstracts were then screened to identify papers on

predictors of outcome with treatment for BPD. This excluded 233 further articles.

The remaining 64 full texts were screened using the inclusion and exclusion criteria

outlined below along with two papers from searching the grey literature and an

additional 6 from reviewing the reference lists of useful papers. 49 were excluded for

13

not meeting the inclusion criteria (see figure 1 for a full breakdown). The final

number for review was 23, and quality was assessed using the Specialist Unit for

Review Evidence (SURE, 2013). This was selected as it provides an amalgamation

of the Critical Appraisals Skills Program (CASP, 2017) and NICE Public Health

Methods Manual (2012) that covered key areas of critique. A qualitative review of

quality was performed using the SURE and the resulting critique is summarised in

table 3.

14

Figure 1: Prisma diagram (Moher, Liberati, Tetzlaff, & Altman, 2009) of systematic

search process

Articles identified through electronic database search N= 747

Articles for title screening N= 576

Exclusion of duplicates N= 171

Articles excluded following title screen N= 279

Articles for abstract screening N= 297

Articles for full text screening N= 64

Articles excluded following abstract screen N= 233

Articles excluded following full text screen N= 49

Reasons for exclusion:Not predictor of outcome study N=10Unavailable in English N=2Predictor not targeted N=18Review study N=1Predictors of response to hospitalisation N=4Adolescent participants N=4Predictors of natural prognosis N=2Treatment not targeting BPD N=2Non-BPD participant group N=4Predictors of dropout N=1Study protocol N=1

Articles included in review N= 23

Grey literature search N= 2

Review of references of key papers N= 6

15

Inclusion

Studies reporting quantitative findings using Randomised Controlled Trials

(RCTs), Naturalistic and Observational studies. A significant proportion of

the research comes from uncontrolled studies. These will be accepted as, at

this early stage of research, associations are key contributors to understanding

predictors of outcome.

Studies using participants either with a diagnosis of BPD, subthreshold BPD

or clinician judged BPD. Formal diagnosis of BPD using DSM criteria will

not be required as studies using sub-threshold BPD often have higher

ecological validity and are more representative of the BPD sample seen in

clinical practice (Bos, van Wel, Appelo, & Verbraak, 2011).

As the variable most consistently found to be related to treatment outcomes

(severity) is specifically related to the degree and content of the BPD

diagnosis only studies including predictor variables which specifically relate

to BPD content were targeted. Target variables fell into one of three

categories:

o Severity, defined by either BPD severity or global severity of

symptoms and measured on a quantitative scale

o Variables relating to the BPD criteria as defined by DSM-IV-TR:

frantic efforts to avoid abandonment, unstable interpersonal

relationships, identity disturbance, impulsivity, suicidal ideation and

deliberate self harm (DSH), affective instability, emptiness, problems

with anger and dissociative symptoms.

16

o Subgroups of BPD. Any reference to subgroup will be used as there is

no current agreement on the subgroups of BPD.

Studies including psychological and talking therapies with a primary aim of

treating BPD. Treatments of varied durations will be included as will both

group and individual therapies. Targeting any one form, duration or type of

therapy at this stage would restrict the richness of data for review.

Studies examining outcomes of BPD or associated general psychopathology

Studies including participants over the age of 18

Exclusion

Studies with participants whose primary diagnosis is not BPD

Studies not examining predictors of outcome

Studies including predictor variables not falling into the three categories

outlined above (factors relating to BPD diagnosis), including demographics,

comorbidities and other characteristics related to BPD including childhood

trauma, executive functioning and mentalisation. Although these are

important variables, including the full array of BPD features was beyond the

scope of this review so symptom variables were restricted to those outlined

by the DSM-IV-TR.

Studies which do not examine psychological therapy including

hospitalisation.

Treatments targeting other primary diagnoses and pharmacological studies.

Studies examining prognosis of BPD as opposed to treatment specific

outcomes. Research examining the natural course of BPD will be excluded as

17

variables predicting treatment response will not necessarily be the same as

those predicting naturally occurring changes.

Studies not examining outcomes of BPD or associated general

psychopathology

Participants under the age of 18

Studies not available in English.

Results

Results are grouped and presented under three headings relating to BPD severity,

symptoms and subgroups depending on the type of predictor variable being studied.

This allows these different predictor variables to be summarised, compared and

critiqued together.

Twenty three studies are reviewed, all papers report quantitative results relating to

how BPD related variables predict treatment outcomes. Eleven RCTs (Arntz, Stupar-

Rutenfrans, Bloo, van Dyck & Spinhoven. 2015; Bateman & Fonagy, 2013; Bellino,

Bozzatello, & Bogetto, 2015; Black, Allen, John, Pfohl, McCormick & Blum., 2009;

Bos et al., 2011; Davidson, Tyrer, Norrie, Palmer, & Tyrer, 2010; Giesen-Bloo et al.,

2006; Gratz, Dixon-Gordon & Tull., 2014; Lenzeweger, Clarkin, Levy, Yeomans &

Kernberg, 2012; Spinhoven, Giesen-Bloo, van Dyck & Arntz, 2008; Verheul, Van

Den Bosch, Koeter, De Ridder, Stijnen & Brink, 2003), ten naturalistic observation

studies (Altieri, 2011; Black, Blum, McCormick & Allen, 2013; Digre et al., 2009;

Eurelings-Bontekoe et al., 2012; Hull, Clarkin & Kakuma, 1993; Kleindienst et al.,

2011; Nesci., 2009; Ryle & Golynkina, 2000; Sparapani, 2014; Yen et al., 2009) and

two controlled studies were identified (Bohus et al, 2004; Meares, Stevenson &

Comerford, 1999). Some studies examined both severity and symptom level

18

predictors (Bohus et al., 2004; Ryle & Golynkina, 2000; Bellino et al., 2015). The

severity and symptom level results are summarised separately under the different

subsections. Eleven studies examine severity predictors (Black et al., 2009; Black et

al., 2013; Bellino et al., 2015; Bohus et al, 2004; Bos et al., 2011; Bateman &

Fonagy, 2013; Giesen-Bloo et al., 2006; Gratz et al., 2014, Meares et al., 1999; Ryle

& Golynkina, 2000; Spinhoven et al., 2008) twelve examine symptom level

predictors (Arntz et al., 2015; Bellino et al., 2015; Bohus et al, 2004; Hull et al.,

1993; Davidson, Tyrer, Norrie, Palmer, & Tyrer, 2010; Gratz et al., 2014;

Kleindienst et al., 2011; Lenzeweger, et al., 2012; Ryle & Golynkina, 2000;

Sparapani, 2014; Verheul et al., 2003; Yen et al., 2009) and four BPD subgroups

(Altieri, 2011; Digre et al., 2009; Eurelings-Bontekoe et al., 2012; Nesci, 2009).

A variety of treatment approaches were studied including DBT (Altieri, 2011),

Cognitive Analytic Therapy (CAT) (Ryle & Golynkina, 2000), Systems Training for

Emotional Predictability and Problem Solving (STEPPS) (Black et al., 2009),

Mentalisation Based Therapy (MBT) (Bateman & Fonagy, 2013), Interpersonal

Psychotherapy (IPT) (Bellino et al., 2015), Psychodynamic therapies (Meares et al.,

1999) and Schema therapy (Giesen-Bloo et al., 2006). Participants were all adults

with mean ages within late twenties and early thirties. Six studies had only female

participants (Bohus et al., 2004; Gratz et al., 2014; Hull et al., 1993; Kleindienst et

al., 2011; Verheul et al., 2003; Yen et al., 2009), although females dominated across

all studies. Studies were conducted in a range of countries including the Netherlands

(Giesen-Bloo et al., 2006), Australia (Nesci, 2009), the U.K (Ryle & Golynkine,

2000), the U.S (Black et al. 2009), Germany (Bohus et al., 2004) and Italy (Bellino et

al., 2015) (see table 2 for additional details).

19

BPD Severity

The predictor variable most frequently studied was severity of BPD. Barnicot et al.

(2012) included a review of severity papers. Six are reviewed again. Since this time

the evidence has almost doubled, with five additional papers published which

warrant incorporation. Of eleven, seven found higher baseline severity predicted

greater improvements following treatment (Meares et al., 1999; Giesen-Bloo et al.,

2006; Black et al., 2009; Bos et al., 2011; Black et al., 2013; Gratz et al., 2014;

Bellino et al., 2015). One study found baseline severity to predict poorer outcomes

(Ryle & Golynkina, 2000) and three found severity had no predictive power over

outcome (Bohus et al., 2004; Spinhoven, Giesen-Bloo, van Dyck, & Arntz, 2008;

Bateman & Fonargy, 2013).

Three RCTs using dimensional measures of severity found that higher baseline

severity predicts larger improvements following treatment of BPD (Black et al.,

2009; Bos, van Wel, Appelo, & Verbraak, 2011; Giesen-Bloo et al., 2006) alongside

two uncontrolled trial (Black, Blum, McCormick, & Allen, 2013; Bellino et al.,

2015).

Two of the RCTs examined STEPPS treatment (Black et al., 2009; Bos et al., 2011).

Black et al. (2009) found higher baseline Zanarini Rating Scale for BPD (ZAN-BPD;

Zanarini, 2003) scores predicted significantly larger improvements on the ZAN-BPD

and higher baseline Borderline Evaluation of Severity over Time (BEST, Blum,

Pfohl, John, Monahan & Black, 2002) scores predicted larger improvements on the

BEST. In one of the higher quality studies, Bos et al. (2011) found baseline severity

was significantly related to increased improvements in scores on the Symptom

20

Checklist-90-R (SCL-90-R; Derogatis & Savitz, 2000) and BPD symptom severity

(BPD-40 score).

Regression to the mean (Barnett, van der Pols, & Dobson, 2005) could explain why

participants with higher baseline scores demonstrate greater reductions following

treatment. However the findings by Bos et al. (2011) cannot be explained by

regression to the mean as the authors found a differential effect of severity on

participants within the treatment and control conditions. Only one study (Spinhoven,

Giesen-Bloo, van Dyck, & Arntz, 2008), using a dimensional measure of severity

(final Borderline Personaity Disorder-Severity Index [BPD-SI] score; Arntz et al.,

2003), found severity to be unrelated to outcome. However when entered into a

regression model predicting reliable change severity contributed to the variance in

scores.

Ryle and Golynkina (2000) was the only study to find a negative association between

baseline severity and treatment outcome. This study divided participants into

improved and unimproved groups based on whether they still reached diagnostic

criteria for BPD at the end of CAT. BPD DSM Index Severity was significantly

higher in the unimproved group than the improved group with a medium effect size

of r=.31. Other studies defined severity as the number of BPD DSM criteria met

(Meares et al., 1999; Bohus et al., 2004; Bateman & Fonargy, 2013; Gratz et al.,

2014). Two did not find a predictive effect of the number of criteria on outcome

(Bohus et al., 2004; Bateman & Fonargy, 2013), one found an initial predictive effect

which did not hold up at follow-up (Gratz et al., 2014) and one found higher number

of DSM-III criteria at baseline predicted greater reductions on number of criteria at

the end of treatment (Meares et al., 1999).

21

Table 2

Methodological information for studies examining severity, symptoms and subgroup predictors of outcomeStudy Treatment Design and

analysisParticipants Measures Outcome Effect size

Altieri (2011). Borderline personality disorder and dialectical behaviour therapy in an Australian private hospital setting treatment response and BPD subtypes

Country of origin: Australia

Dialectical Behavioural Therapy (DBT) on a day patient basis in a private hospital for 1 year

Uncontrolled Naturalistic Study

Groups compared using t-tests

27 participants (23 used for analysis)

All met DSM-IV-R (APA, 2000) criteria for BPD confirmed using the Personality Diagnostic Questionnaire version 4 (PDQ-4) (Hyler, 1994)

85% femaleAge: M: 38 (SD: 11.00)

Subgroups:

Cluster analysis revealed two subgroups of BPD:

Cluster 1: dysregulated/more functional self

Cluster 2 dysregulated/dysfunctional self.

Dependent variables: Young Schema

Questionnaire (Young,1998)

Million Clinical Multi-axial Inventory-III (MCMI) (Million, 1994)

Depression Anxiety and Stress Scale (DASS) (Lovibond & Lovibond, 1995).

Trauma Symptom Inventory (TSI) (Briere, 1995)

Subgroups:

Cluster 2 made significant improvements on borderline psychopathology (p=.005), anger (STAXI, p=0.019) anxiety (MCMI, p=.014) and depression (MCMI, p<.001). Cluster 1 did not make significant improvements on these measures.

Subgroups:

Cluster 2 significant changes:Borderline psychopathology ηp2= .32STAXI ηp2=.23MCMI anxiety ηp2= 0.26MCMI depressionηp2= .55

22

State Trait Anger Expression Inventory (STAXI) (Spielberger, 2003)

Arntz, Stupar-Rutenfrans, Bloo, van Dyck & Spinhoven. (2015). Prediction of treatment discontinuation and recovery from borderline personality disorder: results from an RCT comparing schema therapy and transference focused psychotherapy.

Country of origin:The Netherlands

Schema Focused Therapy (SFT) and Transference Focused Therapy (TFP) for 3 years

Randomised Control Trial (RCT) using multivariate backwards logistic regression to examine predicts of recovery as defined by the Borderline Personality Disorder-Severity Index (BPD-SI) (Giesen-Bloo, Wachters, Schouten & Arntz, 2010) < 15.

86 participants in Holland with a primary diagnosis of BPD confirmed using the Structured Clinical Interview for DSM-IV Axis II Personality Disorders (SCID) (First, Gibbon, Spitzer, Williams, & Benjamin, 1996)

80 females, 6 malesAge: M: 30.6 (SD: 7.8)Average number of previous

Symptoms:

Burden of dissociation via the Borderline Personality Disorder-checklist (BPD-checklist) (Giesen-Bloo, Arntz, & Schouten, 2010)

Identity diffusion measured by the Inventory of Personality Organisation (Clarkin, Foelsch, & Kernberg, 2001)

‘Interpersonal relationships’ item on the BPD-SI, rated on 10 point likert scale.

‘Impulsivity’ item on the BPD-SI, rated on 10 point likert scale.

‘Suicidality’ item on the BPD-SI, rated on 10 point likert scale.

Symptoms:

Subjective burden of dissociation predicted a smaller chance of recovery (p=.009)

Identity diffusion did not predict recovery (BPD-SI<15)(p=.737)

Ratings on interpersonal relationships did not predict recovery (BPD-SI<15) (p=.286)

Impulsivity did not predict recovery (BPD-SI<15) (p=.889)

Symptoms:

Β=-.230

N/A

N/A

N/A

N/A

23

treatments: 3

Exclusion criteria: Psychotic disorders, bipolar, dissociative identity disorder, antisocial personality disorder, ADHD, addiction requiring detoxification and learning disability.

‘Anger problems’ item on the BPD-SI, rated on 10 point likert scale.

Suicidality did not predict recovery (BPD-SI<15)

Anger did not predict recovery (BPD-SI<15)

N/A

Bateman & Fonagy (2013). Impact of clinical severity on outcomes of mentalisation-based treatment for borderline personality disorder.

Country of

Mentalisation-Based Treatment (MBT)

Control condition: structured clinical management

Treatment for

RCT, secondary analysis of data.

Mixed effects regression models and linear regression

134 participants, referred following a recent crisis incident

MBT N= 71Control N= 63

BPD diagnosis confirmed

Severity:

Primary outcome measure was absence of crisis episode (suicide, self-harm or hospital admission

Global Assessment of Functioning (GAF) (APA, 2000)

Beck Depression Inventory (BDI)

Severity:

None of the severity criteria measured predicted outcome in a regression model as a main effect or interaction.

However those

Severity:

Non-significant effect

r= .277* for GSI above the mean

24

origin:The U.K

1.5 year using the SCID

Exclusion criteria: antisocial personality and substance misuse.

(Beck, Ward, Mendelson, Mock, & Erbaugh, 1961)

Social Adjustment Scale Self Report (SAS) (Weissman & Bothwell, 1976)

Inventory of Interpersonal Problems (IIP) (Horowitz, Rosenberg, Baer, Ureno, & Villasenor, 1988)

Symptom Checklist-90- Revised’s (SCL-90-R) Global Severity Index (GSI) for severity of symptom distress (Derogatis & Savitz, 2000)

No. of BPD criteria met

who had above the median scores on the GSI were more likely to recover in MBT than the control group

Bellino, Bozzatello & Bogetto. (2015). Combined treatment of borderline personality disorder with interpersonal

Interpersonal Therapy (IPT) + fluoxetine

34 sessions of outpatient IPT were provided

Post-hoc analysis of data from treatment condition of an RCT.

Linear regressions for continuous variables

27 participants who had previously been allocated to an IPT+fluoxetine condition (Bellino et al, 2010).

Severity:

Dependent variable: Change in Clinical

Global Impression scale (CGI) (Guy, 1976)

Semi-structured interview

Severity:

BPDSI total score was significantly related to change in CGI-S (p=.001, coefficient= .618

Severity:

Coefficient= .222

25

psychotherapy and pharmacotherapy: Predictors of response.

Country of origin:Italy

Significant variables were entered into a multiple regression analysis.

BPD diagnosed using DSM-IV-TR (APA, 2000) criteria and confirmed using the SCID.

Exclusion criteria: lifetime diagnosis of delirium, dementia, amnestic disorder, other cognitive disorders, schizophrenia or other psychotic disorders, bipolar disorder, comorbid axis I or II disorder.

demographic and clinical variables

CGI-S- severity of illnesses subscale,

Hamilton Depression Rating Scale (HDRS) (Hamilton, 1960)

HARS (Hamilton, 1959)

Social and Occupational Functioning Assessment Scale (SOFAS) (Goldman, Skodol, & Lave,1992)

Satisfaction Profile (SAT-P) (Majani & Callegari, 1998)

BPD-SI

Symptoms:

‘Abandonment’ item on the BPD-SI, rated on 10 point

). This remained significant when entered into the multiple regression analysis (p=.028, coefficient= .222)

Higher severity at baseline predicted greater changes following treatment.

Symptoms:

Those with higher ratings on abandonment

Symptoms:

Coefficient=.452 for

26

likert scale

‘Interpersonal relationships’ item on the BPD-SI, rated on 10 point likert scale.

‘Identity’ item on the BPD-SI, rated on 10 point likert scale.

‘Impulsivity’ item on the BPD-SI, rated on 10 point likert scale.

showed greater improvements on CGI score (p=.001)

Ratings on interpersonal relationships did not predict improvements on CGI score (p=.548).

Higher ratings on identity at baseline predicted greater improvements on CGI score (p=.003).

Ratings on impulsivity did not predict improvements on CGI score (p=.114).

contribution of fear of abandonment in 4 item model predicting change in CGI

N/A

Multiple Regression Coefficient=.307 for contribution of identity in 4 item model predicting change in CGI

N/A

27

‘Parasuicidal behaviour’ item on the BPD-SI, rated on 10 point likert scale.

‘Affective instability’ item on the BPD-SI, rated on 10 point likert scale.

‘Emptiness’ item on the BPD-SI, rated on 10 point likert scale.

‘Outbursts of anger’ item on the BPD-SI, rated on 10 point likert scale.

‘Paranoid ideation’ item on the BPD-SI, rated on 10 point likert scale.

Parasuicidal beahviour was not predictive of change in CGI score (p=.304)

Higher affective instability scores predicted greater improvements on CGI score (p=.030)

Emptiness did not predict change in CGI score (p=.266)

Outbursts of anger did not predict change in CGI score

Paranoid ideation significant at the linear regression (p=.080 but was not predictive of change in CGI

N/A

Coefficient=.242 for contribution of affective instability in 4 item model predicting change in CGI

N/A

N/A

N/A

28

score at the multiple regression analysis.

Black, Allen, John, Pfohl, McCormick & Blum. (2009). Predictors of response to Systems Training for Emotional Predictability and Problem Solving (STEPPS) for borderline personality disorder: an exploratory study.

Country of origin:The U.S

System’s Training for Emotional Predictability and Problem Solving (STEPPS) treatment plus Treatment as usual (TAU)

TAU control group

Outpatient weekly group sessions for 20 weeks.

RCT

Multiple regression models and multiple logistic regression analyses with predictor by treatment group interactions

164 participants randomly allocated to treatment groups

STEPPS + TAU N= 92TAU N= 72

85% female

BPD diagnosed using DSM-IV-R criteria, confirmed using the SCID.

Exclusion criteria: diagnosis of schizophrenia, schizoaffective disorder,

Severity:

Baseline ratings from Zanarini Rating Scale

for borderline personality disorder (ZAN-BPD) (Zanarini, 2003)

Borderline Evaluation of Severity Over Time(BEST) (Blum, 2002)

SCL-90-R Social Adjustment

Scale (SAS) Clinical Global

Impression (CGL) severity scale

Barratt Impulsivity Scale (BIS) (Barratt, 1959).

Global Assessment Scale (GAS) (Endicott, Spitzer, Fleiss, & Cohen, 1976)

BDI

Severity:

Greater baseline ZAN-BPD was associated with improvements on the ZAN-BPD

Greater baseline BEST score (and being in the STEPPS group) predicted improvements in BEST total score.

Severity:

Odds ratio for ZAN-BPD as a baseline predictor= .80

29

psychotic mood disorder or primary neurological disorder, cognitive impairments, current substance abuse or dependency or previous participation in STEPPS.

Black, Blum, McCormick & Allen. (2013). Systems Training for Emotional Predictability and Problem Solving (STEPPS) group treatment for offenders with borderline personality disorder.

Country of origin:The U.S

STEPPS treatment

Weekly group sessions for 20 weeks.

Naturalistic Observation study with secondary analysis of data.

Linear mixed effects model

77 offender participants from the US

BPD diagnosis using DSM-IV-R criteria

14 male63 female

Age: M: 31.4 (SD: 8.6)

Severity:

BEST BDI Positive and Negative

Affect Schedule (PANAS) (Watson & Clark, 1994)

Data on additional clinical variables such as self-harm ad suicide attempts collected

Severity:

Higher baseline severity (BEST) predicted larger improvements following STEPPS on the BEST (t=-4.89, p<.001)

Severity:

r= 0.492*

30

Bohus et al. (2004) Effectiveness of inpatient dialectic behavioural therapy for borderline personality disorder: a controlled trial.

Country of origin:Germany

DBTN=31

3-month inpatient program including individual and group therapy.

Control group: waiting list controlN=19

No between group differences on measured demographics and comorbidities

Controlled Trial

Predictor variables were tested using discriminate analysis method and non-parametric hierarchical classification.

Jacobson’s criterion b for clinical relevant change (Jacobson & Truax, 1991) is used.

50 participants

BPD diagnosed using the SCID for DSM-IV Axis II

Participants were required to have at least one suicide attempt or two acts of self-harm within the last two years.

All participants female

Age: M: 29.1 (SD: 7.2) DBT group M: 29.5 (SD: 5.4) Control group

Exclusion

Severity:

No of DSM criteria met

Symptoms:

Predictor variables Lifetime Parsuicide

Count (LPC) (Linehan & Comtois, 1994)

SCL-90-R Hamilton Anxiety

Scale (HARS) State-Trait-Anxiety

Inventory (STAI) (Spielberger, Gorsuch, & Lushene, 1970)

Hamilton Depression Scale (HDRS)

State Trait-Anger Inventory (STAXI)

Dissociations Experiences Scale (DES) (Bernstein & Putnam, 1986)

Global Assessment of

Severity:

No effect was found for the number of DSM criteria met on any outcome measure.

Symptoms:

In the DBT group the initial IIP score was positively correlated with improvements in IIP

Initial scores on STAXI was positively correlated with improvements on the STAXI in both DBT and waitlist control group

Initial scores on the DES were

Severity:

no effect found

Effect sizes not reported and insufficient data provided to calculate.

31

criteria: lifetime diagnosis of schizophrenia, bipolar I disorder, current substance abuse or learning disability. Living over 250 miles from inpatient unit

Functioning Scale (GAF)

IIP

positively correlated with improvements on the DES for both the DBT group and the waitlist control group

Bos, van Wel, Appelo & Verbraak. (2011). Effectiveness of Systems Training for Emotional Predictability and Problem Solving (STEPPS) for borderline personality problems in a ‘real-world’ sample: moderation by diagnosis or

STEPPS treatment plus adjunctive

Outpatient weekly group sessions for 20 weeks.

TAU control group

RCT

Linear mixed model interactions

168 participants

STEPPS N=84Age: M: 33.5 (SD: 8.2)88.1% female

TAU N= 84Age: M: 31.7 (SD: 9.7)85.7% female

Subthreshold participants included

Severity:

Personality Diagnostic Questionnaire (PDQ)

SCID SCL-90 Borderline Personality

Disorder checklist-40 (BPD-Checklist -0)

World Health Organisation’s Quality of Life Assessment (WHOQLA) (Trompenaars,

Severity:

The effectiveness of STEPPS vs TAU did not differ on the basis of DSM-IV diagnosis of BPD.

However there was a differential effect for the STEPPS treatment on the basis of severity

Severity:

d= .445* for SCL-90 outcome

d= .405* for BPD-40 outcome

32

severity?

Country of origin:The Netherlands

Exclusion criteria: did not speak Dutch, cognitively impaired, less than 18 or presented with immediate danger to self or others.

Masthoff , Van Heck, Hodiamont , & De Vries, 2005).

for SCL-90 (F=8.3, p= 0=.005) and BPD-40 (F=6.9, p= .010). Those with more severe symptoms showed a better response to STEPPS than TAU.

Davidson, Tyrer, Norrie, Palmer, & Tyrer. (2010). Cognitive therapy v. usual treatment for borderline personality disorder: prospective 6-year follow-up.

Country of origin:The U.K

Cognitive Behavioural Therapy-Personality Disorder (CBT-PD) 30 individual sessions

TUA control

RCT

Using logistic regression

Dependent variable:Participants classified as having good or poor outcomes. Poor outcomes were classified as any suicide attempts in follow-up period

76 participants from possible 106 from the initial study.

M: 32

84% female

Symptoms:

Acts of Deliberate Self-Harm Inventory (structured interview) (Davidson, 2007)

Symptoms:

DSH did not predict good or poor outcomes (p= .14)

Symptoms:

Odds Ratio: 3.42

Digre, Reece, Johnson & Thomas. (2009).

SPECTRUM- 6 month residential

Uncontrolled Naturalistic Study

77 participants

BPD diagnosis

Subgroups:

Cluster analysis identified

Subgroups:

No significant

Subgroups:

Withdrawn-

33

Treatment response in subtypes of borderline personality disorder.

Country of origin:Australia

treatment involving daily group therapy and individual therapy three times a week

Eclectic program using concepts from DBT and psychodynamic therapy

Mixed analysis of variance and paired sample t-tests

confirmed using the SCID

Female N: 74Male N: 3Age: M: 34 (SD: 8.73)

Participants were required to have a history of unsuccessful treatment and high risk status for admission

Exclusion criteria: limited English, acute mental illness, unable to manage own medication or accessing disability services.

three subgroups of BPD: Withdrawn

internalising Severely disturbed-

internalising Anxious externalising

Dependent variables: Suicide attempts;

Parasuicide History Inventory (PHI) (Linehan, Heard, & Wagner, 1989)

Self harm (PHI) BDI DES

interactions were found on any of the outcome measures.

For withdrawn –internalisers a significant change in DES score was observed (t(11)= 3.38, p=.006)

For severely-disturbed internalisers there was no significant improvements on all outcome measures

For anxious-externalisers a significant change was revealed for BDI score (t(25)= 6.82, p<.001)

internalisers change in DES d=.98

Anxious-externalisers change in BDI d=1.34

Eurelings- Supportive Naturalistic 2,062 Subgroups: Subgroups: Subgroups:

34

Bontekoe, Peen, Noteboom, Alkema & Dekker. (2012). Differential treatment response of subtypes of patients with borderline personality organisation, as associated with theory-driven profiles of the Dutch short form if the MMPI: A naturalistic follow-up study

Country of origin:The Netherlands

treatment, psychotherapy and family therapy. This varied between participants

Follow-Up Study

Repeated measures MANCOVA and using reliable change index

participants

Diagnosed with a borderline personality profile

56.1% femaleAge: M: 38.14 (SD: 10.11)

Exclusion criteria not reported

Five Borderline Personality Organisations (BPO) were identified using the MMPI:

Immature BPO, Narcissistic BPO, High-level/

Overcontrolled BPO, Low-level BPO Psychotic BPO

Dependent variables: Brief Symptom

Inventory (BSI) (Derogatis & Spencer, 1982)

Outcome Questionnaire (OQ-45) (De Jong et al., 2007)

CGI

There was a significant (p<.01) interaction between time and borderline profile.

The High-level BPO group benefited most from treatmentThe Psychotic BPO group did not demonstrate any significant change as a result of treatment

Interaction effect size: ηp2 =0.05

Effect size of high-level BPO score changes: d=0.77Effect size of psychotic BPO score change:d=0.16

Giesen-Bloo, van Dyck, Spinhoven, van Tilburg, Dirksen, van Asselt, Kremers, Nadort & Arntz (2006). Outpatient Psychotherapy for Borderline

SFT

TFP

Both treatments involved 50 minutes sessions twice

RCT

Cox regression survival analysis (relevant to this review).

86 participants from the Netherlands

BPD diagnosed with BPD-SI >20 and confirmed

Severity:

BPD-SI<15 as recover criterion

Reliable change= improvements of 11.70

WHOQLA BPD Checklist

Severity:

Baseline BPDSI-IV had a significant effect (Waldstatistic=15.01; P_.001; 95% CI, 1.03-1.10) on

Severity:

Risk Ratio=1.07

35

Personality Disorder.

Country of origin:The Netherlands

a week for 3 years

Outpatient treatment

using the SCID

Exclusion criteria: psychotic disorders, bipolar disorder, personality disorder, ADHD, addiction requiring detoxification and learning disability.

SFT N=44Age: M: 31.70 (SD: 8.9)90.9% female

TFP N= 42Age: M: 29.45 (SD: 6.5)95.2% female

SCL-90 Rosenberg Self-

Esteem Scale (Rosenberg, 1965)

YSQ Personality Disorder

Belief Questionnaire (BPD subscale) Arntz, Dreessen, Schouten, & Weertman, 2004)

IPO Defence Style

Questionnaire (DSQ) (Andrews, Pollock, & Stewart, 1989)

BPDSI-IV–based reliable changecriterion

For recovery criterion baseline BPDSI-IV was not significant(p=.38).

Gratz, Dixon-Gordon & Tull. (2014). Predictors of treatment

Emotion Regulation Group Therapy (ERGT) +TAU

RCT, secondary analysis of data.

Multilevel

61 participants

Age: M: 32.5 (SD: 10.9)

Severity:

SCID Diagnostic Interview

Severity:

Higher baseline emotion

Severity:

Not reported Insufficient data

36

response to an adjunctive emotion regulation group therapy for deliberate self-harm among women with borderline personality disorder.

Country of origin:The U.S

90 minute group outpatient sessions for 14 weeks

Control condition: TAU waiting list

modelling approach (piecewise linear model)

ERGT N=31TAU N=30

Included subthreshold BPD population

Participants were required to have at least one episode of DSH in the past six months

Exclusion criteria: diagnosis of psychotic disorder, bipolar I disorder and current substance dependence.

for DSM-IV Personality Disorders (DIPD-IV) (Zanarini, Frankenburg, Sickel, & Young, 1996)

LPC Deliberate Self-Harm

Inventory (DSHI) (Gratz, 2001)

Treatment History Interview (THI; Linehan & Heard, 1987)

Self-Harm Inventory (SHI) (Sansone, Wiederman, & Sansone, 1998)

Difficulties in Emotion Regulation Scale (DERS) (Gratz & Roemer, 2004)

Acceptance and Action Questionnaire (AAQ) (Hayes et al., 2004)

ZAN-BPD BEST

Symptoms:

LPC assessed lifetime suicide behavioursDSHI assessed lifetime

dysregulation and more BPD criteria met (DIPD-IV) predicted greater improvements in BPD symptoms.

Symptoms:

Higher baseline DSH was

reported to calculate

Not reported Insufficient data

37

deliberate self harm.

Emotion regulation measured by DERS

associated with poorer treatment response on the BEST (but not the ZAN-BPD) (p<.05)

Higher levels of baseline emotion dyregulation predicted greater improvements in BPD symptoms (ZAN-BPD) (p<.05)

reported to calculate

Not reported Insufficient data reported to calculate

Hull, Clarkin & Kakuma. (1993). Treatment response of borderline inpatients: A growth curve analysis.

Country of origin:The U.S

Psychodynamic psychotherapy 3 times a week as part of hospitalisation

25 weeks

Uncontrolled Naturalistic Study using growth curve analysis and regression to examine how their 3 factors of BPD predict treatment response.

Dependent variable: SCL-90-R

40 female participants in America with BPD (DSM-III-R, APA 1987) confirmed using the SCID

Age= M: 24.7

No exclusion criteria reported, inclusion

Symptoms:

Impulsivity on SCID, rated on 6 point likert scale

Symptoms:

Severity of impulsivity did not predict course of treatment on SCL-90-R.

Symptoms:

N/A

38

criteria appears to be admission to the hospital ward for BPD.

Kleindienst et al. (2011). Dissociation predicts poor response to dialectic behavioral therapy in female patients with borderline personality disorder.

Country of origin:Germany

DBT, 3 month inpatient treatment.

Naturalistic Observation Study using Linear regression analyses

Dependent variable:pre to post differences on the Global Severity Index (GSI) of the SCL-90-R

DESIIP

57, female participants in Germany with at least 2 nonsuicidal self-injurious acts of one suicide attempt within the last 2 years.

BPD diagnosed, confirmed via the SCID

Exclusion criteria: diagnosis of schizophrenia, bipolar, learning disability or current substance

DES

IIP

Higher baseline DES negatively predicted improvements on the GSI (p=.008)

IIP scores at baseline were unrelated to the changes in GSI scores

Β=-0.017

N/A

39

abuse and not living within a 250 mile radius of treatment centre

Lenzeweger, Clarkin, Levy, Yeomans & Kernberg. (2012). Predicting domains and rates of change in borderline personality disorder.

Country of origin:The U.S

TFP, DBT and Supportive Psychotherapy

1 year

Post-hoc analysis of data from RCT to examine domains and rates of change in BPD.

Individual growth curve was calculated, these slope values acted as the dependent variable. Forward stepping multiple linear regression examined predictors of outcome

58 participants who met DSM criteria for BPD confirmed using the International Personality Disorder Examination.

54 femaleAge: M: 30.5 (SD: 6.92)

Symptoms:

Negative Affect measured by the Multidimensional Personality Questionnaire (MPO) (Tellegen, 1982)

Identity diffusion measured by the IPO

Symptoms:

Lower initial levels of negative affect were associated with more rapid clinical improvements in the anger/aggression change domain (p=.014)

Higher baseline identity diffusion was associated with more rapid rates of clinical improvements in the global functioning/social adjustment domain of change (p=.006)

Symptoms:

r=0.30

r= -0.36

40

Aggression measured via the MPQ

Lower initial levels of aggression were associated with more rapid clinical improvements in the anger/aggression change domain (p=.035)

r=.25

Meares, Stevenson & Comerford, (1999). Psychotherapy with borderline patients: A comparison between treated and untreated cohorts.

Country of origin:Australia

Interpersonal psychodynamic psychotherapy (IP)

Outpatient Individual therapy for one hour, twice a week over 12 months.

Control: waiting list with treatment as usual.

Controlled Trial

Multiple regression analysis

60 participants, 30 in treatment condition and 30 in control

BPD diagnosed using DSM-III-R

Treatment resistant sample

Age (treatment condition): M: 29.4 (SD: 7.9)

Severity:

Independent variables Number of DSM-III

criteria for BPD met 27 point scale on BPD

criteria rated by clinicians

Dependent variable Change in number of

DSM-III criteria for BPD met

Severity:

Change in DSM score was dependent on baseline number of DSM-III criteria (p=.03). Individuals with higher DSM baseline scores demonstrated greater reductions, independent of treatment group.

Severity:

Insufficient data to calculate and not reported in paper

41

Age (control condition): M: 32.9 (SD: 7.8)

Nesci. (2009). Identification and clinical utility of subgroups of borderline personality disorder.

Country of origin:Australia

SPECTRUM- 6 month residential treatment involving daily group therapy and individual therapy three times a week Eclectic program using concepts from DBT and psychodynamic therapy.

Uncontrolled Naturalistic Study

Wilcoxon Signed Ranked tests

61 participants

BPD diagnosis confirmed using the SCID

Female N: 60Male N:1Age: M: 33.41 (SD: 8.20)

Participants were required to have a history of unsuccessful treatment and high risk status for admission

Exclusion criteria: BPD not primary diagnosis

Subgroups:

Cluster analysis identified two subgroups of BPD:

Self good/self bad Self good/other bad

Dependent variables: Internal, Personal and

Situational Attribution Questionnaire (Kinderman & Bentall, 1996)

BDI Revised Ways of

Coping Checklist (Vitaliano, et al., 1985)

DES PHI Affective Control

Scale (Williams, Chambless, & Ahrens, 1997)

Subgroups:

Self good/ other bad (externalisers) improved on BDI

Self good/self bad (internalisers) improved on BDI, PHI (self harm), PHI (suicide attempt), affect control scale (anxiety and depression)

Subgroups:

Self good/other bad BDI: r= -.54

Self good/self bad BDI: r= -.52PHI (self harm) r= -.33PHI (suicide attempt) r=-.35Affect control Anxiety r=-.55Affect control depression r=-.53

Ryle & Golynkina,

Cognitive Analytic

Uncontrolled Naturalistic Study

27 participants Severity: Severity: Severity:

42

(2000). Effectiveness of time-limited cognitive analytic therapy of borderline personality disorder: Factors associated with outcome.

Country of origin:U.K

Therapy (CAT)

28 sessions

Outpatient, individual treatment

Chi-squared analysis comparing baseline features of the improved vs. the unimproved groups.

BPD diagnosed (on basis of Personality Assessment Schedule; PAS, Tyrer, Alexander,& Ferguson, 1987). High rates of comorbidity. Inner city, British population.

Male N: 11Female N: 16Age: M: 34.3 (SD: 7.5)

14 taking psychotropic medication6 abusing alcohol 10 used illicit substances3 overdosed during treatment

Predictor variables:

Independently developed measure of severity of BPD features.

BDI SCL-90-R IIP Social Questionnaire

(SC) (Corney, Clare, & Fry, 1982)

Self-harm and hospitalisation following overdose in past year and at any time from recorded histories

Dependent variable: Classification of

improved or unimproved on the basis of whether participants still met criteria for diagnosis based on the PAS.

14 participants were classified as ‘improved’ and 13 as ‘unimproved’

Those in the unimproved group had significantly (p=.015) higher mean scores on severity ratings of DSM features of BPD.

r= .310*not reported in the paper but calculated from χ2 value and N

43

Symptoms:

Impulsivity rated on the PAS

Self-cutting, self-poisoning, hospitalisation after overdose recorded from recorded histories.

Symptoms:

Baseline impulsivity did not differ between the improved and unimproved groups

Self cutting either in the past year (p= .011) or at any time( p=.002) was significantly associated with the unimproved group compared to the improved group

Symptoms:

N/A

Self cutting in the past year associated with unimproved group, χ2 =6.54, r= .49*

Self-cutting at any time associated with unimproved group, χ2= 13.35, r= .70*

Sparapani (2014). Moderators of treatment outcome in dialectical behavior therapy: the role of emotion

DBT for 1 year Naturalistic Observation Study using regression analyses to examine the impact of self related emotion

30 participants in America aged 19-54

Diagnostic data unavailable

Symptoms:

Emotion regulation measured using the DERS

Symptoms:

Those initially indicated elevated DERS scores had higher final ratings of

Symptoms:

β=397

44

regulation and impulsivity.

Country of origin:The U.S

dysregulation on final reported BPD symptoms.

Dependent variable: brief self-report measure of BPD, 12 item measure of BPD was developed for tracking progress during treatment.

Age: M: 30.79 (SD: 9.68)87% female

Exclusion criteria: cognitive disability or psychotic symptoms that would interfere with treatment.

emptiness (p=.041)

Spinhoven,Giesen-Bloo, van Dyck & Arntz. (2008). Can assessors and therapists predict the outcome of long-term psychotherapy in borderline personality disorder?

Country of origin:The Netherlands

SFT

TFP

Both treatments involved 50 minutes sessions twice a week for 3 years

RCT

Recovery criteria was set at BPD-SI<15

Jacobson’s reliable change index was also used (reduction of 11.70).

Hierarchical multiple regression and logistical regression analyses

71 participants from the NetherlandsSFT N= 36Age: M: 31.17 (SD: 8.07)91.7% female

TFP N= 35Age: M: 29.46 (SD: 6.59)94.3% female

BPD diagnosed and BPD-SI score

Severity:

Predictor variables: BPD-SI Global Severity Index

(GSI) SCL-90

Dependent variable: BPD-SI

Assessments occurred every 3 months over a 3 year period

Severity:

Baseline severity did not predict end of treatment BPDSI score however when entered into a regression model to predict reliable change it did contribute to predicting some of the variance in scores.

Severity:

t=.20 for BPD-SI baseline severity in a model predicting BPD-SI at end of treatment with an R²=0.23

45

were calculated. <20

Exclusion criteria: psychotic disorders, bipolar disorder, dissociative identity disorder, antisocial personality disorder, ADHD, addiction requiring detox and learning disabilities.

Verheul, Van Den Bosch, Koeter, De Ridder, Stijnen & Brink, (2003). Dialectic behaviour therapy for women with borderline personality disorder.

DBTN= 27

Weekly individual sessions and weekly skills-training group sessions.

12 months

RCT

General Linear Mixed Models were calculated with baseline severity, treatment condition and time entered into the model and interactions were

58 participants within 40-km of Amsterdam.

BPD diagnosed using the SCID for DSM-IV Axis II

Symptoms:

Parasuicide behaviours and DSH measured at baseline, 11, 22, 33, 44 and 52 weeks using section of the BPD-SI

LPC at baseline, 22 and 52 weeks.

Symptoms:

There was a significantly (p=.029) better response to treatment from those in the higher-severity participant group.

Symptoms:

d= .59*

46

Country of origin:The Netherlands

Control group: usual treatment including clinical management.N=31

analysed.

Participants were categorised into two groups:

Higher severity

Lower severity

On the basis of number of self-mutilating acts.

All participants female

Age: M: 34.9 (SD: 7.7)

Exclusion criteria include:Bipolar diagnosis, psychotic disorder, insufficient Dutch language and cognitive impairments

DBT was superior to TAU for patients in the high severity group but not for those in the lower severity group.

Yen, Johnson, Costello & Simpson. (2009). A 5-day dialectic behavior therapy partial hospital program for women with borderline personality disorder: predictors of

DBT, 5-day inpatient hospital program.

Naturalistic follow-up study examining the predictive power of each of the DSM-IV BPD criteria.

Regression analysis

Dependent

50 female participants in the U.S, who met full criteria for BPD, confirmed using the SCID.

Exclusion criteria:

Symptoms:

Endorsement of ‘fear of abandonment’ BPD symptom

Endorsement of ‘unstable relationship’ BPD symptom

Symptoms:

Frantic efforts to avoid abandonment predicted less improvements on measures of hopelessness

Those who endorse unstable

Symptoms:

Lack of frantic efforts to avoid abandonment as a predictor of change in BHS score β= 0.37

Unstable relationships as

47

outcome from a 3-month follow-up study.

Country of origin:The U.S

variables: BDI Beck

Hopelessness Scale (BHS) (Beck at al., 1974)

DES STAXI BSI Self Injury

Questionnaire (adapted from PHI)

diagnosis of schizophrenia, bipolar disorder, cyclothymic disorder, substance dependence, learning disability.

Additional demographics such as age not reported.

Endorsement of ‘identity disturbance’ BPD symptom

Endorsement of ‘impulsivity’ BPD symptom

Endorsement of ‘self-injury’ BPD symptom

Endorsement of ‘affective

relationships were significantly less likely to self harm during follow-up than those who did not endorse unstable relationships

Unstable identity predicts less improvements in general psychopathology (BSI score)

Endorsing impulsiveness predicted significantly more improvements in anger (p<.01).

Self-injury did not predict any of the outcome measures.

Affective

a predictor of change in self-harm, Wald=7.0

Lack of unstable identity as a predictor of change in BSI score β= 0.35

Impulsivity as a predictor of change in STAXI score β= 0.41

N/A

N/A

48

instability’ BPD symptom

Endorsement of ‘emptiness’ BPD symptom

Endorsement of ‘anger’ BPD symptom

Endorsement of ‘dissociation’ BPD symptom

instability was not predictive of any of the measured outcomes

Those who endorsed emptiness showed greater improvements on measures of dissociation (p<.01), depression (p<.05) and general psychopathology (p<.01)

Anger did not predict any of the measured outcomes

Dissociation did not predict any of the measured outcomes

Emptiness as a predictor of DES: β=0.34Emptiness as a predictor of BDI: β=0.30Emptiness as a predictor of BSI: β=0.40

N/A

N/A

Key: SPECTRUM: Australian treatment based on DBT and psychodynamic principles

49

DBT: Dialectical Behavioural TherapySFT: Schema Focused TherapyTFP: Transference Focused TherapyMBT: Mentalisation Based TherapyIPT: Interpersonal TherapySTEPPS: Systems Training for Emotional Predictability and Problem SolvingCBT-PD: Cognitive Behavioural Therapy for Personality Disorder

ERCT: Emotion Regulation Group TherapyIP: Interpersonal psychodynamic psychotherapyCAT: Cognitive Analytic Therapy

RCT: Randomised Controlled TrialN: Number of participantsTAU: Treatment as usual DSH: Deliberate self-harmSD: Standard deviationM: Mean

Table 3

Outcome information for studies examining severity, symptoms and subgroups as predictors of outcome

Study Strengths Weaknesses

Altieri (2011). Borderline personality disorder and dialectical behaviour therapy in an Australian private hospital setting treatment response and BPD subtypes

Reported effect sizes

Good ecological validity

Used a BPD outcome measure

Larger improvements in Cluster 2 can be explained through regression to the mean.

Cluster 1 only included 5 participants which is a small group to perform accurate comparisons

No examination of interaction effect to examine group differences due to small sample size and non-parametric statistical methods

Uncontrolled study so cause and effect cannot be

50

inferred

Small sample size and under representation of males with BPD

Arntz, Stupar-Rutenfrans, Bloo, van Dyck & Spinhoven. (2015). Prediction of treatment discontinuation and recovery from borderline personality disorder: results from an RCT comparing schema therapy and transference focused psychotherapy.

Study designed on need for replication

IIT analyses conducted

Participants were randomly allocated to treatment conditions

High risk of type 1 error and no adjusted significance value. Authors argue that with an adjusted value and this sample size only ‘unrealistically’ powerful predictors would survive analysis.

No follow-up reported

Post-hoc analysis of data, predictors constructed from available data.

Use of recovery criteria may have masked more subtle changes. Especially considering average baseline BPD-SI score= 33.97.

Bateman & Fonagy (2013). Impact of clinical severity on outcomes of mentalisation-based treatment for borderline personality disorder.

Primary aim of research to examine severity on outcomes.

No significant pre-treatment differences between groups

Random allocation of participants to treatment group

Trial was not powered to examine moderating variables.

Not measure of social or personal adaptation

No specific measure of mentalisation was used

Participants were referred following a crisis incident; this sample may not be a representative

51

Less than 10% of data missing

ITT analyses conducted

An adjusted significance value (0.01) was used to protect against type 1 error due to high number of analyses.

of people with BPD in general.

Bellino, Bozzatello & Bogetto. (2015). Combined treatment of borderline personality disorder with interpersonal psychotherapy and pharmacotherapy: Predictors of response.

Primary aim of the study to examine predictors of outcome

Participants randomly allocated to treatment conditions (in this study on those allocated to treatment condition were included)

Small sample size (N=22 in final analyses)

Uncontrolled trial so cause and effect cannot be inferred

Comorbid diagnoses was an exclusion criteria which reduces the representativeness of the sample as comorbidities are the norm as opposed to the exception. This limits generalisability.

High risk of type 1 error due to the explorative nature of this research

No ITT analyses was conducted

The dependent variable is a subjective measure of improvement completed by a clinician, it is also not BPD specific which could mask certain changes.

Use of single items may affect the power of the analysis. This reduces the variance which will

52

restrict any correlation coefficients.

Black, Allen, John, Pfohl, McCormick & Blum. (2009). Predictors of response to Systems Training for Emotional Predictability and Problem Solving (STEPPS) for borderline personality disorder: an exploratory study.

1 year follow- up included

Randomised allocation to treatment condition

Reasonably high participant numbers

Controlled study with TAU comparison group

Post hoc analyses, not all relevant variables were assessed

49% of participants assigned to STEPPS did not attend at least half of the STEPPS sessions.

Results could be explained through regression to the mean

High discontinuation rate

Limited number of male participants so the findings cannot be generalised to males with BPD.

Due to exploratory nature of the study there was a high risk of type 1 error due to high number of analyses and no adjusted p-value.

Black, Blum, McCormick & Allen. (2013). Systems Training for Emotional Predictability and Problem Solving (STEPPS) group treatment for offenders with borderline personality disorder.

High ecological validity with ‘real-world’ application of STEPPS

Uncontrolled study which means causality could not be inferred

47% attrition rates

Findings cannot be generalised to non-offender populations on the basis of this study alone. Males were also under represented in the sample

53

High risk of type 1 error due to large number of analyses

No follow-up reported

Treatment allocation was not randomised

Authors of paper have invested interest as developers of STEPPS.

Bohus et al. (2004) Effectiveness of inpatient dialectic behavioural therapy for borderline personality disorder: a controlled trial.

Included a control group for comparison.

Applied a Bonferroni correction to calculate an adjusted significance value.

Intent to treat analyses were calculated.

No between group differences on measured demographics and comorbidities

Participants were not randomised into treatment conditions.

Participants groups were unequal.

Drop out from DBT was 22.0% which is higher than what NICE suggests is acceptable (20%).

Unequal baseline number of BPD criteria met. Significantly fewer criteria met by the DBT group.

The abstract does not refer to any testing of predictor variables (so could easily be missed).

Many of the reported effects could be explained through regression to the mean.

Bos, van Wel, Appelo & Verbraak. (2011). Effectiveness of Systems

Ecological validity of sample being representative of the clinical practice

No information on comorbidities

54

Training for Emotional Predictability and Problem Solving (STEPPS) for borderline personality problems in a ‘real-world’ sample: moderation by diagnosis or severity?

including sub-threshold participants

Equal number of participants in treatment conditions

Participants were randomly allocated to treatment conditions using a block randomisation procedure using sealed envelopes.

ITT analyses calculated

Groups did not differ significantly on baseline characteristics

Differential treatment effects means results cannot be explained by regression to the mean

Controlled for number of treatment contacts

Ecologically valid TAU control group

Males under-represented (although less than in other studies) so cannot generalise to males with BPD

TAU control group open to clinician selection bias

Davidson, Tyrer, Norrie, Palmer, & Tyrer. (2010). Cognitive therapy v. usual treatment for borderline personality disorder: prospective 6-year follow-up.

6-year follow up study

ITT analysis

Missing data managed through

Self-report of DSH

Under-representative of males with most participants being female

55

imputation

Reasonably high response rate (72%).

Controlled study

Participants randomised into treatment groups

No information on comorbidities or medication.

Digre, Reece, Johnson & Thomas. (2009). Treatment response in subtypes of borderline personality disorder.

Used an Bonferroni adjusted significance value (p=0.017)

Effect sizes are calculated and reported

Good ecological validity

Used pre-existing data which removes any control over the variables gathered.

It was a secondary aim of the study to look at treatment outcomes

Uncontrolled study so cause and effect cannot be inferred.

Unrepresentative sample as only included treatment resistant, high risk individuals

Under representation of males with BPD so findings can’t be generalised to males

Participants varied in their length of stay in treatment which could impact treatment outcomes

No BPD specific measure was used which may mean important changes to BPD presentation

56

were missed

Performed t-test after finding no significant analysis of variance interactions

Eurelings-Bontekoe, Peen, Noteboom, Alkema & Dekker. (2012). Differential treatment response of subtypes of patients with borderline personality organisation, as associated with theory-driven profiles of the Dutch short form if the MMPI: A naturalistic follow-up study

Large sample size

Relatively equal numbers of males and females so findings can be generalised to both males and females.

Consistent reporting of effect size and strong focus on effect size

Theoretically driven hypotheses.

Study also examined participant well-being as an outcome

Treatments received varied between participants

Uncontrolled trail so no cause and effect can be inferred about the treatment effect

No BPD specific outcome measure if used.

Giesen-Bloo, van Dyck, Spinhoven, van Tilburg, Dirksen, van Asselt, Kremers, Nadort & Arntz (2006). Outpatient Psychotherapy for Borderline Personality Disorder.

Assessment occurred at regular three month intervals

An adaptive biased urn procedure was used for randomisation to ensure balanced participant numbers across sites.

Groups did not differ on baseline measures of sociodemographics

Examining severity as a predictor of outcome was not a primary aim of this study.

Research assistants generally did not remain blinded to allocated treatment conditions.

No natural course or TAU control group

57

Power analyses are calculated and reported

An ITT approach was used

Gratz, Dixon-Gordon & Tull. (2014). Predictors of treatment response to an adjunctive emotion regulation group therapy for deliberate self-harm among women with borderline personality disorder.

Primary aim of study to examine predictors of outcome

Participants randomly allocated to treatment conditions

ITT analysis calculated

Post hoc analysis

Small sample size and associated risk of type 2 error

Large number of analyses increased the risk of type 1 error, however due to the risk of type 2 error due to small sample size the authors decided not a adjust the p value.

The authors suggest behavioural measures could have benefited

The study only included female participants which means the findings cannot be generalised to males with BPD

Effect size not reported

Hull, Clarkin & Kakuma. (1993). Treatment response of borderline inpatients: A growth curve analysis.

It was a primary aim of the study to identify features associated with outcomes.

Hypotheses were theoretically driven.

Unclear about validity of independent measure which was developed from the SCID by the authors.

Uncontrolled study so cannot infer cause and

58

Growth curve analysis examined rates and domains of change which captures the dynamic nature of change

Good ecological validity

effect

All participants were females meaning findings cannot be generalised to males with BPD.

No BPD dependent variable was used which could mask certain changes.

Relied on self-report measures rather than objective measures of symptoms

Many symptoms were combined to create factor variables (2 and 3 not discussed in this review). These could not be included as they cannot be compared to other studies.

Reasonably small sample size.

Many treatment components were included in hospitalisation so it is unclear which component (or which combination) produced change.

No follow-up

Kleindienst et al. (2011). Dissociation predicts poor response to dialectic behavioral therapy in female patients with borderline personality disorder.

Theory driven study with theoretically driven hypothesis and design.

Effect sizes reported

Ecological validity high due to

Unrepresentative participant group as it only includes females so the findings cannot be generalised to males with BPD.

23% drop-out

59

naturalistic nature of study

Prospective design of a controlled study

No BPD specific outcome measure was used

Uncontrolled study so cannot infer cause and effect

Strict inclusion criteria which limits representativeness of sample used

Lenzeweger, Clarkin, Levy, Yeomans & Kernberg. (2012). Predicting domains and rates of change in borderline personality disorder.

The predictors identified were based on two theoretical models

Participants were randomly allocated to treatment groups (although combined in this analysis).

Growth curve analysis examined rates and domains of change, the authors argue this better captures the dynamic nature of change than comparing baseline and end point measures.

Included 1-year follow-up

Relatively small sample size, especially for principle component analysis.

3 different treatment groups were examined as a single group (however treatment group was added as a covariate).

Males were under represented in the sample.

By using component analysis to identify their own domains of change it makes it difficult to compare the results of this study to other predictors of outcome studies.

Meares, Stevenson & Comerford, (1999). Psychotherapy with borderline patients: A comparison between treated and untreated cohorts.

Rating of DSM criteria as used in this study has been found reliable.

Acceptable drop-out rate (16%)

Equal participant numbers in control

Participants were not randomised into treatment or control conditions, leaving the groups open to bias.

Unequal baseline DSM scores with the control group having lower scores.

60

and treatment conditions Effect size not reported and insufficient data to calculate

Most of the therapists were still in training

No information provided on participant’s gender so can’t assume generalisability across genders.

Nesci. (2009). Identification and clinical utility of subgroups of borderline personality disorder.

Reported effect sizes

Good ecological validity

No examination of interaction effect to examine group differences due to small sample size and non-parametric statistical methods

Uncontrolled study so cause and effect cannot be inferred

Small sample size and non-normal distributed data

Unrepresentative sample as only included treatment resistant, high risk individuals

Under representation of males with BPD so findings can’t be generalised to males

No BPD specific measure was used which may mean important changes to BPD presentation were missed

Ryle & Golynkina, (2000). Effectiveness of time-limited

It was a primary aim of the study to identify features associated with

Uncontrolled study so cause and effect cannot be

61

cognitive analytic therapy of borderline personality disorder: Factors associated with outcome.

outcomes.

Assessments occurred at 6 months after therapy and 18 months.

18 participants attended the 18 month follow-up.

Relatively similar numbers of male and female participants- increasing representativeness of the participant group.

inferred.

Independently developed measure of BPD symptom severity which had not been validated or checked for reliability. Emptiness was not included in the severity rating due to lack of inter-rater reliability.

Dichotomous data categories on the basis of not meeting diagnostic criteria, this strict criterion might mask more subtle changes.

The unimproved group include participants which made no improvements and those who made some improvements.

Large no. of variables, which increases the risk of type 1 error.

Reasonably small sample size.

No effect sizes reported

Most of the therapists were still in training.

Sparapani (2014). Moderators of treatment outcome in dialectical behavior therapy: the role of emotion regulation and impulsivity.

Limited number of exclusion criteria which increases the representativeness of the variability amongst BPD population.

Small sample size limits generalisability and increases type 1 error

High number of analyses (72 regressions) with a significance value of .05 increases risk of type 1

62

Effect sizes reported

Ecological validity high due to naturalistic nature of study

error- four of the ten significant results identified would be expected by chance alone.

Participants varied in the amount of time they spent in treatment

Self-developed dependent variable measure which was not checked for validity or reliability in peer reviewed research, no norms available.

Uncontrolled study so cause and effect cannot be inferred

Unpublished study so not peer reviewed

Measures used were restricted to those used in therapy, important variables may have been missed

Spinhoven,Giesen-Bloo, van Dyck & Arntz. (2008). Can assessors and therapists predict the outcome of long-term psychotherapy in borderline personality disorder?

No significant between group differences at baseline (bar no. of axis I diagnoses)

Even participant numbers between groups that were randomised

Used ITT analyses

Under representation of male participants so findings cannot be generalised to males with BPD.

Examining the effect of severity of BPD on treatment outcome was not a primary aim of the study; severity was included as a covariate.

High number of independent variables for the participant numbers included.

63

Verheul, Van Den Bosch, Koeter, De Ridder, Stijnen & Brink, (2003). Dialectic behaviour therapy for women with borderline personality disorder.

Specific aim to examine effect of baseline severity.

Participants were randomly allocated to treatment conditions.

81% completed final assessment.

An adjusted significance value was used by applying a Bonferroni correction due to skewed distributions of BPD-SI and LPC.

Intent to treat (ITT) analysis was performed.

Severity was defined by DSH as opposed to BPD severity of no. of criteria met. The groups did not differ on BPD-SI score suggesting the severity of their BPD symptoms were not being studied in this paper.

Lower- and higher- severity groups were split using the median self-mutilating acts this led to unequal ranges of these acts. There was a range of 0-14 number of self-mutilating acts in the lower-severity group and a range of 14-1000 in the higher severity group.

Authors report it is unlikely the research assistant remained blind to participants’ treatment conditions.

Risk of the Hawthorne effect through greater enthusiasm of therapists for DBT over TAU.

Yen, Johnson, Costello & Simpson. (2009). A 5-day dialectic behavior therapy partial hospital program for women with borderline personality disorder: predictors of outcome from a 3-month follow-up study.

3 month follow-up included

Hypothesis driven on the basis of examining predictive power of different BPD symptoms to avoid under representing variability of BPD.

94% completed all assessments.

Longer follow-up would be beneficial

All participants were females meaning findings cannot be generalised to males with BPD.

Most participants engaged in additional therapy or medication use, these may act as extraneous variables.

64

Ecological validity from using follow-up in a ‘real-world’ setting.

A adjusted significance value was adopted to protect against type 1 error (p<.01)

Reported effect sizes

No comparison group was used

The authors do not explicitly state how they measured BPD symptoms. Symptoms were “endorsed” by participants, assumedly on the basis of the SCID which is the only measure of BPD used.

No BPD specific outcome measure was used which may mask BPD specific changes.

Relatively small effect sizes

65

Critique. The majority of studies using dimensional measures of BPD found

severity to be predictive of larger improvements over a range of treatments. Although

some of these findings can be accounted for by regression to the mean, the findings

of Bos et al. (2011) cannot, suggesting a genuine effect. The predictive effect of

severity was less consistent when defined by number of diagnostic criteria. However

there was more variability in the treatment approaches in the studies using number of

criteria. In Bos et al. (2011), 41.1% of the high severity group (based on BPD-40

score) did not reach BPD diagnosis suggesting that while diagnosis and severity are

related, there are considerable number of people with severe BPD symptoms that do

not reach diagnostic criteria. The results of Bos et al. (2011) suggest that dimensional

measures provide more accurate measurement of severity without over-reliance on

diagnosis.

A common limitation (Bohus et al., 2004; Black et al., 2013; Bos et al., 2011) and a

particular criticism of Black et al. (2009) is high discontinuation rates. Only 51%

participants in Black et al.’s (2009) treatment condition completed more than half of

STEPPS sessions. This risks positively biasing results by only including participants

who had a positive enough experience of therapy to remain in treatment. Controlling

for baseline scores would have been beneficial in these analyses. A strength of many

of the papers is their use of intent to treat analyses (Spinhoven et al., 2008; Giesen-

Bloo et al., 2006; Bohus et al., 2004; Bos et al., 2011; Gratz et al., 2014; Bateman &

Fonargy, 2013). However the frequent use of multiple analyses increased the risk of

Type 1 error in many studies (Black et al., 2009; Black et al., 2013; Gratz et al.,

2014; Bellino et al., 2015).

66

Alongside definitions of severity, studies also varied on the outcome measures used

which makes comparisons difficult. Two studies did not use a BPD-specific measure

(Bohus et al., 2004; Bellino et al., 2015) and instead used general severity scales or

axis-I related measures (e.g. Beck Depression Inventory [BDI]; Beck & Steer, 1984).

Although these are useful for measuring distress, only using these measures to

examine BPD treatments may not reveal the full extent of the changes made. Ryle

and Golynkina (2000) developed their own measure of BPD severity. This had not

been statistically validated, nor had reliability been established which reduced the

strength of their findings as they are not based on an established measure. This may

have contributed to this being the only study which found severity to be negatively

related to improvements. It is also important to consider how different aspects of

BPD are measured. Mentalisation is not measured, even by Bateman and Fonagy

(2013) who examine outcomes of MBT.

It is also useful to consider the treatment modalities examined. As outlined in the

methodology, no treatment modality has a sufficient body of research to warrant a

review of its own. In turn, this review includes a range of treatments varying in

structure, approach, intensity and aims which undoubtedly impact on who benefits

from treatment. Despite these difference, there was a trend towards studies

evaluating skills based groups (DBT, STEPPS, ERGT) finding a predictive effect of

severity whereas the more reflective therapies (CAT, MBT, Transference Focused

Psychotherapy (TFP)) did not find higher baseline severity predicting large

improvements. Gratz et al. (2014) suggests that CBT based interventions may be

more suited to severely affected individuals due to the structured approach and focus

on skill implementation in comparison to more psychodynamic approaches which

67

require higher functioning to benefit from less directive treatment. Nine studies

roughly followed this trend (Ryle & Golynkina, 2000; Spinhoven et al., 2008;

Giesen-Bloo et al., 2006; Black et al., 2009; Bos et al., 2011; Black et al., 2013;

Bateman & Fonagy, 2013; Gratz et al., 2014; Bellino et al., 2015), two went against

(Bohus et al., 2004; Meares et al., 1999). Further research comparing skills based

with more reflective therapies would be beneficial.

Severity does not account for all the variance observed in treatment response. The

impact of different symptoms of BPD may also be important in predicting treatment

responses. Understanding the symptom level predictors could help identify

individuals most likely to benefit from certain treatments.

BPD Symptoms

Yen et al. (2009) argue that to assume homogeneity within a sample of BPD is to

under represent the variety of presentations within this population. One way to

reduce the assumption of homogeneity when studying predictors of outcome is to

examine each of the BPD criteria separately.

Twelve papers (Hull et al., 1993; Ryle & Golynkina, 2000; Verheul et al., 2003;

Bohus et al., 2004; Yen et al., 2009; Davidson et al., 2010; Kleindienst et al., 2011;

Lenzeweger, et al., 2012; Sparapani, 2014; Gratz et al., 2014; Arntz et al., 2015;

Bellino et al., 2015) examine symptoms of BPD and how they relate to treatment

outcomes. This section will review research on each BPD criterion in turn providing

a summary of findings followed by a critique of research methodology.

Fear of abandonment. Yen et al. (2009) found those without fear of

abandonment made significantly larger improvements on levels of hopelessness only

68

(Beck’s Hopelessness Scale [BHS]; Beck, Weissman, & Lester, 1974), with a small-

medium effect size (R2 =.22). Unfortunately this was the only study examining the

impact of fear of abandonment on hopelessness; repetition would be required in

order to confirm this preliminary finding.

Bellino et al. (2015) examined fear of abandonment using the BPD-SI. They found

those with higher ‘fear of abandonment’ ratings showed significantly more

improvements on the Clinical Global Impression scale (CGI, Guy, 1976) following

IPT plus fluoxetine. In Bellino et al.’s (2015) multiple regression model, fear of

abandonment demonstrated a moderate-large effect size (p=.001, coefficient= .452).

Of four variables contributing to the multiple regression, ‘fear of abandonment’ had

the largest effect size.

With only two studies conducted in different countries, finding different results,

using different outcome measures, it would be entirely speculative to attempt to draw

any conclusions on the impact of abandonment fears at this stage.

Unstable relationships. Three studies found that ‘unstable relationships’ did

not predict outcomes on measures of general psychopathology (Yen et al, 2009;

Kleindienst et al., 2011; Bellino et al., 2015) but others found an impact on more

specific measures. It was not surprisingly that baseline Inventory of Interpersonal

Problems (IIP; Horowitz, Rosenberg, Baer, & Ureno, 1988) scores were positively

correlated with improvements on the IIP following DBT (Bohus et al., 2004) as

baseline scores account for a degree of variance in end of treatment scores. Given it

is a wide-ranging assessment of a variety of interpersonal behaviours the IIP does not

necessarily measure instability in relationships. For these reasons the conclusions

which can be drawn are limited. Yen et al. (2009) found significantly lower rates of

69

self harm after treatment in those who had endorsed the unstable relationships

symptom (p<.01, Wald= 7.0); however the authors note that they only measured

experience of unstable relationships at the start of treatment so could not rule out the

possibility that changes to relationships was what impacted on levels of self harm.

When Schema Therapy and TFP were examined unstable interpersonal relationships

(measured by the BPD-SI) did not predict recovery rates (BPD-SI<15) (Arntz,

Stupar-Rutenfrans, Bloo, van Dyck, & Spinhoven, 2015). This was the only study

which used a BPD-specific outcome measure.

Two studies examining skills-based treatment (Yen et al., 2009; Bohus et al., 2004)

found a positive relationship between unstable relationships and outcomes. Two

interpersonally based treatments (Bellino et al., 2015; Arntz et al., 2015) did not find

this effect suggesting unstable relationships are more influential in skills based

groups. This is based on very few studies. Replication would provide further clarity

on this hypothesis.

Identity disturbance. Yen et al. (2009) found that unstable identity predicted

significantly lower improvements in general psychopathology (Brief Symptom

Inventory score [BSI]; Derogatis & Melisaratos, 1983) with a small-medium effect

size (p<.01, R2=.27). Two other studies found unstable identity predicted

significantly higher rates of improvements in global functioning/ social adjustment

domain with a medium effect size (Pearson’s coefficient=-.36, Lenzenweger,

Clarkin, Levy, Yeomans, & Kernberg, 2011) and in CGI (Bellino et al., 2015,

medium effect size, multiple regression coefficient= 0.307).

70

One possible explanation for the different results could be that the DBT treatment

(Yen et al., 2009) was a 5-day treatment whereas the other studies examined longer-

term treatments. It is possible that in the short-term identity disturbance limits

improvements but over longer-term treatments people with identity disturbance

shower greater improvement. This is conceivable given the important contribution of

the therapeutic relationship to recovery (Clarkson, 2003) if someone has particular

difficulties with sense of self then they may be more suited to a longer treatment

where they can build a trusting and effective therapeutic relationship. Arntz et al.

(2015) found baseline identity disturbance was not predictive of treatment outcome.

However, Arntz et al. (2015) had a strict recovery criteria (BPD-SI<15) which could

mask more subtle changes and might account for the lack of significant effect of

identity disturbance in this study.

Impulsivity. Only Yen et al. (2009) found a (small) effect of impulsivity

(p<.01, R2=.17). Participants with higher levels of impulsivity demonstrated greater

improvements in their anger (State-Trait Anger Expression Inventory [STAXI];

Spielberger, 1999) suggesting there could be an aspect of this DBT treatment which

participants with higher impulsivity found particularly useful in applying to their

difficulties with anger.

Others did not find baseline impulsivity to significantly impact on treatment outcome

over five treatment modalities (Psychodynamic psychotherapy, Hull et al., 1993;

CAT, Ryle & Golynkina, 2000; IPT, Bellino et al., 2015; Schema Therapy and TFP,

Arntz et al., 2015). It is possible that DBT, which is well structured and skills

focused, is more suited to create changes in people with higher levels of impulsivity

than these, more psychodynamic and interpersonally based treatments.

71

Suicidal or self-harm behaviours. Most studies examining suicidal or self-

harm behaviours found this criterion did not predict outcomes over a range of

treatments (DBT, Yen et al., 2009; IPT, Bellino et al., 2015; Schema therapy and

TFP, Arntz et al., 2015; CBT Davidson et al., 2010).

Ryle and Golynkina (2000) found that ‘unimproved’ participants who had undergone

time limited CAT had significantly higher baseline deliberate self harm (recent and

lifelong). This study had a small sample size and did not report effect sizes. The

unimproved group may also be a slightly misleading category as this group included

participants who made no change and those who made some change but still met

criteria for BPD. The authors created dichotomous groups by categorising

participants as ‘improved’ or ‘unimproved’. Studying improvements as a continuous

variable would be more transparent and would reveal more subtle changes.

Verheul et al. (2003) defined severity according to degree of self-mutilating acts. In

contrast to the other studies reviewed, Verheul et al. (2003) found higher severity at

baseline to significantly predict greater improvements following DBT.

Affective instability. Studies examining affective instability found

contradictory results. In Bellino et al. (2015) affective instability predicted change in

severity of symptoms (CGI) with a small-medium effect size (multiple regression

coefficient= .242). In Gratz et al. (2014) difficulties in emotion regulation (the

Difficulties in Emotion Regulation Scale [DERS]; Gratz & Roemer, 2004) predicted

improvements in ZAN-BPD following ERGT.

Yen et al. (2009) found no effect of affective instability on any outcome measure

following DBT. Lenzenweger et al. (2012) examined how symptoms cluster and

72

change over time by performing a growth curve analysis using data from their RCT

on DBT, TFP, and Supportive Dynamic Psychotherapy. They found that lower levels

of negative affect predicted significantly higher rates of change in the domain of

anger/aggression with a medium-large effect size (Pearson’s coefficient= .48;

Lenzenweger et al., 2012). There are important differences between negative affect

and affective instability, it is likely that the study by Lenzenweger et al. (2012)

measured a different predictor variable to those specifically examining affective

instability.

Finally, an unpublished study (Sparapani, 2014) found emotion regulation difficulties

to be significantly related to a number of BPD domains but baseline scores on the

DERS only moderated final emptiness scores (R2=.206) but in the opposite direction

than predicted, with higher baseline DERS predicting higher emptiness scores

following DBT (β=397). Due to the contradictions in findings, affective instability

would benefit from further examination before reliable conclusions can be drawn.

Emptiness. Bellino et al. (2015) found no effect while Yen et al. (2009)

found that higher baseline emptiness scores significantly predicted greater

improvements on depression (BDI), dissociation (the Dissociative Experience Scale

[DES]; Bernstein-Carlson & Putnam, 1986) and general psychopathology (BSI).

These studies examined different therapies so it is possible that DBT (Yen et al.,

2009) is more suited to people with high levels of emptiness than IPT (Bellino et al.,

2015). One hypothesis, suggested by Yen et al. (2009), concerned the importance of

the mindfulness component of DBT on emptiness. Another is that the intensive social

component of the 5-day DBT treatment provided a more validating experience for

73

reducing symptoms of emptiness than the individual treatment in Bellino et al.

(2015).

The representativeness of Yen et al.’s (2009) sample is worth considering. Of their

50 participants, all but three endorsed emptiness (using the Structured Clinical

Interview for DSM-IV Axis II Personality Disorders [SCID] First et al., 1996).

Although the authors report that these three participants had a sufficiently varied set

of baseline symptoms, statistically three is not an ideal comparison group size which

calls into question the reliability of the results.

Yen et al. (2009) noted that emptiness is often neglected in research on BPD

treatment. Emptiness is a core symptom of BPD (DSM-IV-R, 2000). It is related to

multiple areas of psychosocial morbidity (Ellison, Rosenstein, Chelminski,

Dalrymple, & Zimmerman, 2015) and has the highest association with suicidal

ideation of the BPD criteria (Klonsky, 2008), for this reason emptiness warrants

further study.

Inappropriate anger. Higher baseline anger predicted significantly greater

improvements in anger following DBT (Bohus et al., 2004). Lenzenweger et al.

(2012) found a large effect (Pearson’s coefficient= .44) of higher baseline aggression

predicting higher rates of change in the domain of anger/aggression. Anger and

aggression, although frequently coinciding, are separate concepts so it is

questionable how neatly this variable maps onto the BPD symptom of anger. While

the findings in Bohus et al. (2004) could be explained through regression to the

mean, the findings of Lenzenweger et al. (2012) were reached through growth curve

and principle component analysis which takes a more dynamic look at the changes

occurring. However Lenzenweger et al.’s (2012) findings were based on only 58

74

participants which is lower than recommended for statistical techniques based on

principles of factor analysis.

Dissociative or paranoid experiences. Bohus et al. (2004), reports that higher

baseline DES scores predicted greater improvements on the DES following DBT.

However this pattern of improvement was also observed in the waiting list control

group suggesting findings could be assigned to a regression to the mean.

Bellino et al. (2015) and Yen et al. (2009) did not find that levels of dissociation

predicted outcomes on their set of dependent variables. Kliendienst et al. (2011)

found that higher baseline dissociation was related to significantly smaller

improvements on the Global Severity Index (GSI) (β=-.017). Arntz et al. (2015) was

the only study to use a BPD specific measure and found that dissociation (BPD-SI)

predicted a smaller chance of recovery (effect size of β=.030).

Kliendienst et al. (2011) suggest that dissociation during treatment disrupts learning

which leads to smaller improvements; however the findings are inconclusive. Further

evidence is required to understand the impact of dissociation on treatment.

Critique. Research into the predictive value of BPD symptoms is still

relatively sparse. Five studies were uncontrolled (Hull et al., 1993; Ryle &

Golynkina, 2000; Yen et al., 2009; Kleindienst et al., 2011; Sparapani, 2014)

meaning we cannot assume that the observed effects would not have occurred

without the intervention. Five studies (Bohus et al., 2004; Lenzeweger et al., 2012;

Gratz et al., 2014; Arntz et al., 2015; Bellino et al., 2015) used data from previous

efficacy research to examine predictors of outcome. These studies operate by

performing exploratory analysis to examine whether any baseline variables predict

75

the observed outcomes. This post hoc approach increases the risk of important

variables being missed as baseline variables have already been decided upon and

may not be based on theoretical considerations, or in some cases variables may be

conveniently constructed from whichever measures were used in the original RCT.

The power of post-hoc analyses are lower than a priori analyses which is a limitation

of many of the papers reviewed.

Kliendienst et al. (2011) was theoretically designed on the basis of dissociation

disrupting learning. Lenzenweger et al. (2012) identified predictor variables on the

basis of two models of BPD (neurobehavioural model; Depue & Lenzenweger, 2005,

and a psychodynamic object relation model; Kernberg, 1984). A number of the

studies do not provide a clear rationale for their choice of predictor variables (Bohus

et al., 2004; Ryle & Golynkina, 2000; Gratz et al., 2014; Bellino et al., 2015).

One study chose predictor variables on the basis of past research with the aim of

replication (Arntz et al., 2015) however most of the research was exploratory and, in

turn, a large amount of analyses were performed with multiple variables and outcome

measures. This increases the risk of Type 1 error. While two studies adopted an

adjusted significance value to protect against Type 1 error (Yen et al., 2009; Bohus et

al., 2004), others did not make this adjustment (Gratz et al., 2014; Bellino et al.,

2015; Arntz et al., 2015). Gratz et al. (2014) did not adjust their significance value

due to their small sample size and associated risk of Type 2 error, while this seems a

necessary measure it still leaves opens the risk of Type 1 error. Arntz et al. (2015)

also opted not to adjust their p-value due to their sample size. Bellino et al. (2015)

acknowledged the higher risk of Type 1 error in their initial regression analyses but

claimed this risk was not as high when the multiple regression was calculated which

76

accounts for multicollinearity and multiple tests, although this is a reasonable

argument conclusions should be interpreted with caution.

Another limitation is that only four studies used a BPD-specific measure (Artnz et

al., 2015; Ryle & Golynkina, 2000; Gratz et al., 2014; Sparapani, 2014). As therapy

targets BPD, important changes may be missed without an appropriate BPD measure.

Arntz et al. (2015) employed the BPD-SI and Ryle and Golynkina (2000) used the

PAS (Tyrer, Alexander, & Ferguson, 1987) but both used a recovery level cut-off to

create dichotomous variables which might have led to more subtle changes being

missed.

The lack of research aimed at replicating results makes it difficult to compare

findings. At this stage, there is insufficient evidence to draw reliable conclusions

about any single treatment modality but it is likely that different individuals will be

suited to different treatment approaches. Replication studies with theoretically driven

hypotheses are needed to drive this research area forward.

BPD Subgroups

Research suggests that severity of BPD is useful in predicting response to treatment.

Acknowledging the limited research and high number of variables, no single

symptom proves a consistent predictor of outcome. Researchers have begun looking

at potential subgroups of BPD (Sanislow et al., 2002; Zittel, Bradley, & Weston,

2006; Digre et al., 2009; Bradley, Zittel & Western, 2005). One study found that a

three factor model of BPD provided a better fit than BPD as a unidimensional

concept (Andión et al., 2011).

77

Four studies examined how subgroups of BPD predict treatment outcomes (Digre et

al., 2009; Nesci, 2009; Altieri, 2011; Eurelings-Bontekoe, Peen, Noteboom, Alkema

& Dekker, 2012). Unfortunately each of these examines different groupings of BPD,

again demonstrating the need for replication in this area. Three studies took place in

Australia (Digre et al., 2009; Nesci, 2009; Altieri, 2011) and one in the Netherlands

(Eurelings-Bontekoe, 2012). Two are unpublished studies examining how subgroups

respond to DBT based treatments (Nesci, 2009; Altieri, 2011). One examines the

response of three subgroups to SPECTRUM (an Australian DBT based treatment)

(Digre et al., 2009) and the final study looks at treatment response of five subgroups

to a variety of treatments (Eurelings-Bontekoe, 2012).

Using cluster analysis Nesci (2009) identified two subgroups: self-good/self-bad

group and self-good/other-bad group, differing on their attribution of negative

events. The external attributors (self-good/other-bad) showed only significant

improvements in depression following DBT. The internal attributors (self-good/self-

bad) demonstrated significant improvements on a number of measures (including

depression, self-harm and suicide attempts).

Altieri (2011) identified two subgroups using two-step cluster analysis (cluster 1:

dysregulated/ more functional self and cluster 2: dysregulated/ defective self) with

participants in the second cluster presenting with more severe symptoms in all BPD

domains. Following DBT, Altieri (2011) found participants in cluster 2 made a

number of significant improvements (STAXI, Million Clinical Multi-axial Inventory-

III [MCMI; Millon, 1994], MCMI Borderline Personality pathology, MCMI

Depression and Anxiety Scales) which were not made by cluster 1. However it seems

possible that these findings could be explained by regression to the mean or the

78

severity effect discussed earlier in this review. Cluster 1 only included five

participants which limits the representativeness of this comparison group, this also

makes it almost impossible to find a significant result or to determine a precise effect

size. Neither Nesci (2009) nor Altieri (2011) performed interaction analyses to

examine group difference; Nesci (2009) assigned this to their small sample size and

use of non-parametric tests.

Digre et al. (2009) examined response to SPECTRUM, an Australian treatment based

on DBT principles. Using cluster analysis, they identified three subgroups:

withdrawn-internalising, severely disturbed-internalising and anxious-externalising.

Despite time by cluster interactions (2x3 ANOVA) being non-significant for any

outcome measure, the authors performed paired sample t-tests for each measure

within each cluster. They justified this because of the low participant numbers in

each cluster and subsequent low power. While this is a useful exploration of the data,

the non-significant ANOVA greatly reduces the conviction of subsequent findings.

Participants in the withdrawn-internalising group demonstrated significant changes

in DES scores (d=.98, large effect), the anxious-externalising group showed

significant changes in BDI scores (d=1.34, large effect) but no significant changes

were observed in the severely disturbed-internalising group.

Eurelings-Bontekoe et al. (2012) identified five subgroups of Borderline Personality

Organisation (BPO) based on profile interpretation of the Minnesota Multiphasic

Personality Inventory (MMPI; Hathaway & McKinley, 1943): Immature BPO,

Narcissistic BPO, High-level/ Overcontrolled BPO, Low-level BPO and Psychotic

BPO. Using a repeated-measures mixed MANCOVA the authors found the High-

level BPO group benefited most from treatment whereas the Psychotic BPO group

79

did not demonstrate significant change. The Psychotic BPO group was identified as

having the most severe symptoms. In this sense the findings contradict Altieri (2011)

who found the most severe group made the largest improvements. However these

groups vary on more than severity of symptoms which makes direct comparison of

these findings impractical.

Critique. Externalising groups in both Nesci (2009) and Digre et al. (2009)

demonstrated changes primarily in depression symptoms following DBT based

treatments. For externalising participants, DBT produced improvements in mood and

other depression features.

Eurelings-Bontekoe, Luyten, Remijsen, et al. (2010) identified High level BPO

participants, who showed the largest improvements, as internalisers and Psychotic

BPO participants (no significant change) as externalisers. Nesci (2009) also found

that their internalising group (self-good/self-bad) demonstrated more improvements

than the externalising group (self-good/other-bad). Digre et al. (2009) found both

internalisers and externalisers made improvements (on the DES and BDI

respectively) but found severely-disturbed-internalisers did not benefit from

treatment.

Digre et al. (2009) recruited participants classified as severe cases or treatment

resistance which might account for why they identified a severely disturbed-

internalizing group unlike Eurelings-Bontekoe et al. (2012). However considering

Eurelings-Bontekoe et al. (2012) had participant numbers of 2,062 compared with

Digre et al.’s (2009) 77, it seems unlikely that they missed all severe or treatment

resistant participants. Digre et al. (2009) point out that while the subgroups identified

in these research papers may be statistically relevant it does not necessarily mean

80

they will translate to clinical relevance. Three of these four studies examined similar

treatment approaches (DBT) which aided comparison, however the grouping of

participants into different subgroups creates problems for comparison. For example,

the severely disturbed internalisers from Digre et al. (2009) may have been grouped

with the other internalisers in Nesci (2009) or in the High-level, overcontrolled BPO

group in Eurelings-Bontekoe et al. (2012).

Discussion

Findings summary

The results echo those of Barnicott et al. (2012) in finding that baseline severity of

BPD is associated with greater improvements following psychological treatment. It

appears this result cannot be always be accounted for by regression to the mean (Bos

et al., 2011). There are a number of difficulties evaluating research into severity

including the range of severity and outcome measures used and the array of treatment

modes studied which makes direct comparisons unfeasible. Despite these difficulties,

there was a trend towards studies evaluating skills based groups (DBT, STEPPS,

ERGT) finding higher severity being associated with larger outcomes while studies

evaluating more reflective therapies (MBT, TFP, CAT) did not observe this effect.

Fear of abandonment was found to be particularly important for predicting treatment

outcomes following IPT (Bellino et al., 2015). Findings from Yen et al. (2009)

suggest that identity disturbance might be problematic for shorter term therapy and

people with identity disturbances might be more suited to longer term treatments

(Lenzenweger et al., 2011; Bellino et al., 2015). Impulsivity and DSH did not

reliably predict outcomes (Ryle & Golynkina., 2000; Bellino et al., 2015; Arntz et

81

al., 2015; Yen et al., 2009). Results were mixed for affective instability. Emptiness

was identified as an understudied symptom. Yen et al. (2009) alone found emptiness

to be related to multiple improvements following brief intensive therapy however

this finding would benefit from replication.

The findings on individual symptoms were varied and inconsistent; this is not

surprising considering the varied treatment types, outcome measures and variable

measures. Replication studies examining the predictive value of BPD symptoms

would be beneficial to increase the reliability of the findings. The available research

does not reveal clear relationships between particular symptoms and treatment type.

Unstable relationships appeared more predictive for DBT groups than interpersonal

treatments however more research is needed to reveal potential patterns. There was a

general pattern of symptom based predictors being less able to predict change in

more interpersonally based treatments. It seems feasible that this reflects the different

foci of these treatments. The symptom based measures used in a large proportion of

this research may be less suited to measuring changes in these more reflective

therapies.

The treatment responses of subgroups of BPD suggest that people with more

internalising tendencies were more likely to show improvements compared to

externalising individuals. This finding did not appear to apply to more severely-

disturbed internalisers but may suggest an important dimension when understanding

the heterogeneous treatment response of people with BPD.

The need for replication is a theme throughout this area of research. Further research

looking at the associations between symptoms and treatment outcomes would help

clarify which findings in the existing literature are robust and emerge across different

82

participant groups and which are accidental findings or specific to the particular

group being studied.

Strengths and Limitations

One limitation which extends across a large proportion of the research is the

representativeness of research samples. Despite previous claims that BPD is more

common in females (DSM-IV-TR, 2000) equal numbers of males and females with

BPD has since been suggested (Grant et al., 2008). The majority of participants in

BPD research are female meaning females are disproportionately represented. For

this reason, findings cannot be assumed to apply to males.

Another common limitation was the exploratory nature of much of the research,

especially in studies using post hoc analyses. These studies often include a large

number of analyses increasing the risk of Type 1 error. There were a limited number

of studies accounting for this with adjusted significance values, often due to small

sample sizes and associated risk of Type 2 error. This is a difficult pay off when

there is a lack of power from limited sample sizes. Research would benefit from

theoretically driven hypotheses on which variables are selected which would help

reduce the risk of Type 1 error due to fewer analyses.

Uncontrolled studies are frequently used in this field of research, although more

controlled studies are required to establish a causal relationship, the uncontrolled

studies have a benefit of good ecological validity. More studies would benefit from

extending their follow-up period to ensure findings are maintained over time.

Limitations of the current review and areas for future research

83

No review could cover all individual differences found in populations of people with

BPD. Factors which could modify treatment outcomes are as limited as the

components of the person undertaking treatment. For this reason future research

investigating predictors of treatment response should draw on psychological theories

of BPD and factors contributing to its maintenance. Many variables are bound to be

important to treatment response, including mentalisation, childhood trauma, and

attachment style were not covered but would be interesting avenues for future

reviews. Trans-diagnostic individual differences or societal factors may be equally or

even more helpful in predicting treatment outcomes and provide interesting avenues

for future research.

This review selected papers which examine treatment response as defined by changes

in quantitative measures of BPD or related general psychopathology. However other

ways of defining treatment response, including qualitative feedback from participants

or measures of quality of life may reveal different predictor variables. This would be

another interesting area for future research.

Inconsistent findings are expected when one takes into account the large number of

individual and therapeutic differences naturally occurring in the reviewed research.

In addition, research took place in a range of different countries with different

healthcare systems. However the research demonstrates good ecological validity

which reflects what is observed in real-life practice. In order to gain a more precise

understanding of how individual differences relate to treatment outcomes more

tightly controlled studies would be beneficial to control for extraneous variables.

The severity findings highlight a potential area of missed treatment. People with less

severe BPD benefit less from treatment. Clinicians may find this lower-severity

84

group harder to treat. It might be necessary to re-consider the treatment needs of

individuals with less severe BPD. Without this we run the risk of only treating BPD

up to a certain level, leaving many people with lower, but significant, difficulties

without viable treatment options. A criticism of the current review is the inability to

form recommendations about symptom level predictors but it is hoped that further

research will provide clarity. In terms of subgroups, it might be advisable for

clinicians to consider the impact of coping and blame attribution style and recognise

different needs of these individuals. Again, it is hoped that future research will

provide further information resulting in clearer recommendations for clinicians.

Conclusion

In conclusion, this review has found confirmatory evidence that baseline severity of

BPD is predictive of better treatment outcomes, especially in skills based treatments

such as STEPPS. This review has begun to look at how different symptoms relate to

treatment outcomes and has found preliminary evidence for the importance of

internalising and externalising styles in subgroups of BPD on treatment outcomes.

Replication is invaluable at this stage, with preliminary findings emerging but in

desperate need of replication. While severity is a reasonably well established

predictor of treatment response (although further replication is welcomed) research

which breaks down this variable into its parts would make a valuable contribution to

demystifying the current findings relating to symptom and subgroup level predictors.

85

References

Altieri, P. (2011). Borderline personality disorder and dialectical behaviour therapy

in an Australian private hospital setting treatment response and BPD

subtypes. (Unpublished doctoral thesis). RMIT University, Melbourne,

Australia.

American Psychiatric Association. (1987). Diagnostic and Statistical Manual of

Mental Disorders,3rd ed, Text Revision (DSM-III-R). Washington, DC:

APA.

American Psychiatric Association. (2000). Diagnostic and Statistical Manual of

Mental Disorders, 4th ed, Text Revision (DSM-IV-TR). Washington, DC:

APA.

American Psychiatric Association. (2001). Practice guideline for the treatment of

patients with borderline personality disorder. The American Journal of

Psychiatry 158(10 Suppl), 1-52.

American Psychiatric Association. (2013). Diagnostic and statistical manual of

c mental disorders (5th ed.). Washington, DC: APA.

Andión. Ó., Ferrer, M., Gancedo, B., Calvo, N., Barral, C., Torrubia, R., & Casas, M.

(2011). Confirmatory factor analysis of borderline personality disorder

symptoms based on two different interviews: The Structured Clinical

Interview for DSM-IV Axis II disorder and the Revised Diagnostic Interview

for Borderlines. Psychiatry Research, 190, 304-308.

86

Andrews, G., Pollock, C., & Stewart, G. (1989). The determination of defense style

by questionnaire. Arch Gen Psychiatry, 46, 455-460.

Arntz, A., Dreessen, L., Schouten, E., & Weertman, A. (2004). Beliefs in personality

disorders: a test with the Personality Disorder Belief Questionnaire. Behav

Res Ther, 42, 1215-1225.

Arntz, A., Stupar-Rutenfrans, S., Bloo, J., van Dyck, R., & Spinhoven, P. (2015).

Prediction of treatment discontinuation and recovery from borderline

personality disorder: Results from an RCT comparing schema therapy and

transference focused psychotherapy. Behaviour Research and Therapy, 74,

60-71. doi:10.1016/j.brat.2015.09.002

Arntz, A., van den Hoorn, M., Cornelis, J., Verheul, R., van den Bosch, L.M.C., &

de Bie, A.J.H.T. (2003). Reliability and validity of the borderline personality

disorder severity index. Journal of Personality Disorders, 17, 45–59.

Asnaani, A., Chelminski, I., Young, D., & Zimmerman, M. (2007). Heterogeneity of

borderline personality disorder: do the number of criteria met make a

difference? Journal of Personality Disorder, 21(6), 615-625.

Barnett, A. G., van der Pols, J. C., & Dobson, A. J. (2005). Regression to the mean:

what it is and how to deal with it. International Journal of Epidemiology,

34(1), 215, 220.

Barnicot, K., Katsakou, C., Bhatti, N., Savill, M., Fearns, N., & Priebe, S. (2012).

Factors predicting the outcome of psychotherapy for borderline personality

87

disorder: A systematic review. Clinical Psychology Review, 32(5), 400-412.

doi:10.1016/j.cpr.2012.04.004

Barratt E. Anxiety and impulsiveness related to psychomotor efficiency. (1959).

Percept Mot Skills. 9, 191– 198

Bateman, A. W., & Fonagy, P. (2004). Psychotherapy for borderline personality

disorder: Mentalization-based treatment of BPD. Oxford: Oxford University

Press.

Bateman, A., & Fonagy, P. (2013). Impact of clinical severity on outcomes of

mentalisation-based treatment for borderline personality disorder. The British

Journal of Psychiatry, 203(3), 221-227. Retrieved from

http://search.ebscohost.com/login.aspx?direct=true&AuthType=ip,shib&db=

psyh& AN=2013-32050-013&site=ehost-live&custid=s4121186

Beck, A. T., & Steer, R. A. (1984). Internal consistencies of the original and revised

Beck Depression Inventories. Journal of Clinical Psychology, 40, 1365–67.

Beck A, Ward CH, Mendelson M, Mock J, Erbaugh J. (1961). An inventory for

measuring depression. Arch Gen Psychiatry, 4, 561–71.

Beck, A. T., Weissman, A., Lester, D., et al. (1974). The measurement of pessimism:

The Hopelessness Scale. Journal of Consulting and Clinical Psychology, 42,

861–5.

Bellino, S., Bozzatello, P., & Bogetto, F. (2015). Combined treatment of borderline

personality disorder with interpersonal psychotherapy and pharmacotherapy:

88

Predictors of response. Psychiatry Research, 226(1), 284-288.

doi:10.1016/j.psychres.2014.12.064

Bernstein-Carlson, E. B., & Putnam, F. W. (1986). Development, reliability, and

validity of a dissociation scale. The Journal of Nervous and Mental Disease,

174, 727–5.

Black, D. W., Allen, J., St. John, D., Pfohl, B., McCormick, B., & Blum, N. (2009).

Predictors of response to systems training for emotional predictability and

problem solving (STEPPS) for borderline personality disorder: An

exploratory study. Acta Psychiatrica Scandinavica, 120(1), 53-61.

doi:10.1111/j.1600-0447.2008.01340.x

Black, D. W., Blum, N., McCormick, B., & Allen, J. (2013). Systems training for

emotional predictability and problem solving (STEPPS) group treatment for

offenders with borderline personality disorder. Journal of Nervous and Mental

Disease, 201(2), 124-129. doi:10.1097/NMD.0b013e31827f6435

Blum, N., Pfohl, B., John, D. S., Monahan, P., & Black, D. W. (2002). STEPPS: a

cognitive-behavioral systems-based group treatment for outpatients with

borderline personality disorder-a preliminary report. Comprehensive

Psychiatry, 43(4), 301-310.

Bohus, M., Haaf, B., Simms, T., Limberger, M. F., Schmahl, C., Unckel, C., et al.

(2004). Effectiveness of inpatient dialectical behavioral therapy for borderline

personality disorder: A controlled trial. Behaviour Research and Therapy, 42(5),

487-499. Retrieved from http://search.proquest.com/docview/71738013?

accountid=17256

89

Bos, E. H., van Wel, E. B., Appelo, M. T., & Verbraak, J. P. M. (2011).

Effectiveness of systems training for emotional predictability and problem

solving (STEPPS) for borderline personality problems in a ‘real-world’

sample: Moderation by diagnosis or severity? Psychotherapy and

Psychosomatics, 80(3), 173-181. doi:10.1159/000321793

Bradley, R., Zittel Conklin, C., & Western, D. (2005). The borderline personality

diagnosis in adolescents: gender differences and subtypes. Journal of Child

Psychology and Psychiatry, 46(9), 1006-1019.

Briere, J. (1995). Trauma Symptom Inventory. USA: Psychological Assessment

Resources, Inc.

British Psychological Society. (2011). Response to the American Psychiatric

Association: DSM-5 Development. Leicester: The British Psychological

Society.

Clarkin, J. F., Yeomans, F. E., & Kernberg, O. F. (2006). Psychotherapy of

Borderline Personality Disorder Focusing on Object Relations. Washington,

DC: American Psychiatric Publishing.

Clarkin, J. F., Foelsch, P. A., & Kernberg, O. F. (2001). The Inventory of Personality

Organization. White Plains, NY: The Personality Disorders Institute,

Department of Psychiatry, Weill College of Medicine of Cornell University.

Clarkson, P. (2003). The therapeutic relationship (2nd ed.). Philadelphia: Whurr

Publishers.

90

Corney, R. H., Clare, A. W., & Fry, J. (1982). The development of a self-report

questionnaire to identify social problems—A pilot study. Psychological

Medicine, 12, 903–909.

Critical Appraisal Skills Programme. (2017). CASP Systemic Review Checklist.

[online]. Available at: http://www.casp-uk.net/checklists. Accessed: 21st

September 2017.

Davidson, K. M. (2007). Cognitive Therapy for Personality Disorders: A Guide for

Clinicians. Second Edition. Routledge.

Davidson, K. M., Tyrer, P., Norrie, J., Palmer, S. J., & Tyrer, H. (2010). Cognitive

therapy v. usual treatment for borderline personality disorder: prospective 6-

year follow-up. The British Journal of Psychiatry, 197, 456-462.

De Jong, K., Nugter,M. A., Polak,M. G.,Wagenborg, J. E. A.,

Spinhoven, P., & Heiser, W. J. (2007). The Outcome Questionnaire

(OQ–45) in a Dutch population: A cross-cultural validation.

Clinical Psychology and Psychotherapy, 14, 288–301.

Depue, R. A., & Lenzenweger, M. F. (2005). A neurobehavioral model of

personality disturbance. In M. F. Lenzenweger & J. F. Clarkin (Eds.), Major

theories of personality disorder (2nd ed.) (pp. 391–453). New York, NY:

Guilford Press.

Derogatis LR, & Melisaratos N. (1983). The Brief Symptom Inventory: An

introductory report. Psychological Medicine, 13, 595–605.

91

Derogatis, L.R. & Savitz, K.L. (2000). The SCL-90-R and the Brief Symptom

Inventory (BSI) in Primary Care In: M.E.Maruish, ed. Handbook of

psychological assessment in primary care settings, Volume 236 Mahwah,

NJ: Lawrence Erlbaum Associates, pp 297-334.

Derogatis, L. R., & Spencer, P. M. (1982). BSI administration and

procedures manual I. Baltimore, MD: Clinical Psychometric

Research.

Digre, E. I., Reece, J., Johnson, A. L., & Thomas, R. A. (2009). Treatment response

in subtypes of borderline personality disorder. Personality and Mental

Health, 3(1), 56-67. doi:10.1002/pmh.64

Ellison, W. D., Rosenstein, L., Chelminski, I., Dalrymple, K., & Zimmerman, M.

(2015).The clinical significance of single feature of borderline personality

disorder: anger, affective instability, impulsivity, and chronic emptiness in

psychiatric outpatients. Journal of Personality Disorders, 30(2), 261-270.

Endicott, J., Spitzer, R. L., Fleiss, J. L., & Cohen, J. (1976). The Global Assessment

Scale: a procedure for measuring overall severity of psychiatric disturbance.

Arch Gen Psychiatry, 33, 766–771.

Eurelings-Bontekoe, E. H. M., Luyten, P., Remijsen, M., & Koelen, J. A. (2010).

Association between personality organization as assessed by theory driven

profiles of Dutch Short Form of MMPI and self-reported structural

personality pathology. Journal of Personality Assessment, 92, 599–609.

92

Eurelings-Bontekoe, E. Peen, J., Noteboom, A., Alkema, M., & Dekker, J. (2012).

Differential treatment response of subtypes of patients with borderline

personality organization, as assessed with theory-driven profiles of the dutch

short form of the MMPI: A naturalistic follow-up study. Journal of

Personality Assessment, 94(4), 380-392. doi:10.1080/00223891.2012.674995

First, M. D., Spitzer, R. L., Gibbon, M., et al. (1996). Structured Clinical Interview

for DSM-IV Axis II Personality Disorders (SCID-II). 2. New York:

Biometrics Research Department, New York State Psychiatric Institute.

Giesen-Bloo, J. H., Wachters, L. M., Schouten, E., & Arntz, A. (2010). The

borderline personality disorder severity index-IV: psychometric evaluation

and dimensional structure. Personality and Individual Differences, 49, 136-

141.

Giesen-Bloo, J., Arntz, A., & Schouten, E. (2006). The borderline personality

checklist: Psychometric evaluation and factorial structure in clinical and

nonclinical samples. In J. Giesen-Bloo (Ed.), Crossing borders: Theory,

assessment and treatment in borderline personality disorder. (pp. 85-101).

Maastricht: Universitaire Pers.

Goldman, H. H., Skodol, A. E., & Lave, T. R. (1992). Revising Axis V for DSM-

IV:areviewof measures of social functioning. American Journal of

Psychiatry, 149, 1148–1156.

Goodman, M., Patil, U., Steffel, L., Avedon, J., Sasso, S., Triebwasser, J., & Stanley,

B. (2010). Treatment utilization by gender in patients with borderline

personality disorder. Journal of Psychiatric Practice, 16(3), 155-63.

93

Grant, B. F., Chou, S. P., Goldstein, R. B., Huang, B., Stinson, F. S., Saha, T. D., et

al. (2008). Prevalence, correlates, disability, and comorbidity of DSM-IV

borderline personality disorder: results from the wave 2 national

epidemiologic survey on alcohol and related conditions. Journal of Clinical

Psychiatry, 69(4), 533-545.

Gratz, K. L. (2001). Measurement of deliberate self-harm: Preliminary data on the

Deliberate Self-Harm Inventory. Journal of Psychopathology and Behavioral

Assessment. 23, 253–263.

Gratz, K. L., Dixon-Gordon, K., & Tull, M. T. (2014). Predictors of treatment

response to an adjunctive emotion regulation group therapy for deliberate self-

harm among women with borderline personality disorder. Personality

Disorders: Theory, Research, and Treatment, 5(1), 97-107.

doi:10.1037/per0000062

Gratz, K. L., & Roemer, L. (2004). Multidimensional assessment of emotion

regulation and dysregulation: Development, factor structure, and

initial validation of the difficulties in emotion regulation scale. Journal of

Psychopathology & Behavioral Assessment, 26, 41–54.

Gunderson, J. G. (2009). Borderline personality disorder: ontogeny of a diagnosis.

American Journal of Psychiatry, 166(5), 530-539.

Guy, W. (1976). Clinical Global Impression (C.G.I.). ECDEU Assessment Manual

for Psychopharmacology. National Institute of Mental Health, US Dept

Health, Education, and Welfare Publication (ADM): Rockville (MD), pp.76–

338.

94

Hathaway, S. R., & McKinley, J. C. (1943). The Minnesota

Multiphasic Personality Inventory. Minneapolis: University of

Minnesota Press.

Hamilton, M. (1959). The assessment of anxiety states by rating. British Journal of

Medical Psychology , 32, 50.

Hamilton, M. (1960). A rating scale for depression. Journal of Neurology,

Neurosurgery & Psychiatry, 23, 56–62.

Hayes, S. C., Strosahl, K., Wilson, K. G., Bissett, R. T., Pistorello, J., Toarmino, D.,

& McCurry, S. M. (2004). Measuring experiential avoidance: A preliminary

test of a working model. The Psychological Record, 54, 553–578.

Horowitz, L. M., Rosenberg, S. E., Baer, B. A., & Ureno, G. (1988). Inventory of

interpersonal problems: psychometric properties and clinical applications.

Journal of Consulting and Clinical Psychology, 56, 885–892.

Hull, J. W., Clarkin, J. F., & Kakuma, T. (1993). Treatment response of borderline

inpatients - a growth curve analysis. Journal of Nervous and Mental Disease,

181(8), 503-509. doi:10.1097/00005053-199308000-00005

Hyler, S.E. (1994). Personality diagnostic questionnaire. New York: New York State

Psychiatric Institute.

Jacobson, N. S., & Truax, P. (1991). Clinical significance: a statistical approach to

defining meaningful change in psychotherapy research. Journal of Consulting

and Clinical Psychology, 59(1), 12-19.

95

Kellogg, S.H., & Young, J. E. (2006). Schema therapy for borderline personality

disorder. Journal of Clinical Psychology, 62, 445-458.

Kernberg, O. F. (1984). Severe personality disorders. New Haven, CT: Yale.

Kinderman, P., & Bentall, R. P. (1996). A new measure of causal locus: The internal,

personal and situational attributions questionnaire. Personality and Individual

Differences, 20, 261-264.

Kleindienst, N., Limberger, M. F., Ebner-Priemer, U. W., Keibel-Mauchnik, J., Dyer,

A., Berger, M., et al. (2011). Dissociation predicts poor response to dialectial

behavioral therapy in female patients with borderline personality disorder.

Journal of Personality Disorders, 25(4), 432-447.

Klonsky, D. (2008). What is emptiness? Clarifying the 7th criterion for borderline

personality disorder. Journal of Personality Disorders, 22(4), 418-426.

Kopala-Sibley, D. C., Zuroff, D. C., Russell, J. J., Moskowitz, D. S., & Paris, J.

(2012). Understanding heterogeneity in borderline personality disorder:

differences in affective reactivity explained by the traits of dependency and

self-criticism. Journal of Abnormal Psychology, 121(3), 680-691.

Lana, F., & Fernandez-San Martin, M. I. (2013). To what extent are specific

psychotherapies for borderline personality efficacious? A systematic review

of published randomised controlled trials. Actas Espanolas de Psiquiatria,

41(4), 242- 252.

Lenzenweger, M. F., Clarkin, J. F., Levy, K. N., Yeomans, F. E., & Kernberg, O. F.

(2012). Predicting domains and rates of change in borderline personality

96

disorder. Personality Disorders: Theory, Research, and Treatment, 3(2), 185-

195. doi:10.1037/a0025872

Linehan, M. M. (1993). Cognitive behavioral treatment of borderline personality

disorder. New York: Guilford Press.

Linehan, M.M., & Comtois, K.A. (1994). Lifetime Parasuicide Count. Seattle, WA:

University of Washington. (Unpublished manuscript).

Linehan, M. M. & Heard, H. L. (1987). Treatment history interview (THI).

University of Washington; Seattle, WA: Unpublished measure.

Linehan, M. M., Heard, H. L., & Wagner, A. (1989). Parasuicide history interview:

Complete assessment of parasuicide behavior. Seattle, WA: University of

Washington.

Lovibond, S.H. & Lovibond, P.F. (1995). Manual for the Depression, Anxiety and

Stress Scales (2nd ed). Sydney: University of New South Wales.

Majani, G. & Callegari, S. (1998). SAT-P Satisfaction Profile. Soddisfazione

Soggettiva e Qualità Della Vita. Erickson, Trento (IT).

Meares, R., Stevenson, J., & Comerford, A. (1999). Psychotherapy with borderline

patients: A comparison between treated and untreated cohorts. The Australian

and New Zealand Journal of Psychiatry, 33(4), 467-472.

Millon, T. (1994). Manual for the MCMI-III. Mineapolis: National Computer

Systems.

97

Moher, D., Liberati, A., Tetzlaff, J., & Altman. D. G. (2009). Preferred reporting

items for systematic reviews and meta-analyses: The PRISMA statement.

PLoS Medicine, 6(7): e1000097. doi: 10.1371/journal.pmed1000097

Nesci, J. B. B. (2009). Identification and clinical utility of subgroups of borderline

personality disorder. (Unpublished doctoral thesis). RMIT University,

Melbourne, Australia.

Newhill, C. E., Eack, S. M., & Mulvey, E. P. (2009). Violent behaviour in borderline

personality. Journal of Personality Disorders, 23(6), 541-554.

National Institute for Health and Care Excellence. (2012). Methods for the

development of NICE public health guidance (third edition). London:

National Institute for Health and Care Excellence (NICE).

Norcross, J. C., & Wampold, W. E. (2011). What works for whom: tailoring

psychotherapy to the person. Journal of Clinical Psychology, 67, 127-132.

Oldman, J. M. (2006). Borderline personality disorder and suicidality. The American

Journal of Psychiatry, 163(1), 20-26.

Rosenberg , M. (1965). Society and the Adolescent Self-image. Princeton, NJ:

University Press.

Ryle, A., & Golynkina, K. (2000). Effectiveness of time-limited cognitive analytic

therapy of borderline personality disorder: Factors associated with outcome.

British Journal of Medical Psychology, 73, 197-210.

doi:10.1348/000711200160426

98

Sanislow, C. A., Grilo, C. M., Morey, L. C., Bender, D. S., Skodol, A. E.,

Gunderson, J. G... McGlashan, T. H. (2002). Confirmatory Factor Analysis of

DSM-IV Criteria for Borderline Personality Disorder: Findings from the

Collaborative Longitudinal Personality Disorders Study. The American

Journal of Psychiatry, 159(2), 284-290.

Sansone, R. A, Wiederman, M. W., & Sansone, L. A. (1998). The Self-Harm

Inventory (SHI): Development of a scale for identifying self-destructive

behaviors and borderline personality disorder. Journal of Clinical

Psychology. 54, 973–983.

Spielberger, CD. (1999). State-Trait Anger Expression Inventory. Odessa, FL:

Psychological Assessment Resources.

Spielberger, C. D., Gorsuch, R. L., & Lushene, R. E. (1970). STAI Manual. Palo

Alto, California: Consulting Psychologists Press.

Sparapani, E. M. (2015). Moderators of treatment outcome in dialectical behavior

therapy: The role of emotion regulation and impulsivity. ProQuest Information

& Learning). Dissertation Abstracts International: Section B: The Sciences and

Engineering, 76 (1-) Retrieved from http://search.ebscohost.com/login.aspx?

direct=true&AuthType=ip,shib&db=psyh&AN=2015-99140-405&site=ehost-

live&custid=s4121186

Spinhoven, P., Giesen-Bloo, J., van Dyck, R., & Arntz, A. (2008). Can assessors and

therapists predict the outcome of long-term psychotherapy in borderline

personality disorder? Journal of Clinical Psychology, 64(6), 667-686.

doi:10.1002/jclp.20466

99

Stoffers, J. M., Voellm, B. A., Ruecker, G., Timmer, A., Huband, N., & Lieb, K.

(2012). Psychological therapies for people with borderline personality

disorder. Cochrane Database of Systematic Reviews, (8),

CD005652.doi:10.1002/14651858.CD005652.pub2

Stone, M. H. (2006). Management of borderline personality disorder: a review of

psychotherapeutic approaches. World Psychiatry, 5(1), 15-20.

Specialist Unit for Review Evidence (SURE: 2013). Retrieved from

http://www.cardiff.ac.uk/specialist-unit-for-review-evidence

Tellegen, A. (1982). Multidimensional Personality Questionnaire

manual. Minneapolis, MN: University of Minnesota Press.

Trompenaars, F. J., Masthoff, E. D., Van Heck, G. L., Hodiamont, P. P., & De Vries,

J. (2005). Content validity, construct validity, and reliability of the

WHOQOL-Bref in a population of Dutch adult psychiatric outpatients. Qual

Life Res, 14, 151–160.

Tyrer, P. (1999). Borderline personality disorder: a motley diagnosis in need of

reform. The Lancet, 354, 2095-2096.

Tyrer, P., Alexander, J., & Ferguson, B. (1987). In P. Tyrer (Ed.), Personality

disorders: diagnosis, management and course (pp. 43–62). London: Wright.

Verheul, R., van, d. B., Koeter, M. W. J., de Ridder, Maria A. J., Stijnen, T., & van,

d. B. (2003). Dialectical behaviour therapy for women with borderline

personality disorder: 12-month, randomised clinical trial in the netherlands. The

British Journal of Psychiatry, 182(2), 135-140. doi:10.1192/bjp.182.2.135

100

Vitaliano, P. P., Russo, J., Carr, J. E., Maiuro, R. D., & Becker, J. (1985). The ways

of coping checklist: Revision and psychometric properties. Multivariate

Behavioral Research, 20, 3-26.

Watson, D. & Clark, L. A. (1994). The PANAS- Manual for the positive and

negative affect schedule-expanded form. Iowa City: The University of Iowa.

Weissman, M. M., & Bothwell, S. (1976). Assessment of social adjustment by

patient self-report. Arch Gen Psychiatry, 33, 1111–5.

Widiger, T. A., & Weissman, M. M. (1991). Epidemiology of borderline personality

disorder. Hospital and Community Psychiatry, 42(10), 1015-1021.

Williams, K. E., Chambless, D. L., & Ahrens, A. (1997). Are emotions frightening?

An extension of the fear of fear construct. Behavior Research and Therapy,

35, 239-248.

World Health Organisation. (1992). The ICD-10 classification of mental and

behavioural disorders: clinical descriptions and diagnostic guidelines.

Geneva, World Health Organization.

Wright, A. G. C., Hallquist, M. N., Beeney, J. E., & Pilkonis, P. A. (2013).

Borderline personality pathology and the stability of interpersonal problems.

Journal of Abnormal Psychology, 122(4), 1-16.

Yen, S., Johnson, J., Costello, E., & Simpson, E. B. (2009). A 5-day dialectical

behavior therapy partial hospital program for women with borderline personality

disorder: Predictors of outcome from a 3-month follow-up study. Journal of

Psychiatric Practice, 15(3), 173-182. doi:10.1097/01.pra.0000351877.45260.70

101

Young, J.E. (1998). Young Schema Questionnaire (short form). New York:

Cognitive Therapy Centre.

Zanarini, M. C. (2003). Zanarini rating scale for borderline personality disorder

(ZAN-BPD): A continuous measure of DSM-IV borderline psychopathology.

Journal of Personality Disorder, 17(3), 233-242.

Zanarini, M. C., Frankenburg, F. R., Sickel, A. E., & Young, L. (1996). Diagnostic

interview for DSM-IV personality disorders. Boston, MA: McLean Hospital.

Zimmerman, M., & Mattia, J. I. (1999). Axis I diagnostic comorbidity and borderline

personality disorder. Comprehensive Psychiatry, 40(4), 245-252.

doi:http://dx.doi.org/10.1016/S0010-440X(99)90123-2

Zittel Conklin, C., Bradley, R., & Western, D. (2006). Affect regulation in

Borderline Personality Disorder. The Journal Nervous and Mental Disease,

194(2), 69-77.

102

Appendix A. Guidelines for authors from Clinical Psychology Review

Removed for ethesis

103

Research Part 2

Predictors of response to Systems Training for Emotional Predictability and Problem Solving (STEPPS) for the treatment of Borderline

Personality Disorder

Word Count: 9,994

Abstract

104

Background: Systems Training for Emotional Predictability and Problem Solving

(STEPPS) has proven useful in the treatment of Borderline Personality Disorder

(BPD). Researchers have begun to examine how individual differences relate to

treatment outcomes in this client group. Severity has been the most consistent

predictor of treatment outcome identified so far. However, BPD is a heterogeneous

diagnosis incorporating a large variety of presentations and symptom profiles. This

research examined the predictive influence of severity as a unidimensional construct

before breaking this variable down by BPD symptom and examining the predictive

influence of each criterion on STEPPS outcomes.

Method: Quantitative, naturalistic data was collected from STEPPS groups across the

South East of England. Baseline measures of overall and symptom level severity

were examined and weekly measures of BPD severity were used to measure

treatment response.

Results: Missing data from drop-out was high. Two approaches to managing missing

data were explored and analyses were run using both; similarities and differences are

discussed. Participants made significant improvements following treatment.

Regression analyses demonstrated an overall effect of severity on outcomes but only

when a deletion approach to missing data was used. Dissociation was the most

reliable symptom level predictor of outcome.

Conclusion: Results are discussed in the context of techniques for management of

missing data. Significant predictors, severity, dissociation and identity disturbance

are discussed in relation to the existing literature. Limitations are discussed along

with recommendations for future research.

Introduction

105

Borderline Personality Disorder (BPD) is a diagnosis which encompasses a wide

range of presentations differing in severity, symptom combination and symptom

severities (American Psychiatric Association [APA], 2001). An overview of these

symptoms can be found in table 1. Individuals with BPD do not only vary in their

symptom presentation but also in their response to treatment (Lana & Fernandez-San

Martin, 2013). Identifying factors which predict treatment response would be

beneficial to our understanding of treatment suitability and help us individualise

treatments. One way of examining this is to investigate whether any of the individual

characteristics of the people entering treatment can be used to predict the variance in

treatment outcomes. Researchers have begun to examine BPD-related predictors of

outcome. This study aims to contribute to this body of research and extend the

existing findings about predictors of outcomes for Systems Training for Emotional

Predictability and Problem Solving (STEPPS; Harvey, Black, & Blum, 2010).

Although BPD is relatively uncommon in the general population (1.16%, Stone,

2006) there are incongruently higher numbers of individuals with BPD in mental

health services, with a prevalence of 15% within inpatients (Widiger & Weissman,

1991), and suicide rates significantly higher than the general population (Pompili,

Girardi, Ruberto, & Tatarelli, 2005). BPD is often synonymous with high risk to self

and has historically been considered difficult to treat (Stone, 2006). This can result in

high levels of emotional distress due to repeated crises (National Institute for Health

and Care Excellence [NICE], 2009). Treatment of BPD incurs a significant cost to

the NHS and prevalence of BPD is associated with additional costs to social services

and wider society (NICE, 2009).

Table 1

106

Symptoms of BPD and definitions provided by the Diagnostic and Statistical Manual- Version four-text revised (2000).

Symptom Definition

Avoids abandonment Frantic efforts to avoid real or imagined abandonment

Unstable relationships A pattern of unstable and intense interpersonal relationships characterized by alternating between extremes of idealization and devaluation

Identity disturbance Markedly and persistently unstable self image or sense of self

Impulsivity Impulsivity in at least two areas that are potentially self damaging

Suicidal or Self-Mutilative Behaviours

Recurrent suicidal behaviour, gestures, or threats, or self mutilating behaviour

Affective Instability Affective instability due to a marked reactivity of mood

Inappropriate Anger Inappropriate, intense anger or difficulty controlling anger

Emptiness Chronic feelings of emptiness

Dissociation/Stress-Related Paranoid Ideation

Transient, stress related paranoid ideation or severe dissociative symptoms

A large systematic review of psychotherapeutic treatment of BPD was conducted by

the Cochrane centre (Stoffers et al., 2012) which concluded that there was evidence

for the beneficial effects of psychological treatment for BPD, particularly for

Dialectic Behavioural Therapy (DBT; Linehan, 1993), but a definite need for further

research into these treatments. Other treatments with a reviewed evidence base

included Mentalisation Based Therapy (MBT; Bateman & Fonagy, 2004), Schema

Focused Therapy (SFT; Kellogg & Young, 2006) and Systems Training for

Emotional Predictability and Problem Solving (STEPPS; Harvey et al., 2010) which

is a skills based group program designed to help with emotion regulation. According

to the self-determination theory (SDT; Ryan, 2005), skills based groups, such as

107

STEPPS, provide alleviation from distress by directly teaching people with BPD the

skills they failed to acquire during normal development due to environmental

deficits. These skills are necessary to manage stress or get our basic psychological

needs met. STEPPS has produced good outcomes using a relatively short-term

treatment program (Bos, Bas van Wel, Appelo & Verbraak, 2011). Considering the

ever-increasing demands on NHS finances and resources, the increased interest in

short-term treatments for BPD such as STEPPS is understandable. These treatments

are not only more cost effective but may have an advantage over longer term

treatments in dealing with the high attrition rates observed in individuals with BPD

(Hopwood, 2006).

Despite a number of effective talking therapies for BPD, the degree to which

individuals benefit from treatment still varies significantly. For example, Bohus et al.

(2004) examined a short-term DBT program and found that while 50% of their

participants made clinically significant improvements, the other 50% did not. A more

recent study predicted that as many as 40% of people diagnosed with BPD who

request treatment may not benefit from it (Lana & Fernandez-San Martin, 2013).

Bos et al. (2011) found that 65.4% of participants showed reliable change following

STEPPS treatment and 21.2% also made clinically significant change. While this is a

respectable level of improvement, there remained 34.6% of participants who did not

make reliable improvements, demonstrating that some people with BPD are more

suited to STEPPS than others. Understanding what underlies this suitability and, in

turn, who is most suited to STEPPS would be practically, financially and ethically

advantageous.

108

As a diagnostic category, BPD incorporates a wide array of individual differences

which might lead us to expect variations in the degree to which people respond to

treatment. There have been a number of non-BPD specific client factors which have

been identified as influencing treatment suitability, including coping style (Beutler &

Moos, 2003), resistance potential (Beutler et al., 1991), motivation and intelligence

(Luborsky, Auerbach, Chandler, Cohen, & Bachrach, 1971) and attachment (Levy,

Meehan, Temes, & Yeomans, 2011). Alongside the typical range of demographic

and severity variables found within psychiatric diagnoses, individuals with BPD also

present with an assortment of co-morbid diagnoses which are the rule rather than the

exception in this population (Zimmerman & Mattia, 1999).

Within the available literature on predictors of treatment outcome for BPD, few

consistent predictors have been identified, with demographics and co-morbid

diagnoses failing to predict response (Barnicot et al., 2012). The most consistent

predictor of treatment response identified thus far is severity of BPD (Barnicot et al.,

2012). Research demonstrates that participants with more severe BPD symptoms

prior to treatment show greater improvements (Black, Blum, McCormick, & Allen,

2013; Gratz, Dixon-Gordon, & Tull, 2014; Black et al., 2009; Bos et al., 2011). Gaw

and Beutler (1995) discussed problem severity and its relationship with motivational

distress. These authors suggest that subjective distress can motivate people to seek

and engage in treatment to reduce distress whereas those with lower severity may

feel more ambivalence towards change. This could provide an explanation for why

more severely affected individuals shower greater improvements from treatment.

Further replication of this effect of severity in different settings would be beneficial.

109

Within this overarching severity variable, the BPD diagnosis catalogues a wide range

of presentations which all meet Diagnostic and Statistical Manual-Text Revised, 4th

Edition (DSM-IV-TR, 2000, APA) criteria. Nine BPD criteria are outlined in the

DSM (see table 1). To reach diagnosis a person is required to meet five or more of

these criteria, allowing for over 150 different symptom profiles (Sanislow et al.,

2002). This huge variety of symptom profiles creates huge diversity in the

presentations and experiences of people with BPD.

Due to the significant heterogeneity in the symptoms of people with BPD and the

varied severities of each of these symptoms, it is useful to look at the influence of

each symptom separately in order to understand the contribution of each on the

variability in treatment response. As BPD is such a varied concept in and of itself,

using an overall severity score runs the risk of masking important individual

differences, which may account for differences in treatment response.

Following research examining naturally occurring changes in BPD symptoms over

time, Zanarini et al. (2007) proposed that symptoms fall into two categories: acute

symptoms which emerge during specific and acute periods of illness and have a

faster rate of remission and temperamental symptoms which are more enduring

features of BPD. Zanarini et al. (2007) argue that primarily treating acute symptoms

could be to the detriment of the psychosocial functioning of individuals with BPD

which is closely linked with temperamental symptoms.

Some researchers have examined the predictive power of different BPD symptoms

on treatment outcome. This pool of research is inconsistent in its selection of the

symptoms it examines, methodological design, treatment modality and measures

used. Unsurprisingly, the findings are equally inconsistent (Bohus et al., 2004;

110

Kleindienst et al., 2011; Arntz, Stupar-Rutenfrans, Bloo, van Dyck, & Spinhoven,

2015; Gratz et al., 2014; Black et al 2009; Yen, Johnson, Costello, & Simpson,

2009). In addition, few studies have examined all BPD symptoms within the same

study. One study which did examine the predictive power of each of the DSM-IV-TR

(APA, 2000) criteria was conducted by Yen et al. (2009). The authors identified that

higher emptiness ratings predicted greater improvements following a 5-day DBT

program which they suggest could be due to the intense social component of this 5-

day group treatment, however this finding does require replication. Emptiness is

classified as a temperamental symptom making it interesting that it is predictive of

outcomes in such a short-term treatment. These authors also note that identity

disturbance might be problematic for shorter term therapy, however it is unclear, at

this stage, how short-term a treatment would need to be for identity disturbance to be

problematic, as this 5-day treatment is particularly short for a BPD treatment.

Identity disturbance is considered an acute symptom so can be expected to show

faster rates of improvements. Two studies examining longer-term treatments

(Lenzeweger, Clarkin, Levy, Yeomans & Kernberg, 2012; Bellino, Bozzatello &

Bogetto, 2015) found higher baseline identity disturbance to be predictive of larger

improvements following treatment. Two studies examining DBT found unstable

relationships to be related to greater improvements (Bohus et al., 2004; Yen et al.,

2009) which fit with its positioning as an acute symptom, although one study failed

to replicate this finding (Kleindienst et al., 2011) meaning it would benefit from

further investigation. Impulsivity and deliberate self harm have not reliably predicted

outcomes (Bellino et al., 2015; Arntz et al., 2015; Yen et al., 2009) or have been

related to poorer outcomes (Ryle & Golynkina., 2000; Gratz et al., 2014). Despite

deliberate self-harm (DSH) being a symptom which changes most readily in

111

community samples, in treatment baseline DSH does not predict more generalised

improvements in BPD or general psychopathology. Further research in this area

could enhance our understanding of who is suited to which treatment as it is likely

that individuals with different presentations may require different treatment strategies

(Livesley, 2012). Despite repeated findings that BPD severity predicts STEPPS

outcome (Black et al., 2013; Black et al., 2009; Bos et al., 2011) researchers have not

yet examined the impact of the different BPD symptoms on STEPPS outcomes.

In the current study predictors of outcome were examined using a measure of BPD at

the beginning of treatment and a weekly measure of symptom change to identify

factors predicting improvements in BPD following STEPPS.

The research begins by looking to replicate the finding that severity of BPD, as a

unidimensional construct, is a useful predictor of treatment outcome using STEPPS

treatment within NHS England settings. The BPD diagnosis was then broken down

into its separate DSM-IV-TR defined symptoms to examine the predictive influence

of each of the individual symptoms on treatment response. None of the existing

research on symptom level predictors has examined STEPPS treatment and there are

inconsistencies in the research using other treatment models so there is a degree of

exploration in this area of the research, however hypotheses were developed based

on the findings from other skills based treatments (largely DBT).

Hypotheses:

1. There will be a statistically significant improvement in BPD symptoms, from

the start to the end of treatment.

112

2. Greater overall symptom severity at the start of treatment will predict

significantly greater BPD symptom improvements following STEPPS.

3. Symptoms of emptiness, identity disturbance and unstable relationships

severity will be related to significantly greater improvements. Impulsivity and

suicidal/self harm behaviours will not predict treatment outcome.

Severity has been found to be predictive of STEPPS outcomes, this study makes the

logical next step of deconstructing and dissecting the ‘severity’ variable into its

individual symptoms to predict STEPPS outcomes. It is hoped that this will add to

our understanding of how different individuals with BPD respond to STEPPS. In

turn, it is hoped that this will move us closer towards an understanding of treatment

suitability which could ultimately contribute to reduced costs from unsuccessful or

unsuited treatment and provide clients with the best opportunity for positive

treatment outcomes.

Method

Design

A quantitative longitudinal naturalistic design was used to explore treatment

response using data from National Health Service (NHS) STEPPS treatment groups1.

1This research project is one third of a larger research project on STEPPS treatment. Two other research projects (URN: 6338462 and URN: 6339971) utilised the same participant group of STEPPS group members to answer different research questions. Materials were developed together and disseminated only once. A full list of measures collected across all three projects can be found in appendix B, highlighting the time point when they were collected and a list of measures used by each of the projects can be found in appendix C. Data collection was divided equally between each of the researchers as was data entry. All data analysis and written work was done individually for each separate research project.

113

Quantitative outcome measures were taken over three time points (repeated

measures): T1 (session 2), T2, (session 10) and T3 (session 20). Data was explored

using quantitative statistical analyses.

The dependent variable was change in BPD symptomatology measured using the

Borderline Evaluation of Symptoms over Time (BEST; Blum, Pfohl, John, Monahan

& Black, 2002) total score which was taken weekly over the 20 week program.

Predictor variables were measured using the Zanarini BPD self-report measure

(ZAN-BPD) (Zanarini & Frankenburg, 2008). The following variables were defined:

Severity of BPD, measured using ZAN-BPD total score,

Individual symptom severity using the ZAN-BPD individual item scores.

Participants

85 participants were recruited across three mental health NHS trusts in the south of

England. All members of STEPPS treatment groups over participating sites were

invited to take part in the study between March 2016 and December 2016.

Participants gave written informed consent to partaking in the research, according to

the procedure approved by South East Coast, Surrey Research Ethics Committee,

and were made aware that they could withdraw from the study at any time.

Five participants (5.9%) withdrew from the study and their data was destroyed. Data

from three participants were removed as they were missing all independent variable

data, and a further six were removed as they were missing all outcome variable data,

leaving a final sample of 71 (see CONSORT diagram in figure 1). Initial power

calculations stipulated that for an effect size of 0.8 with a ‘Linear multiple regression:

fixed model, R2 increase’ between 55-77 participants were required. Attrition rates

114

from previous research (Barnicot, Katsakou, Marougka, & Priebe, 2011) and

discussions with group facilitators were used to make estimations around attrition.

Unfortunately a number of the sites within the study experienced a higher than

expected rate of attrition which reduced the number of participants for the study. Of

the final 71, 46 attended up to or beyond session 15. 35.2% dropped out of treatment

before session 15 and were classified as non-completers. Differences between

completers and non-completers are explored in the results. This attrition rate was

higher than the 25% observed in the 2011 systematic review and meta-analysis of

drop-out from BPD psychotherapy (Barnicot et al., 2011) but is close to the attrition

rate of 36% observed in a preliminary study of STEPPS treatment within the U.K

(Harvey et al., 2010).

Figure 1: CONSORT Diagram showing number of participants at different stages of

the study.

115

Participant ages ranged from 18-58 (mean: 31.94, median: 30). 90.1% were female

and 85.9% reported having a formal diagnosis of BPD. Different NHS Trusts used a

number of methods for assessing suitability for treatment and BPD diagnosis

including standardised measures and clinical interviews. 15.5% of participants were

undergoing an additional therapy during this time, alongside a few participants

engaged in family therapy and CBT, most reported attending ‘psychotherapy’ which

made it difficult to be certain of the range of interventions participants were engaged

in. 7% of participants had engaged with a previous STEPPS treatment but only 1.4%

had previously completed STEPPS. These confounding variables were entered into

statistical analyses to examine any potential bias. See table 2 for further details on

demographics.

Table 2: Table of demographics for participant who completed measures at date collection time points

T1 T2 T3

N 71 36 30

Age in years

(SD)

31.94

(10.78)

38.00

(11.15)

35.22

(14.04)

Gender

Female

Male

90.1%

9.9%

83.3%

16.7%

80.0%

20%

The inclusion criteria for this study matched the inclusion criteria for being accepted

into a STEPPS treatment group. These criteria were as follows: being over the age of

18, having a diagnosis of BPD or BPD traits, motivation and ability to work with a

group and to complete homework tasks. Group members were assessed for treatment

suitability using clinical interviews and standardised assessment measures by group

116

facilitators. Participants were counted as “completers” if they attended to at least

session 15 of a possible 20 sessions; Black et al. (2009) suggest 15 sessions of

STEPPS is the threshold for optimal improvements. The STEPPS program allows

participants to miss a maximum of three sessions, however the data suggested that

this is not consistently followed as 16.9% of participants had data missing from three

consecutive weeks but then continued with treatment. In order to maximise

participant numbers and to work in line with what was being observed in clinical

practice, these participants were included in analyses.

Exclusion criteria also match those of the STEPPS group which prevents the

inclusion of participants with severe self-harm, eating difficulties or substance

misuse behaviours as well as significant risk of violence.

Data collection tools

Measure of dependent variable. The BEST (Blum et al., 2002) was used to

measure change in BPD symptoms (appendix D). The BEST measures BPD severity

and is sensitive to clinical change. It was developed for use within the STEPPS

treatment program and is routinely administered during each session. It is a 15 item

self-report measure which has demonstrated moderate test retest reliability (r=.62)

and high internal consistency (Cronbach Alpha 0.86) with individuals with BPD

about to undergo STEPPS treatment (Pfhol et al., 2009). The internal consistency

from this sample is taken from T1 BEST scores. Items 13-15 on the BEST are

positively worded while all other items are negatively worded, to correct for this,

these items were reverse scored prior to calculating Cronbach’s Alpha. The

Cronbach’s Alpha coefficient for this sample was 0.82, demonstrating good internal

consistency. Pfhol et al. (2009) found a baseline mean of 38.7. In this study,

117

participants had a mean T1 BEST score of 45.77, with higher scores indicating more

severe BPD. Each of the 15 items is rated on a scale of 1-5 and an equation is

provided for calculating a BEST total score. In this study, the BEST was completed

by each participant at each session they attended; these were collected during the

course of the program at T1, T2 and T3.

Measure of predictor variables. All predictor variables within this study are

related to BPD symptomatology and, as such, can be measured using different

dimensions of the Zanarini rating scale of Borderline Personality Disorder (ZAN-

BPD) self-report version (Zanarini & Frankenburg, 2008). The ZAN-BPD is a 9 item

questionnaire which was designed to map on to the 9 DSM-IV-TR (APA, 2000)

criteria for BPD. A self-report version of the ZAN-BPD was published in 2008

(Zanarini & Frankenburg, 2008, see appendix E), this measure was selected as it

reduced facilitator burden as it could be given directly to participants to complete.

The ZAN-BPD provides a severity score for each of the BPD criteria on a likert scale

from 0-4; these scores are summed to provide a total severity score. Higher scores

indicate more severe symptoms. This measure was used to assess all the independent

variables in this study, this was designed to avoid participant burden in light of this

study being only one third of a larger STEPPS research project and to minimise the

number of measures participants were asked to complete. This was done to reduce

the impact on treatment engagement and reduce the number of participants dropping

out of the research trial. This appeared to have been effective as only 5.9% of

participants left the research trial. The ZAN-BPD has demonstrated high convergent

validity with the interview version (median .70) and demonstrated good internal

consistency (.84). The test-retest reliability was in the excellent range (>.75)

118

(Zanarini, Weingeroff, Frankenburg, & Fitzmaurice, 2015). Within this study the

ZAN-BPD had a Cronbach’s alpha of 0.64. This is lower than ideal. It is possible

that the high heterogeneity within the BPD diagnosis led to a violation of Cronbach’s

alpha’s assumption of unidimensionality. Some have suggested that BPD can be

better understood as multiple subgroups rather than a unidimensional construct

(Andión et al., 2011) which may have led to the violation of the assumption of

unidimensionality. Kline (1999) states that when dealing with psychological

constructs values below 0.7 can be expected due to diversity in the constructs being

measured.

ZAN-BPD total score was used as a measure of BPD severity as a predictor of

outcome. Separate items of the ZAN-BPD were used to provide a measure of each of

the nine separate symptoms of BPD as outlined by the DSM-IV-TR (APA, 2000).

Predictor variables were measured at T1.

Covariate variables. Additional variables which could influence outcomes

were measured as covariates; these were identified through discussions with STEPPS

facilitators, consultation with graduates of STEPPS treatment and reviewing the

research literature. These variables were measured using a demographics sheet

provided at T1 (appendix F). Information was gathered on additional treatments,

previous engagement in STEPPS or other forms of treatment, BPD diagnosis and

years since receiving diagnosis. Individual groups were coded for, in order to explore

differences caused by variations in group related factors. Although the manualised

nature of STEPPS should reduce some of this variation it was still deemed important

to account for facilitator style, therapeutic relationship and group dynamics which

may vary between sites and individual groups.

119

Additional demographic details were not included as covariates as a previous review

on predictors of treatment outcomes for BPD found demographics such as age and

gender to be unrelated to outcomes (Barnicot et al. 2012).

Description of treatment program

STEPPS is a 20 week group program (see appendix G for a breakdown of STEPPS

content). The treatment is for outpatients with BPD or traits of BPD. The program is

based on CBT principles and is a skills based group which consists of three

components, psychoeducation around BPD, emotional management skills and

behavioural management skills (Black, Blum, Pfohl, & John, 2004).

14 STEPPS groups were recruited from over the south of England. Facilitators of the

program were Clinical Psychologists, Assistant Psychologists, Counselling

Psychologists, Occupational Therapists, Mental Health Nurses, Social Workers and

Therapy Assistants. All group facilitators attended some form of supervision with

65.8% of facilitators attending STEPPS specific supervision. 34.2% did not specify

the type of supervision but reference was made to peer and group supervision,

debrief and reflection sessions. 26.3% of facilitators stated that they had undergone

STEPPS training. These arrangements were in place to ensure adherence to the

treatment model. Collection of this general information about the groups for use as

covariates in the regression analyses was discussed with and agreed by the South

East Coast Research Ethics Committee (see details in appendix H). Information

about supervision and training arrangements and professional composition of

facilitator teams was recorded (see appendix I). No identifiable information was

gathered about individual facilitators, and they were not participants in this research.

120

As such, it was agreed that consent did not need to be gained for collection of this

information.

Procedure

A more detailed breakdown of the procedure, which was provided to group

facilitators, can be found in appendix J:

STEPPS is a manualised program so the group sessions and routine data

collection were expected to be similar across trusts which allowed for

combined data analysis

Researchers attended the introductory session of each group to provide

potential participants with information about the project as well as a written

information sheet (appendix K). Researchers also delivered to each service

T1, T2 and T3 pre-prepared measures at this time in case of a situation where

researchers were unable to attend a data collection point.

Participants were given a week to look over the information sheet and

consider their participation. Researchers attended session 2 to gather

informed consent (appendix L) from those who wished to participate and took

the T1 measures (see appendix B for measures collected at each stage of the

research, including those from the larger research trial). In relevance to this

study, the ZAN-BPD was an additional measure to what is routinely gathered

during STEPPS and was gathered at T1 after consent was given. During the

consent procedure, participants were asked if they would like to be entered

into a prize draw for one of three Amazon vouchers, if they wished to be

entered they were asked to provide a contact email or home address.

121

Participants were assigned a unique research number, known only to the

research team. Each pack of additional measures had a site code and

participant number attached. A key of participant codes was kept on a secure

drive which was password protected for researchers to identify any

participant who wished to withdraw from the study.

Researchers attended sessions at T2 and T3 to collect the additional measures

from group members participating in the study and photocopied the routinely

collected (BEST) measures. All data was anonymised on site so no

identifiable information was taken off-site.

On the six occasions where researchers were unable to attend a data

collection session the group facilitators placed the box with the additional

measures in the group room for participants to complete and return to the

box. The measures were in labelled envelopes for participants to complete.

These were collected by a researcher at the next available date. Lead

facilitators were consulted prior to agreeing to participate in the research and

agreed to make the box of measures available to participants in the case of a

researcher not being able to attend a data collection time point.

Group facilitators made the routinely collected data (BEST measures)

available to researchers; these were photocopied and anonymised on site.

Originals were left at the service.

Measures were stored securely at the University of Surrey. Data from

participants who withdrew their consent during the duration of group was

destroyed.

Participants who requested a copy of the results were sent a summary report

including information from all three studies upon completion of the project.

122

Ethics

The research was sponsored by the University of Surrey Research Integrity and

Governance Office and was approved by the NHS Multi-Site Research Ethics

Committee (appendix M). R&D approval was gained from each of the NHS trusts

partaking in the research (appendix N). Participant numbers were used to ensure

participant confidentiality. Data was held in accordance with the Data Protection Act,

1998 (Gov.uk, 2015).

Analytic Strategy

Weekly BEST measures were taken for each participant over the 20 week program.

A median BEST total score was taken from the first half of the program (week 1-10)

and a median from the second half of the program (week 11-20). A median score was

selected as it is less biased by extreme scores (bad week or stressful session or

natural fluctuation) (Manikandan, 2011). After the data was cleaned and examined

through descriptive statistics, the reliability of measures were established using

Cronbach’s Alpha and comparisons between completers and non-completers were

statistically explored using t-tests.

Hypothesis 1: T-tests were performed to examine change over the course of

treatment along with reliable change and clinically significant change (Jacobson &

Truax, 1991).

To examine predictors of treatment outcome, the first stage was to determine which

of the T1 predictors or covariates showed any relationship to change on the BEST.

This was done by looking at correlations between variables.

123

The dependent variable for regression analyses was to be BEST Median 2. BEST

Median 1 would be entered into the first step of the regression to account for

participants’ scores at the start of treatment. If predictor variables were too highly

correlated with BEST Median 1(moderate to strong correlation), resulting in

multicollinearity, then a BEST Change score (BEST Median 2 subtracted from

BEST Median 1) could be used as an alternative dependent variable.

Hypothesis 2: Hypothesis 2 would be tested by entering ZAN-BPD total score from

T1 into a regression model predicting the variance in BEST outcomes.

Hypothesis 3: Regression models would be produced to examine which T1 variables

were jointly predictive of BEST outcome. Predictors which had demonstrated a

relationship with either BEST Change or BEST Median 2 would be entered into the

models.

Non-significant contributors would be dropped from the model using a criteria of

p>0.2, to identify the simplest model which predicts the largest amount of variance.

A cut-off of 0.2 was chosen to avoid the removal of important variables which are

controlled for within the model (Kirkwood & Sterne, 2003). Statistical analysis was

completed using SPSS (IBM Corp., 2015).

Additional data

Qualitative data was gathered alongside the quantitative data outlined above. The

responses to the qualitative questions were not deemed to contribute towards

answering the research questions. This information was analysed and can be found in

appendix V. Participants’ answers were often very brief and respondents did not

focus on what aspects of themselves made them suitable or unsuitable to STEPPS

124

and often focused on what they enjoyed or did not enjoy instead. The categories

which emerged from the content analysis did not address the research question which

is why it is not included in the main report. Follow up data was also gathered but

due to a low response rate that would have decreased the power of the analyses, this

data was not included in the analyses. A descriptive exploration of this data can be

found in appendix W.

Results

The results are based on the analysis of 71 participants. The data was cleaned

through examination of maximum and minimum values and the range to identify any

values entered in error, these were corrected using the original data files. No data

was excluded at this stage.

Missing Data

One participant was missing three ZAN-BPD subscores (1.4% of the independent

variable). Little’s MCAR test was performed to check the assumption that this

missing data occurred at random, this test was not significant (chi squared= 10.95,

df= 6, p=.10) demonstrating that the missing data was likely to have occurred at

random. This allowed for the use of Expectation Maximisation (Dempster, Laird, &

Rubin, 1977) to replace these missing values2.

Missing values were much higher across the dependent variable. Median 1 from the

first half of the STEPPS program had 0% missing data, Median 2 had 32.4% missing

data. This reflects participant’s gradual drop out of treatment across the 20 week

program.

2 Analyses were repeated with deletion of this participant and results did not significantly differ.

125

An important statistical decision focused on how to manage this high rate of missing

data. One approach was to use pairwise deletion, this retains the precision but

reduces participant numbers and, in turn, the power of analyses. Some authors have

argued that if there is adequate power for testing hypotheses then deletion is

preferable (Cheema, 2014). Expectation Maximisation (EM) provides another

approach to coping with missing data. This method is based on the assumption that

the data is truly missing at random. Little’s MCAR test confirmed that the missing

BEST data did occur at random; however, Gelman and Hill (2007) query whether

missing data can genuinely be missing at random or whether missingness will

depend on unexamined variables. There are strong arguments for estimating missing

data even when the amount of data missing is large and when data is missing due to

attrition (Graham, 2009). It is argued that this increases the power of analyses and

can protect against biasing from selective drop-out. To manage the pros and cons

associated with these different methods, analyses were conducted first using the

deletion approach and then conducted again following EM with a full estimated

dataset. Results from the two approaches are compared and considered within the

discussion3.

Descriptive Statistics

Descriptive statistics were calculated for each of the independent variables measured

via the ZAN-BPD (total score and individual symptom scores) and on the two BEST

Median scores and the BEST Change score.

Table 3. Table of descriptive statistics of BEST scores (dependent variables)

3 Intent to treat analyses were also performed. Using ITT approach no predictors made significant contributions to understanding the variance in Median 2 given participants’ scores at Median 1. This was likely due to the high rate of drop-out and conservative statistical approach.

126

N Mean SD Skewness Kurtosis BEST Median 1 71 44.08 9.86 -.185 .317BEST Median 2 48 38.10 11.18 -.556 -.047BEST Change score 48 4.79 9.06 -.857 2.018BEST Median 1 post EM 71 43.60 9.02 -.344 .279BEST Median 2 post EM 71 39.96 10.41 -.495 .091BEST Change post EM 71 3.64 8.45 -.495 1.048

Table 4. Descriptive statistics of ZAN-BPD scores (Predictor variables)

N Mean SD Skewness Kurtosis

ZAN-BPD total score 71 20.66 5.47 -.192 -.128

ZAN-BPD Anger 71 2.35 1.04 -.370 -.197

ZAN-BPD Affective instability 71 3.01 0.96 -.916 .480

ZAN-BPD Emptiness 71 2.70 1.02 -.368 -.588

ZAN-BPD Identity disturbance 71 2.46 1.19 -.304 -.831

ZAN-BPD DSH 71 1.70 1.44 .273 -1.271

ZAN-BPD Impulsivity 71 2.06 1.32 -.106 -1.017

ZAN-BPD Dissociation 71 2.63 1.11 -.316 -.992

ZAN-BPD Fear of abandonment 71 1.96 1.33 .080 -1.174

ZAN-BPD Relationship instability 71 1.77 1.23 .163 -1.030

Tests of normality. Z scores were calculated from Skewness and Kurtosis

values and compared to a 1.96 criteria to establish whether these values were

significant (at .05 level). See appendix O and P for further details.

Visual inspection of histograms and boxplots (appendix O and P) suggest that most

of the data was normally distributed enough for parametric testing; the one exception

was DSH. Where necessary, a non-parametric equivalent was used when exploring

this variable.

Change over course of treatment

127

To test hypothesis 1 t-tests were used to compare scores at the start and end of

treatment. There was a statistically significant reduction from Median 1 (M: 44.08) to

Median 2 (M: 38.10) in BEST scores, t= 3.66, df= 47, p=.001. This change remained

significant to the same degree when using the full estimated sample (t=3.63, df=70,

p=.001), supporting the hypothesis that there would be a significant improvement in

BPD symptoms over the course of treatment. Reliable change index (RCI) (Jacobson

& Traux, 1991) was calculated using the full sample (N=71) as this is a better

estimate of the population tendency. The RCI was 10.61. The mean change did not

reach a level which would be categorised as a reliable improvement. Only 25% of

those who had a BEST change score (N=48) made a positive change of this degree

and only 18.31% of the full estimated sample. No participants made clinically

significant change (using criterion A: 25.25) (Jacobson & Traux, 1991) in either

sample.

Comparisons of completers and non-completers

Participants who attended STEPPS up to and beyond session 15 were sent follow-up

questionnaires (see appendix W for further information) and, as such, were

pragmatically classified as completers. 46 participants attended to at least session 15,

compared to 25 who dropped out before session 15. Comparisons of these groups

were examined to explore any significant between group differences. An alternative

cut-off point for completers (session 18), based on STEPPS criteria of a maximum of

3 sessions that can be missed, was also calculated and can be found in appendix Q.

Cut-off at session 15 was used as it more closely matched the procedure for

administering follow-up questionnaires.

128

Completers were compared to non-completers on each of the T1 measures using t-

tests (except DSH which was compared using Mann Whitney U test). The only

variable which the groups significantly differed on was emptiness. There was a

significant difference in the scores of completers (M= 2.52, SD= 1.049) and non-

completers (M=3.04, SD=0.89; t= 2.09, df= 69, p=.04) with non-completers having

significantly higher emptiness scores at T1. The magnitude of this difference (mean

difference= .518, 95% CI: .025 to 1.012) was moderate (eta squared=.06) according

to Cohen’s 1988 criteria.

Correlations between predictor and dependent variables

Relationships between independent variables and BEST dependent variables (BEST

Median 2 and BEST Change score) were explored through correlation statistics and

are summarised in table 5 and 6 (further information and correlation with covariates

can be found in appendix R). Spearman’s rho is reported. The variables identified for

entering into the regression analyses did not differ from when Pearson’s correlations

were used for continuous variables so Spearman’s rho is reported throughout for

consistency and clarity. Separate calculations were completed for the pairwise

deletion sample and the full estimated sample. These statistics were run to identify

the variables which show a relationship with the dependent BEST variables and

filters down which would be entered into the subsequent regression models.

Using the pairwise deletion sample the only confounding variable with a significant

correlation with BEST Change was psychiatric inpatient service usage (p=.008).

None of the confounding variables had a significant relationship with BEST Median

2 (see appendix R for covariate correlations).

129

ZAN-BPD total score had a significant relationship with BEST Median 2 (p=.037).

Dissociation significantly correlated with BEST Median 2 (p=.003). Identity

disturbance significantly correlated with BEST Change score (p=.016) as did

Relationship instability (p=.048).

Using the full estimated sample, again, the only confounding variable with a

significant correlation was psychiatric inpatient service usage with BEST Change

(p=.009) (appendix R). Looking at the predictor variables ZAN-BPD total score had

a significant relationship with BEST Median 2 (p=.017). Emptiness significantly

correlated with BEST Median 2 (p=.024) as did Dissociation (p=.001) and

Relationship instability significantly correlated with BEST Change score (p=.039).

130

Table 5: Inter-correlation matrix for severity and symptom variables with BEST dependent variables using pairwise deletion sample N=48

BESTMed1N=48

BESTMed2N=48

BESTChangN=48

ZanTotal

N=48

ZanAngerN=48

ZanAffectN=48

ZanEmptiN=48

ZanIdentiN=48

ZanDSHN=48

ZanImpulsN=48

ZanDissocN=48

BESTMed1 N=48

1.000 .557** .217 .735** .315* .409** .213 .343* .434** .522** .440**

BESTMed2 EMN=48

1.000 -.618** .302* .148 .059 .130 -.051 .127 .219 .423**

BESTChang EMN=48

1.000 .228 .110 .176 -.068 .346* .259 .142 -.166

ZanTotalN=48 1.000 .504** .530** .267 .506** .514** .572** .648**

ZanAngerN=48 1.000 .335** -.051 -.095 -.004 .321* .222

ZanAffectN=48 1.000 .212 -.014 .138 .365* .209

ZanEmptiN=48 1.000 .409** -.158 .001 .335*

ZanIdentiN=48 1.000 .442** .063 .273

ZanDSHN=48 1.000 .313* .136

ZanImpuls N=48 1.000 .309*

ZanDissocN=48 1.000

ZanAband N=48ZanRelatiN=48Correlations using the pairwise deletion sample (N=48). Spearman’s rho is reported. The variables identified for entering into the regression analyses did not differ when Pearson’s correlations were used for continuous variables so Spearman’s rho is reported throughout the table. Correlation in bold were examined to establish severity and symptom level predictors with a relationship with either BEST Median 2 or BEST Change scores for entering into regression models

Key:

BESTMed2: BEST Median 2 score from second half of treatment (pairwise deletion sample)BESTChang: BEST Change score (pairwise deletion sample)BESTMed2 EM: BEST Median 2 score from second half of treatment (Full estimated sample)BESTChang EM: BEST Change score (Full estimated sample)

* Correlation is significant at the 0.05 level** Correlation is significant at the 0.01 level

ZanTotal: ZAN-BPD total score from T1ZanAnger: ZAN-BPD Anger symptom score from T1ZanAffect: ZAN-BPD Affect symptom score from T1ZanEmpti: ZAN-BPD Emptiness symptom score from T1ZanIdentu: ZAN-BPD Identity disturbance symptom score from T1ZanDSH: ZAN-BPD Deliberate self harm symptom score from T1ZanImpuls: ZAN-BPD Impulsivity symptom score from T1ZanDissoc: ZAN-BPD Dissociation symptom score from T1ZanAband: ZAN-BPD Fear of abandonment symptom score from T1ZanRelati: ZAN-BPD Relationship instability symptom score from T1

131

Table 6: Inter-correlation matrix for severity and symptom variables with BEST dependent variables, using the full estimate sample N=71

BESTMed1N=71

BESTMed2N=71

BESTChangN=71

ZanTotal

N=71

ZanAngerN=71

ZanAffectN=71

ZanEmptiN=71

ZanIdentiN=71

ZanDSHN=71

ZanImpulsN=71

ZanDissocN=71

ZanAbandN=71

ZanRelatiN=71

BESTMed1 N=71

1.000

.557** .217 .601

**.276*

.302*

.301*

.378**

.372**

.376**

.367**

.198

.309**

BESTMed2 EMN=71

1.000

-.572**

.282*

.124 .112

.268*

.181

.114 .122 .38

3**.058 .022

BESTChang EMN=71

1.000 .201 .17

6 .024 -.062

.159

.223 .200 -.12

3.039

.246*

ZanTotalN=71

1.000

.464**

.475**

.299*

.417**

.591**

.575**

.559**

.459**

.647**

ZanAngerN=71

1.000

.313**

-.046

-.118

.046

.258*

.179

.091

.517**

ZanAffectN=71

1.000

.130

-.082

.226 .210 .16

3.131

.411**

ZanEmptiN=71

1.000

.420**

-.019 .003 .36

5**-.16

3-.01

0

ZanIdentiN=71

1.000

.234*

.007 .329**

.081 .055

ZanDSHN=71

1.000

.453**

.105

.268* .230

ZanImpuls N=71

1.000

.149

.093

.243*

ZanDissocN=71

1.000

.182

.283*

ZanAband N=71

1.000

.371**

ZanRelatiN=71

1.000

132

Regression Models

Variables which correlated with either BEST Change or BEST Median 2 were

entered into the regression models reported in tables 7-10. The following sections

outline these regression models. Preliminary analyses were conducted to ensure no

violation of the assumptions of normality, linearity, multicollinearity and

homoscedasticity (see appendix S for details and line and scatterplots for each of the

regressions). Only one participant had an inpatient psychiatric admission at T1 which

was statistically identified as a significant outlier so this case was removed from all

regression analyses.

Severity models. To test hypothesis 2 a simple regression model was used.

BEST Change score was used as the dependent variable as ZAN-BPD total score and

BEST Median 1 were highly correlated (r2=.601, n=71, p<.001) meaning entering

both of these variables into the same regression model was not meaningful as it

violated assumptions of multicollinearity. ZAN-BPD total score was entered into a

simple linear regression to predict BEST Change. In this model (reported in table 7)

T1 severity explained 10.8% the variance observed in BEST Change score, F(1, 45)=

5.47, p=.024. Severity, measured via the ZAN-BPD total score, had a positive,

medium sized coefficient with BEST Change score, r=.329 n=48, p= .024. Severity

significantly contributed to a model predicting improvements in BPD symptoms

following STEPPS, supporting hypothesis 2 using the pairwise deletion sample.

133

Table 7: Results of a Regression model predicting BEST Change scres using ZAN-BPD total score with the pairwise deletion sampleLinear Model: Severity Predictor Sample: Pairwise Deletion SampleVariable R 2 B (SE) Standardised

coefficient Beta

t Significance p value

ZAN-BPD Total score

.108 .511 (.219)

.329 2.338 .024

R² .108 F 5.465Adjusted R² .088 p (F-statistic) .024Standard Error of Estimates

8.06

When the estimated full sample was used, the model using ZAN-BPD total score

(reported in table 8) no longer reached significant in predicting the variance in BEST

Change score, F(1, 45)= 3.08, p=.084). In this sample ZAN-BPD total score and

BEST Change had a small positive correlation that was not significant, r=.208, n=71,

p=.084. When the full estimated sample was used hypothesis 2 was no longer

supported.

Table 8: Results of a regression model predicting BEST Change score using ZAN-BPD total score with the estimated full sampleLinear Model: Severity Predictor Sample: Estimated Full SampleVariable R 2 B (SE) Standardised

coefficient Beta

t Significance p value

ZAN-BPD Total score

.043 .311 (.177)

.208 1.756 .084

R² .043 F 3.08Adjusted R² .029 p (F-statistic) .084Standard Error of Estimates

7.99

Scatterplots of the significant severity model found with the pairwise deletion sample

and the non-significant severity model using with full estimated sample can be seen

in figure 2 with data points more concentrated in the full estimated sample

demonstrating a stronger relationship between the variables with this sample.

134

Symptom models. To test hypothesis 3 symptom variables which had

demonstrated a relationship with either BEST Change score or BEST Median 2 in

the correlation analyses were entered into a regression model predicting BEST

Median 2. An alternative approach involving adding all 9 symptom predictors is

explored in appendix T.

BEST Median 1 was entered at Step 1, explaining 45.7% of the variance in BEST

Median 2. Unstable relationships, dissociation and identity disturbance were then

entered in Step 2. Unstable relationships did not significantly contribute to the model

(p=.229); however, removal of this item resulted in a slightly lower R² value so

unstable relationships remained in the final model (reported in table 9) but did not

make a unique contribution. The final model predicted 59.2% (F(4, 42)=15.23,

p<.001) of the variance in Median 2 scores. Dissociation, identity disturbance and

unstable relationships explained an additional 13.5% of the variance in BEST

Median 2 after controlling for BEST Median 1. In the final model, both identity

. Scatter plots of the severity model using the pairwise deletion sample (left) and the

135

disturbance and dissociation made statistically significant contributions, with the

identity disturbance recording a higher beta value (beta=-.320, p=.005) than

dissociation (beta=.302, p=.010). Eigenvalues were examined and each predictor had

most of its variance loading onto different dimensions which demonstrated no

multicollinearity (see appendix U). Hypothesis 3 predicted that identity disturbance,

unstable relationships and emptiness would predict outcomes. Of these three, only

identity disturbance significantly contributed to this model so the results were

inconsistent with hypothesis 3. Higher identity disturbance led to lower scores

following treatment when taking into account scores at T1, whereas dissociation was

associated with higher scores following treatment when taking into account T1

scores.

Table 9: Results of a regression model predicting BEST Median 2 using BEST Median 1 and symptom predictors with the pairwise deletion sample

Hierarchical Model: Symptom Predictors

Sample: Pairwise Deletion Sample

Variable R 2 B (SE) Standardised coefficient Beta

t Significance p value

Step 1 of regression

BEST Median 1

.457 .822 (.134)

.710 6.111

<.001

Identity Disturbance

-.3.025

(1.021)

-.320 -2.96

4

.005

Dissociation 3.037 (1.121

)

.302 2.710

.010

Step 2 of regression

Unstable Relationships

.592-1.177 (.965)

-.130 -1.21

9

.229

Full model statistics

R²Adjusted R²Standard Error of Estimates

.592

.5537.511

Fp (F-statistic)Step 2 R² Change

15.23<.001.135

136

Using the estimated full sample, variables which had correlated with either BEST

Change score or BEST Median 2 were entered into a regression model (unstable

relationships, emptiness and dissociation) along with BEST Median 1. Non-

significant contributors, with p>0.2, were removed, one at a time, starting with the

variable with the highest p value. Removing emptiness the variance explained by the

model remained at 49.7%. Removing unstable relationships (second highest p value)

reduced the variance explained to 46.7% so the final model (reported in table 10)

included BEST Median 1, unstable relationships and dissociation F(3, 66)=21.70,

p<.001. Both BEST Median score 1 (beta=.634, p<.001) and dissociation (beta=.223,

p=.024) were significant contributors to this model, although unstable relationships

was approaching significant (p=.052). In terms of hypothesis 3, neither emptiness or

identity disturbance were found to predict outcomes using this sample. Unstable

relationships also did not make a significant contribution to the model predicting

outcomes, going against the hypothesis.

Table 10: Results of a regression model predicting BEST Median 2 using BEST Median 1 and symptom predictors with the full estimated sample

Hierarchical Model: Symptom Predictors

Sample: Estimated Full Sample

Variable R 2 B (SE)

Standardised coefficient

Beta

t Significance p value

Step 1 of regression

BEST Median 1

.436 .745 (.116)

.634 6.422 <.001

Dissociation 2.092 (.905)

.223 2.311 .024

Step 2 of regression

Unstable Relationships

.497 -1.576 (.795)

-.187 -1.981 .052

Full model statistics

R²Adjusted R²Standard Error of Estimates

.497

.4747.6

Fp (F-statistic)Step 2 R² Change

21.70<.001.061

137

Scatterplots of the symptom model found with the pairwise deletion sample and the

model using with full estimated sample can be seen in figure 3 showing a similar

sped and strength of relationship across both samples.

Discussion

It was hypothesised that there would be an improvement in BPD symptoms over the

course of STEPPS treatment. This hypothesis was supported with a statistically

significant improvement in BEST scores. However these improvements were not

large enough to be classified as a reliable change or clinically significant change

(Jacobson & Truax, 1991). Hypothesis 2 predicted that greater BPD severity, as

measured by the ZAN-BPD total score, would predict greater improvements. When

then pairwise deletion sample was used for the regression then this hypothesis was

supported, however when the full estimated sample was used severity no longer

significantly predicted improvements. Finally, hypothesis 3 predicted that symptoms

of emptiness, unstable relationships and identity disturbance would predict

improvements. Neither of the samples fully supported this hypothesis but when the

pairwsie sample was used identity disturbance significantly predicted outcomes. As

predicted, neither impulsivity nor DSH significantly predicted outcomes.

Figure 3. Scatter plots of the symptom model using the pairwise deletion sample (left) and the full estimated sample (right)

138

Findings supported the hypothesis that there would be a significant improvement in

BPD symptoms over the course of treatment; however, only 25% of participants

made reliable improvements (Jacobson & Truax, 1991). The percentage meeting

reliable change in this study was much lower than the 65.4% found in previous

research examining STEPPS treatment (Bos et al., 2011); however these authors

calculated reliable change on the basis of a measure of general psychopathology

(Symptom Checklist-90; Arrindell & Ettema, 2003) rather than their measure of

BPD. In the current study STEPPS was not used as an augmentation therapy as

consistently or in the same way as in other studies (only 15.5% of participants were

receiving additional therapy although many received other supportive treatment such

as care coordination) so the results cannot be easily compared to studies in which

participants were also receiving individual therapy. This may have contributed,

somewhat, to the lower rates of statistically reliable change observed in this study

compared to rates in other research (Bos et al., 2011).

Previous research has found severity to be predictive of larger improvements

following STEPPS treatment (Black et al., 2009; Black et al., 2013; Bos et al., 2011);

this study replicated this finding, with ZAN-BPD total score predicting change in

BEST score when pairwise deletion was used, however not when the full estimated

sample was used. The amount of variance explained by severity in these past papers

is not clearly reported. This is unfortunate as it would be beneficial to be able to

compare the different contributions of severity in each of these studies. The

observation that individuals with more severe BPD at T1 demonstrated larger

improvements could be a result of a regression to the mean effect. However, if this

was the case, it would be expected that a similar effect would be observed when

139

using the estimated full sample as EM reduces the precision of the data and

regression to the mean is more noticeable with increased measurement error (Barnett,

van der Pols & Dobson, 2005). Bos et al. (2011) proposed that the observed effect of

severity could not be assigned to a regression to the mean in their study as this was

only observed in those assigned to the STEPPS treatment condition. This suggests a

true effect of severity on treatment outcomes. This would provide evidence for the

theory by Gaw and Beutler (1995) that more severely affected individuals can benefit

from motivational distress which can increase engagement with treatment. A

replication of the current study with a randomised design would reduce the impact of

regression to the mean and provide further clarity on the effect of severity.

The finding that severity predicts better outcomes in this skills-based treatment

suggests that teaching skills that failed to develop during childhood (Ryan, 2005) is a

useful way to treat BPD. Some studies examining more reflective treatments (MBT

and CAT) have not found this effect of severity on outcomes (Ryle & Golynkina,

2000; Bateman & Fonagy, 2013). The self determination theory states that reflective

therapies work by providing acceptance and validation so that a person can continue

with their development and naturally obtain these missing skills (Ryan, 2005).

However this pattern of skills-based groups finding a predictive effect of baseline

severity but reflective treatments failing to find this effect, might suggest that more

severely affected individuals benefit more from being directly taught missing coping

strategies rather than allowing them to develop naturally. This is speculative at this

stage and controlled research trials examining the differential effect of severity on

different treatment approaches would be required to provide clarity on this.

140

Despite missing data being a wide-spread problem in psychological research,

statistical methods of imputing or estimating missing data are not commonly used

(Cheema, 2014; Graham, 2009). In this study there were a number of differences

between findings when the non-manipulated, pairwise deletion sample was used

compared to the estimated full sample dataset. One of these differences was that

when a full sample dataset was examined, the effect of severity was no longer

observed. Previous studies which observed the effect of severity on STEPPS

outcome (Black et al., 2009; Black et al., 2013) do not report having imputed missing

data from participants who dropped out of treatment. It is possible that these results

may not have held if a full sample was imputed. Excluding participants who do not

complete treatment can risk biasing results so that conclusions are drawn which are

not reflective of the STEPPS population as a whole. By considering the full sample,

it might be possible to draw more cautious, and possibly more accurate, conclusions

about the predictive power of severity on treatment outcomes. One study which

found baseline severity to be related to larger improvements following STEPPS,

completed by Bos et al. (2011), used intention to treat analyses where pre-treatment

scores were carried forward. They continued to find a significant effect of baseline

severity, although estimated differences were smaller using this approach. It is also

worth noting that the sample size was considerably larger than the current study and

their percentage of participants not receiving their allotted treatment was lower,

which increases the power of their analyses to pick up this effect of severity even

when using the conservative ITT approach. The lower power in this study could lead

to smaller effects being missed when using more conservative approaches.

141

There are a number of possible explanations for the different results observed when

using these two samples. It is first worth questioning whether the sample of

participants who completed treatment was biased by selective drop-out. Completers

differed from non-completers on T1 symptoms of emptiness but on no other T1 trait

(including severity). Emptiness has been shown to be related to more psychosocial

impairments than other BPD symptoms (Ellison, Rosenstein, Chelminski,

Dalrymple, & Zimmerman, 2016). Emptiness has also been related to feelings of

hopelessness (Klonsky, 2008) these factors may have increased the likelihood of

drop-out. Previous research has found baseline impulsivity to be predictive of

attrition from STEPPS (Black et al., 2009) but this study did not replicate this

finding. Future research which examines whether emptiness reliably predicts attrition

from STEPPS would be of value. If this was confirmed this could indicate an

important target group for efforts in treatment retention, who may require additional

psychosocial support or work targeting feelings of hopelessness or may be better

suited to an alternative therapy.

Foster and Fang (2004) argue that the only way to reliably establish the best model

for statistically managing attrition is to understand the missing data mechanism.

These authors highlight how attrition is linked with individual characteristics and this

relationship is never so complex as in intervention-based research. This has led some

to question the plausibility of the missing at random assumption (Raykov & West,

2015; Potthoff, Tudor, Pieper, & Hasselblad, 2006). There are a huge number of

other variables which were not examined in this study which may have differed

between completers and non-completers. It is therefore feasible that some of the

142

additional, unexamined variables could have a moderating effect on severity in those

participants who ended treatment early.

Another reasonable hypothesis is that it is a purely statistical effect being observed.

EM increases the power of statistical analyses by increasing the sample size,

however, in doing so it reduces the precision of analyses by decreasing the reliability

of the data. EM can increase the risk of Type 2 error. Excluding participants who

drop-out can also bias estimations of parameters as well as reducing statistical power

(Kang, 2013). This trade-off may be the leading, or only, contributor to the different

results observed in this paper.

It is also important not to neglect the possibility that the results observed here are a

random effect related to this specific sample and would not be observed in

replication. Or, of course, there is always a reasonable likelihood that it is a

combination of these factors, and others beyond. However it raises an interesting

question, not only for our understanding of how severity relates to STEPPS treatment

outcomes, but also a more general appreciation and recognition of the impact of our

statistical decision-making when dealing with missing data. Future research

examining the role of severity in predicting STEPPS outcomes with management of

missing data through estimation likelihood and a larger sample size to provide better

precision in estimating effects would be interesting to pull apart these results.

When symptom level predictors were examined in the non-estimated sample,

dissociation and identity disturbance were significantly able to predict variance in

scores at the end of treatment, given where they were at the start of treatment, by an

extra 12%. It was hypothesised that identity disturbance would be related to

outcomes but dissociation was an unexpected finding. This suggests that the

143

prevalence of these symptoms at the beginning of treatment could give some

indication towards a person’s suitability to STEPPS treatment. After taking into

account BEST scores at the beginning of treatment, participants with lower

dissociation had better BEST scores at the end of treatment (lower scores) and those

with higher scores on identity disturbance demonstrated lower scores at the end of

treatment when taking into account their scores at the beginning of treatment.

Previous studies have found mixed results in relation to identity disturbance, Yen et

al. (2009) found that identity disturbance predicted fewer improvements following

DBT, however Lenzenweger et al. (2012) and Bellino et al. (2015) found identity

disturbance to predict higher rates of improvement. Yen et al. (2009) examined a

very short DBT treatment. It is possible that this short-term treatment was not suited

to individuals with higher rates of identity disturbance. STEPPS is also a relatively

short-term treatment in terms of BPD therapies, however the results of this study

might suggest that 20 weeks is sufficient to engage individuals with identity

disturbance, although this is suggested cautiously at this stage, as this finding was not

replicated using the full estimation sample. It is possible that the group format of

STEPPS benefits individuals with more severe identity disturbance by being

facilitative of improvements in this more readily changeable symptom.

When the estimated full sample dataset was examined, and after controlling for

BEST scores at the start of treatment, only unstable relationships and dissociation

were left in the final model and, of the two, only dissociation made a significant

contribution. Dissociation was the only symptom predictor that contributed to

regression models with both the deletion approach and the full estimated sample,

suggesting this is a more robust finding of this study. The theory of structural

144

dissociation of personality (Van der Hart, Nijenhuis, Steele, & Brown, 2004) claims

that dissociated self-states occur when personality is divided into an emotional part

and an apparently normal part of the personality. The apparently normal part of the

personality attempts to avoid the emotional part through detachment and numbing or

dissociation (Mosquera, Gonzalez, & Leeds, 2014). Arntz et al. (2015) found

dissociation predicted smaller change of recovery in Schema Therapy and

Transference Focused Therapy. Kliendienst et al. (2011) examined DBT and found

that lower dissociation scores were related to better treatment outcomes. They

suggest that this is because higher rates of dissociation interrupt learning during

treatment. Individuals with BPD with dissociative symptoms show disrupted

emotional-learning compared to BPD participants without dissociative symptoms

(Ebner-Priemer et al, 2009). In this study, participants who made larger

improvements in STEPPS had lower dissociation scores. It is possible that

participants with higher rates of this chronic and robust BPD symptom will

experience more dissociative states, which may inhibit their ability to process or

internalise the content of STEPPS to the same degree. If further research confirmed

this hypothesis, implications for clinicians would evolve around management of

dissociation within treatment or providing grounding training prior to treatment

commencement.

Emptiness and unstable relationships were both found to correlate with outcomes

when using the full estimated sample. However, when entered into regression

models, neither made significant contributions. These findings do not support the

hypothesis that emptiness and unstable relationships would be predictive of greater

improvements, although it is possible that their effects are mediated through some of

145

the other symptoms. The evidence for this may be stronger for unstable relationships,

which did not make a significant contribution to either of the symptom models but

remained a non-significant predictor in both, as removing it reduced the strength of

the models. Neither impulsivity nor suicidal/self harm behaviours were related to

outcomes, which supports the hypothesis that these symptoms would not predict

outcomes. In considering Zanarini et al.’s (2007) classifications of symptoms, one

may have predicted that acute symptoms, which show faster rates of remission,

would be more predictive of outcomes. However this would assume that therapy acts

by speeding up naturally occurring remission. It would be interesting, in future, to

use an outcome measure which looks at the rates of change in the different symptoms

rather than only looking at an overall BPD severity.

Findings from this study would benefit from replication, however if the findings

were found to be reliable over replication studies there are a number of useful

clinical applications to consider. Emptiness was found to be a predictor of treatment

drop-out, this may highlight a helpful subgroup of people with BPD to target for

treatment retention efforts. The finding that dissociation predicts poorer outcomes

suggests that clinically individuals with high rates of dissociation may benefit from

preliminary work to help them manage dissociative states, for example through

grounding techniques, to prevent dissociation from disrupting learning. As

individuals with less severe BPD make less improvements this may highlight a

subsection of people with BPD who may require a different approach or focus of

intervention. This would allow for continued recovery. Finally, it is hoped that this

research highlights the importance of considering the different methods of managing

missing data. It is hoped that an implication of this research would be that future

146

researchers would consider the impact of their statistical decisions about managing

missing data and the biases this can introduce.

Limitations of this study include it being non-randomised, so we are unable to

conclude whether similar outcomes would have been observed in individuals who

were not receiving STEPPS treatment. Without an element of manipulation, cause

and effect cannot be inferred. A controlled study would also reduce the risk of

assigning regression to the mean as clinical change. Despite exhausting all available

avenues within the time-frame of this study, the participant numbers are relatively

small, which limits the generalisability of the findings. Representativeness of the

sample can also be called into question as the majority of participants were female

and recent estimates contradict the previously held belief that BPD is more common

amongst females (Grant et al., 2008). Replication with larger participant numbers

within NHS settings would be desirable. Statistically, it would have been beneficial

to control for the effect of individual group difference, however participant numbers

were too low to include each group as a nominal variable into a regression. There

were some inter-group differences in degrees of supervision and training, which

raises questions of treatment fidelity. Tighter controls for reducing these group

differences would be beneficial. A statistical limitation of the current study is the

high risk of Type 1 error due to the large amount of analyses. A Bonferroni

correction for the correlation analyses was considered however due to the relatively

low sample size and associated risk of Type 2 error this was not applied. A larger

sample size would have permitted this protection against Type 1 error.

Both the independent and dependent variables in this study were based on participant

self report. This was designed to reduce burden on group facilitators and potentially

147

allow for higher participant recruitment. There is always an element of subjective

self-report bias in using such measures. Objective measures from a professional

working with the participants would be desirable to corroborate participant self-

report wherever possible in future studies. Covariates were taken into consideration

and factored into analyses where appropriate. However, not all covariates could be

examined and it is possible that key variables may have been missed, such as

medication use. Only one dependent variable measure was used which was a

symptom measure of BPD. Using only this measure may have masked changes in

other domains. The STEPPS literature also highlights changes in quality of life and

general psychopathology (Bos et al., 2011). If, as Zanarini et al. (2007) suggest,

temperamental symptoms interfere more with psychosocial functioning, we might

expect to see more evidence of an effect of these symptoms if a measure of quality of

life was adopted rather than symptom based measures.

Follow-up data was sought and gathered but unfortunately there was a low response

rate, so including this data in the regressions would have drastically reduced the

power of the analyses. Descriptive statistics on the responses received can be found

in appendix W. A higher rate of follow-up data which allowed for further statistical

analysis would be desirable to explore whether any improvements were maintained

following the end of STEPPS.

The variables in this study were based on the DSM-IV-TR (2000) criteria. Although

the DSM-5 (2013) has kept the same criteria, meaning the symptoms studies here are

still relevant to this new manual, an alternative set of diagnostic criteria (Oldham,

2015) were suggested in the appendices of DSM-5. This highlights how other

symptoms and variables may be important in understanding BPD and treatment

148

suitability. It would be beneficial for future studies to continue exploring new

variables and their influence on treatment response. Black and Blum (2017) suggest

factors which could contribute to people’s suitability for STEPPS. These included

feeling they cannot learn new skills or feeling the only way things can change is if

others around them change. Individual variables such as sense of autonomy or

flexible thinking could contribute to the heterogeneous treatment outcomes observed.

The self-determination theory (Ryan, 2005) argues that autonomy is a defining

feature of BPD which suggests this could be an interesting feature to explore.

Another limitation of this study was the lack of information on participants who

dropped out of STEPPS; it would have been beneficial to gather information from

participants who dropped out of treatment on their reasons for finishing treatment

early.

Qualitative data was collected alongside quantitative data with questions designed to

allow space for participants to provide contextual information or share their views on

their suitability to STEPPS. However, in practice, these responses did not contribute

to answering the research question so are reported in appendix V. Wording of these

questions could have benefited from further review as certain terms provided a large

amount of crossover of answers. For example, the use of ‘suitability’ and ‘relevant’.

Further development of these questions may have produced data which contributed

to our understanding of treatment suitability and explored participants’ experiences

of STEPPS.

Despite these limitations, the findings of the current study extend the research on the

predictors of treatment response in STEPPS and make a first step towards breaking

down the heterogeneous ‘severity’ factors in relation to predicting STEPPS

149

outcomes. Strengths of the current study lie in its ecological validity, as data was

gathered from real-life STEPPS groups within NHS settings across different trusts.

The use of multiple methods of managing missing data was an advantage as it

allowed exploration of the different results from these two statistical approaches. It

has been useful to consider the potential bias which can occur when participants who

drop out of treatment are excluded from statistical analysis. Findings which are based

only on participants who complete treatment should not be generalised to the wider

participant populations which enter into treatment. The significant findings relating

to severity and identity disturbance should be interpreted with caution as, at this

stage, they only apply to the subset of participants who completed treatment. The

results from this study have highlighted a number of areas for future research, which

could provide important information on factors influencing the outcomes of therapy

for individuals with BPD. This study has made a first step towards understanding

how the presence of different BPD symptoms relates to STEPPS treatment outcomes.

References

150

Alesiani, R., Boccalon, S., Giarolli, L., Blum, N., & Fossati, A. (2014). Systems

training for emotional predictability and problem solving (STEPPS): program

efficacy and personality features as predictors of drop-out- an Italian study.

Comprehensive Psychiatry, 55, 920-927.

American Psychiatric Association. (1994). Diagnostic and Statistical Manual of

Mental Disorders, 4th ed, (DSM-IV). Washington, DC: APA.

American Psychiatric Association. (2000). Diagnostic and Statistical Manual of

Mental Disorders, 4th ed, Text Revision (DSM-IV-TR). Washington, DC:

APA.

American Psychiatric Association. (2001) Practice Guideline for the treatment of

patients with borderline personality disorder. Washington DC: American

Psychiatric Association.

American Psychiatric Association. (2013). Diagnostic and statistical manual of

eaddasfmental disorders (5th ed.). Washington, DC: APA.

Andión. Ó., Ferrer, M., Gancedo, B., Calvo, N., Barral, C., Torrubia, R., & Casas, M.

(2011).Confirmatory factor analysis of borderline personality disorder

symptoms based on two different interviews: The Structured Clinical

Interview for DSM-IV Axis II disorder and the Revised Diagnostic Interview

for Borderlines. Psychiatry Research, 190, 304-308.

Arntz, A., Stupar-Rutenfrans, S., Bloo, J., van Dyck, R., & Spinhoven, P. (2015).

Prediction of treatment discontinuation and recovery from borderline

personality disorder: Results from an RCT comparing schema therapy and

151

transference focused psychotherapy. Behaviour Research and Therapy, 74,

60-71. doi:10.1016/j.brat.2015.09.002

Arrindell, W. A., & Ettema, J. H. M. (2003). SCL-90. Symptom Checklist; Lisse:

Swets Test Publishers.

Barnett, A. G., van der Pols, J. C., & Dobson, A. J. (2005). Regression to the mean:

what it is and how to deal with it. International Journal of Epidemiology,

34(1), 215-220.

Barnicot, K., Katsakou, C., Bhatti, N., Savill, M., Fearns, N., & Priebe, S. (2012).

Factors predicting the outcome of psychotherapy for borderline personality

disorder: A systematic review. Clinical Psychology Review, 32(5), 400-412.

doi:10.1016/j.cpr.2012.04.004

Barnicot, K., Katsakou, C., Marougka, S., & Priebe, S. (2011). Treatment completion

in psychotherapy for borderline personality disorder: a systematic review and

meta-analysis. Acta Psychiatrica Scandinavica, 123(5), 327-338.

Bateman, A. W., & Fonagy, P. (2004). Psychotherapy for borderline personality

disorder: Mentalization-based treatment of BPD. Oxford: Oxford University

Press.

Bellino, S., Bozzatello, P., & Bogetto, F. (2015). Combined treatment of borderline

personality disorder with interpersonal psychotherapy and pharmacotherapy:

Predictors of response. Psychiatry Research, 226(1), 284-288.

doi:10.1016/j.psychres.2014.12.064

152

Beutler, L. E., Engle, D., Mohr, D., Daldrup, R. J., Bergan, J., Meredith, K., &

Merry, W. (1991). Predictor of differential response to cognitive,

experiential, and self-direct psychotherapeutic procedures. Journal of

Consulting and Clinical Psychology, 59(2), 333-340.

Beutler, L. E., & Moos, R. H. (2003). Coping and coping styles in personality and

treatment planning: introduction to the special series. Journal of Clinical

Psychology, 59(10), 1045-1047.

Black, D. W., Allen, J., John, D. S., Pfohl, B., McCormick, B., & Blum, N. (2009).

Predictors of response to systems training for emotional predictability and

problem solving (STEPPS) for borderline personality disorder: An exploratory

study. Acta Psychiatrica Scandinavica, 120(1), 53-61. doi:10.1111/j.1600-

0447.2008.01340.x

Black, D. W., Blum, N., Pfohl, B., & John, D. (2004). The STEPPS group treatment

program for outpatients with borderline personality disorder. Journal of

Contemporary Psychotherapy, 34(3), 193-210.

Black, D. W., Blum, N., McCormick, B., & Allen, J. (2013). Systems training for

emotional predictability and problem solving (STEPPS) group treatment for

offenders with borderline personality disorder. Journal of Nervous and Mental

Disease, 201(2), 124-129. doi:10.1097/NMD.0b013e31827f6435

153

Black, D. W., & Blum, N. S. (2017). Systems Training for Emotional Predictability

and Problem Solving for Borderline Personality Disorder: Implementing

STEPPS around the Globe. New York: Oxford University Press.

Blum, N., Pfohl, B., John, D., Monahan, P., & Black, D.W. (2002). STEPPS: A

Cognitive-Behavioural Systems-Based Group Treatment for Outpatients with

Borderline Personality Disorder-A Preliminary Report. Comprehensive

Psychiatry, 43(4), 301-310.

Bohus, M., Haaf, B., Simms, T., Limberger, M. F., Schmahl, C., Unckel, C., et al.

(2004). Effectiveness of inpatient dialectical behavioral therapy for borderline

personality disorder: A controlled trial. Behaviour Research and Therapy, 42(5),

487-499. Retrieved from http://search.proquest.com/docview/71738013?

accountid=17256

Bos, E. H., van Wel, E. B., Appelo, M. T., & Verbraak, J. P. M. (2011).

Effectiveness of systems training for emotional predictability and problem

solving (STEPPS) for borderline personality problems in a ‘real-world’

sample: Moderation by diagnosis or severity? Psychotherapy and

Psychosomatics, 80(3), 173-181. doi:10.1159/000321793

Cheema, J. R. (2014). Some general guidelines for choosing missing data handling

methods in educational research. Journal of Modern Applied Statistical

Methods, 13(2), 53-75.

Cohen, J. (1988). Statistical Power Analysis for the Behavioral Sciences. New York,

NY: Routledge Academic.

154

Cronbach, L. J., & Furby, L. (1970). How we should measure ‘change’, or should

we? Psychological Bulletin, 74(1), 68-80.

Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood from

incomplete data via the EM algorithm. Journal of the Royal Statistical

Society, Series B, 39(1), 1-38.

Ebner-Priemer, U. W., Mauchnik, J., Kliendienst, N., Schmahl, C., Peper, M.,

Rosenthal, M. Z., ... Bohus, M. (2009). Emotional learning during

dissociative states in borderline personality disorder. Journal of Psychiatry

and Neuroscience, 34(3), 214-222.

Ellison, W. D., Rosenstein, L., Chelminski, I., Dalrymple, K., & Zimmerman, M.

(2015). The clinical significance of single feature of borderline personality

disorder: anger, affective instability, impulsivity, and chronic emptiness in

psychiatric outpatients. Journal of Personality Disorders, 30(2), 261-270.

Foster, E. M., & Fang, G. Y. (2004). Alternative methods for handling attrition: an

illustration using data from the fast track evaluation. Evaluation Review,

28(5), 434-464.

Gaw, K. F., & Beutler, L. E. (1995). Integrating treatment recommendations. In

Beutler, L. E. & Berren, M. R. (Eds.), Integrative Assessment of Adult

Personality (280-319). New York: Guildford Press.

Gelman, A., & Hill, J. (2007). Missing-data imputation. In Gelman, A., & Hill, J.

(Eds.), Data Analysis using Regression and Multilevel/ Hierarchical Models

(529-543). New York: Cambridge University Press.

155

Ghasemi, A., & Zahediasl, S. (2012). Normality tests for statistical analysis: a guide

for non-statisticians. International Journal of Endocrinology and Metabolism,

10(2), 486-489.

Gov.uk. (2015). Data protection - GOV.UK. [online] Available at:

https://www.gov.uk/data-protection/the-data-protection-act

Graham, J. W. (2009). Missing data analysis: making it work in the real world.

Annual Review of Psychology, 60, 549-576.

Grant, B. F., Chou, S. P., Goldstein, R. B., Huang, B., Stinson, F. S., Saha, T. D., …

Ruan, J. (2008). Prevalence, correlates, disability and comorbidity of DSM-

IV borderline personality disorder: results from the wave 2 national

epidemiologic survey on alcohol and related conditions. Journal of Clinical

Psychiatry, 69(4), 533-545.

Gratz, K. L., Dixon-Gordon, K., & Tull, M. T. (2014). Predictors of treatment

response to an adjunctive emotion regulation group therapy for deliberate self-

harm among women with borderline personality disorder. Personality

Disorders: Theory, Research, and Treatment, 5(1), 97-107.

doi:10.1037/per0000062

Harvey, R., Black, D., & Blum, N. (2010). Systems Training for Emotional

Predictability & Problems Solving (STEPPS) in the United Kingdom: A

Preliminary Report. Journal of Contemporary Psychotherapy, 40, 225-232.

Hopwood, C. J. (2006). Brief treatments for borderline personality. Clinical

Psychology and Psychotherapy, 13, 269-283.

156

IBM Corp. Released 2015. IBM SPSS Statistics for Windows, Version 23.0.

Armonk, NY: IBM Corp.

Jacobson, N. S., & Truax, P. (1991). Clinical significance: a statistical approach to

defining meaningful change in psychotherapy research. Journal of Consulting

and Clinical Psychology, 59(1), 12-19.

Kang, H. (2013). The prevention and handling of missing data. Korean Journal of

Anesthesiology, 64(5), 402-406.

Kellogg, S.H., & Young, J. E. (2006). Schema therapy for borderline personality

disorder. Journal of Clinical Psychology, 62, 445-458.

Kirkwood, B. R., & Sterne, A. C. (2003). Medical Statistics (2nd ed.). Oxford:

Blackwell.

Kleindienst, N., Limberger, M. F., Ebner-Priemer, U. W., Keibel-Mauchnik, J., Dyer,

A., Berger, M., et al. (2011). Dissociation predicts poor response to dialectial

behavioral therapy in female patients with borderline personality disorder.

Journal of Personality Disorders, 25(4), 432-447.

Kline, P. (1999). A Handbook of Psychological Testing, 2nd ed. London: Routledge.

Klonsky, D. (2008). What is emptiness? Clarifying the 7th criterion for borderline

personality disorder. Journal of Personality Disorders, 22(4), 418-426.

Lana, F., & Fernandez-San Martin, M. I. (2013). To what extent are specific

psychotherapies for borderline personality efficacious? A systematic review

of published randomised controlled trials. Actas Espanolas de Psiquiatria,

41(4), 242- 252.

157

Lenzenweger, M. F., Clarkin, J. F., Levy, K. N., Yeomans, F. E., & Kernberg, O. F.

(2012). Predicting domains and rates of change in borderline personality

disorder. Personality Disorders: Theory, Research, and Treatment, 3(2), 185-

195. doi:10.1037/a0025872

Levy, K. N., Meehan, K. B., Temes, C. M., & Yeomans, F. E. (2011). Attachment

theory and research: implication for psychodynamic psychotherapy. In Levy,

R. A., Ablon, J. S., Kächele, H (Eds.), Psychodynamic psychotherapy

research: Evidence-based practice and practice-based evidence (401-416).

London: Humana Press.

Linehan, M. M. (1993). Cognitive behavioral treatment of borderline personality

disorder. New York: Guilford Press.

Livesley, J. W. (2012). Moving beyond specialist therapies for borderline personality

disorder: the importance of integrated domain-focused treatment.

Psychodynamic Psychiatry, 40(1), 47-74.

Luborsky, L., Auerbach, A. H., Chandler, M., Cohen. J., & Bachrach, H. M. (1971).

Factors influencing the outcome of psychotherapy: a review of qualitative

research. Psychological Bulletin, 75(3), 145-185.

Manikandan, S. (2011). Measures of central tendency: median and mode. Journal of

Pharmacology and Pharmacotherapuetics, 2(3), 214-215.

Mosquera, D., Gonzalez, A., & Leeds, A. M. (2014). Early experience, structural

dissociation, and emotional dysregulation in borderline personality disorder:

the role of insecure and disorganised attachment. Borderline Personality and

Emotional Dysregulation, 1(15), 1-8.

158

National Institute for Health and Care Excellence. (2009). Borderline personality

disorder: treatment and management. London: The British Psychological

Society and The Royal College of Psychiatrists.

Oldham, J. M. (2015). The alternative DSM-5 model for personality disorders. World

Psychiatry, 14(2), 234-236.

Pfohl, B., Blum, N., St John, D., McCormick, B., Allen, J., & Black, D. W. (2009).

Reliability and validity of the borderline severity over time (BEST): a self

rated scale to measure severity and change in persons with borderline

personality disorder. Journal of Personality Disorders, 23(3), 281-293.

Pompili, M., Girardi, P., Ruberto, A., & Tatarelli, R. (2005). Suicide in borderline

personality disorder: A meta-analysis Nordic Journal of Psychiatry, 59(5), 319-

324.

Potthoff, R. F., Tudor, G. E., Pieper, K. S., & Hasselblad, V. (2006). Can one assess

whether missing data are missing at random in medical studies? Statistical

Methods in Medical Research, 15, 213-234.

Raykov, T., & West, B. T. (2015). On enhancing plausibility of the missing at

random assumption in incomplete data analyses via evaluation of response-

auxiliary variable correlations. Structural Equation Modeling, 22(3), 429-

438.

Ryan, R. M. (2005). The developmental line of autonomy in the etiology, dynamics

and treatment of borderline personality disorder. Developmental and

Psychopathology, 17, 987-1006.

159

Ryle, A., & Golynkina, K. (2000). Effectiveness of time-limited cognitive analytic

therapy of borderline personality disorder: Factors associated with outcome.

British Journal of Medical Psychology, 73, 197-210.

doi:10.1348/000711200160426

Sanislow, C. A., Grilo, C. M., Morey, L. C., Bender, D. S., Skodol, A. E.,

Gunderson, J. G... McGlashan, T. H. (2002). Confirmatory Factor Analysis of

DSM-IV Criteria for Borderline Personality Disorder: Findings from the

Collaborative Longitudinal Personality Disorders Study. The American Journal

of Psychiatry, 159(2), 284-290

Stoffers, J. M., Voellm, B. A., Ruecker, G., Timmer, A., Huband, N., & Lieb, K.

(2012). Psychological therapies for people with borderline personality

disorder. Cochrane Database of Systematic Reviews, (8),

CD005652.doi:10.1002/14651858.CD005652.pub2

Stone, M. H. (2006). Management of borderline personality disorder: a review of

psychotherapeutic approaches. World Psychiatry, 5(1), 15-20.

Van der Hart, O., Nijenhuis, E., Steele, K., & Brown, D. (2004). Trauma-related

dissociation: concept clarity lot and found. Australian and New Zealand

Journal of Psychiatry, 38(11/12), 906-914.

Widiger, T. A., & Weissman, M. M. (1991). Epidemiology of borderline personality

disorder. Hospital and Community Psychiatry, 42(10), 1015-1021.

Yen, S., Johnson, J., Costello, E., & Simpson, E. B. (2009). A 5-day dialectical

behavior therapy partial hospital program for women with borderline personality

160

disorder: Predictors of outcome from a 3-month follow-up study. Journal of

Psychiatric Practice, 15(3), 173-182. doi:10.1097/01.pra.0000351877.45260.70

Zanarini, M. C., Frankenburg, F. R., Reich, D. B., Silk, K. R., Hudson, J. I., &

McSweeney, L. B. (2007). The subsyndromal phenomenology of borderline

personality disorder: a 10-year follow-up study. The American Journal of

Psychiatry, 146(6), 929-935.

Zanarini, M. C., & Frankenburg, F. R. (2008). Zanarini Rating Scale for Borderline

Personality Disorder: Self-report version (ZAN-BPD: SRV). Belmont, MA:

McLean Hospital.

Zanarini, M. C., Weingeroff, J. L., Frankenburg, F. R., & Fitzmaurice, G. M. (2015).

Development of the self-report version of the Zanarini rating scale for

borderline personality disorder. Personality and Mental Health, 9, 243-249.

Zimmerman, M., & Mattia, J. I. (1999). Axis I diagnostic comorbidity and borderline

personality disorder. Comprehensive Psychiatry, 40(4), 245-252.

doi:http://dx.doi.org/10.1016/S0010-440X(99)90123-2

161

List of Appendices

Appendix A: Journal Guidelines for Authors

Appendix B: Measures and time points when they were collected

Appendix C: Breakdown of shared data and data unique to each study

in the wider STEPPS project

Appendix D: The Borderline Evaluation of Severity over Time

Appendix E: The Zanarini rating scale of Borderline Personality Disorder (ZAN-BPD) self report version

Appendix F: Demographics and Service Usage Questionnaire

Appendix G: Details of STEPPS Session Content

Appendix H: Details of ethical review discussion on collecting group

information relating to facilitation

Appendix I: Information on facilitator profession, training and supervision

Appendix J: Detailed Protocol for Facilitators

Appendix K Information sheet provided to participants prior to consent to

treatment

Appendix L: Consent form

Appendix M: Copy of letter of Ethical Approval

Appendix N: Copies of R&D Approvals

Appendix O: Independent Variables: Test of normality, Boxplots

and Histograms

Appendix P:Dependent Variables: Test of normality, Boxplots

and Histograms

Appendix Q: Comparisons of completers and non-completers

Appendix R: Preliminary Investigations of Relationships with dependent variables

Appendix S: Regression analysis additional information

Pg. 165

Pg. 166

Pg. 167

Pg. 168

Pg. 169

Pg. 170

Pg. 171

Pg. 173

Pg. 181

Pg. 184

Pg. 186

Pg. 196

Pg. 203

Pg. 209

Pg. 212

Pg. 213

Pg. 216

Pg. 174

Pg. 177

162

Appendix T: Exploring regression with all 9 symptom measures

Appendix U: Eigen values of symptom level predictors

Appendix V: Content Analysis

Appendix W: Descriptive statistics of follow-up data

Pg. 222

Pg. 223

Pg. 224

Pg. 228

163

Appendices

Appendix A: Journal Guidelines for Authors

Removed for ethesis

164

Appendix B: Measures and time points when they were collected

Table 11. Measures and time points when they were collected

Session 1

Session 2 (T1)

Session 10 (T2)

Session 20 (T3)

Every week

Information sheet Consent form Demographics Quality of Life questionnaire Group Session Rating Scale ZAN-BPD BEST Filter questionnaire Skills monitoring card Service usage questionnaire

165

Appendix C: Breakdown of shared data and data unique to each study in the wider STEPPS project

Table 12: Data used by each of the three projectsUsed by MeasuresAll three studies BEST scores (weekly)

Demographic data (T1)Service usage (T1)

Study 1. Predictors of BPD-specific outcomes (this study)

ZAN-BPD (T1)

Study 2. Mechanisms of change Group Session Rating Scale (T1, T2, T3)Skills monitoring card (weekly)ZAN-BPD (T1 and T3)

Study 3. Predictors of Quality of life outcomes

Quality of Life questionnaire (T1, T2, T3) Service usage (T2, T3)Group Session Rating Scale (T1, T2, T3)ZAN-BPD (T1 and T3)Filter questionnaire (T1 and T3)

166

Appendix D: The Borderline Evaluation of Severity over Time (BEST; Blum, Pfohl, John, Monahan & Black, 2002)

Removed for ethesis

167

Appendix E: The Zanarini rating scale of Borderline Personality Disorder (ZAN-BPD) self report version (Zanarini & Frankenburg, 2008)

Removed for ethesis

168

Appendix F: Demographics and Service Usage Questionnaire

Site code Participant code

Demographic Information

1. Age: .............

2. Do you have a diagnosis of Borderline Personality Disorder (BPD) / Emotionally Unstable Personality Disorder (EUPD)? (please circle)

YES / NO

If so, how many years ago were you given this diagnosis ...............................

3. Are you currently undertaking any other form of psychological therapy (apart from STEPPS)? (please circle)

YES / NO

If so, which therapy? .............................................................................................

4. Have you started a STEPPS treatment programme before? (please circle)YES / NO

If yes, how many times? .................

If yes, how many STEPPS treatment programmes have you completed? …………….

5. Have you previously undertaken any other forms of psychological therapies (other than STEPPS)? (please circle)

YES / NO

If so, which therapy? ......................................................................

Service Usage Questionnaire

1) During the last 7 days how many contacts have you had with the following:

Services Number of contactsA) Telephone call to the Mental Health Line/Crisis LineB) A and EC) Crisis Team/ Home Treatment Team D) Inpatient psychiatric hospital E) Inpatient general hospital (e.g. physical health symptoms

as a result of self-harm)

2) Have there been incidents of self-harm during this period? (Please circle) YES / NO

If YES, please state an approximate number of incidences.............................................

169

Appendix G: Details of STEPPS Session Content

Every week the BEST scale, Emotional Intensity Continuum and Skills Monitoring Card are completed and reviewed prior to the individual session content.

Week Lesson/skill Content1. Introduction to Group &

awareness of illness. Awareness of BPD Goals for STEPPS Introduction to the ‘Emotional Intensity

Continuum’ Introduction to the reinforcement team

2. Filters Introduction to filters Completion of filter questionnaire (Pre) Common negative filters How to challenge filters Common helpful filters

Emotional Management Skills3. Distancing What is distancing?

How do you use distancing? Relaxation

4. Communicating, Part 1 What is communicating? Communicating thoughts, emotions &

physical sensations How to communicate feelings Relaxation

5. Communicating, Part 2 Review skills from last week Distance, then communicate worksheet

6. Challenging, Part 1 What does challenging mean? How do you challenge thoughts? Where do distorted thoughts come

from? Common thought distortions Challenging worksheet

7. Challenging, Part 2 Common myths about emotions Common thought distortions worksheet Distance, communicate then challenge

worksheet8. Challenging, Part 3 Review distance, communicate then

challenge worksheet9. Distracting, Part 1 What is distracting?

Distracting activities Positive affirmations and encouraging

statements Encouraging exercise Distracting, part 1 worksheet Distance, communicate, challenge,

distract worksheet10. Distracting, Part 2 Additional distraction techniques

Three relaxation strategies Imagery worksheet

170

Distance, communicate, challenge, distract worksheet

11. Managing problems, Part 1

What is the managing problems skill? Managing problems worksheet Distance, communicate, challenge,

distract & manage worksheet12. Managing problems, Part

2 ‘Letting go’ poem Visualisation worksheet Write your own visualisation script Distance, communicate, challenge,

distract worksheetBehavioural Management Skills13 Behaviour management Introduction to behaviour management

Behaviour management test14. Setting goals Setting lifestyle goals

Six steps to goal-setting Setting goals worksheet

15. Eating and sleeping Visualisation exercise Eating behaviours Healthy eating behaviours vs. eating

disordered behaviours Sleeping behaviours Sleep diary

16. Exercise, leisure & physical health

What is healthy exercise? Potential benefits of exercise &

exercise diary Leisure & leisure diary Pets Physical health & physical health diary

17. Abuse avoidance What is abuse avoidance? Abuse avoidance worksheet Avoiding abusive behaviours

worksheet Asking for help worksheet

18. Relationship behaviours, Part 1

Improving relationship skills Building and maintaining healthy

relationships Using your STEPPS skills to maintain

healthy relationships Warning signs of abusive or harmful

relationships Myths about relationship behaviours

19. Relationship behaviours, Part 2

Developing healthy boundaries Circle of support Filter questionnaire (post)

20. Wrapping it all up Course summary & group evaluation

171

Appendix H: Details of ethical review discussion on collecting group information relating to facilitation

172

Appendix I: Information on facilitator profession, training and supervision

173

Site code

No. of Facilitators and disciplines

Adherence to the model

01 2 facilitators Clinical Psychologists

Peer supervision was in place throughout with other STEPPS facilitators from another borough. 

02 2 facilitators Clinical Psychologist Trainee Clinical

Psychologist

Monthly STEPPS supervision with facilitators in another borough to ensure adherence

Weekly reflection session between facilitators

03 2 facilitators

1 x senior mental health nurse (charge nurse, band 6)

1 x senior occupational therapist (band 6).

(Plus 1 x occupational therapist (band 5) observing as part of training) 

2 day training in 2007 No supervision specifically for STEPPS

but supervision available via usual arrangements (ie: team supervision with a psychologist and individual clinical/managerial supervision).

04 3 facilitators Occupational Therapist Social Worker Assistant Psychologist

Weekly informal supervision pre and post STEPPS group for 30 minutes.

Formal supervision monthly x1 hour for each facilitator.

05 3 facilitators Counselling

Psychologist, OT Assistant Psychologist

Monthly supervision for facilitators

06 2 facilitators Senior clinical

psychologist Community

psychiatric nurse

Monthly group supervision led by consultant psychologist for all STEPPS facilitators.  These supervision sessions cover usual supervision needs and also process issues, e.g., any changes in delivery/processes/outcome measures

07 3 facilitators CBT therapist Mental health social

worker Assistant psychologist

Regular STEPPS supervision was monthly which one member of the facilitation team would attend.

Weekly debrief sessions to discuss each group. This included observing fidelity to the model.

All facilitators were either experienced STEPPs facilitators/Trainers within the Trust or had attended one a one day training session provided by the Trust

08 3 facilitators Weekly supervision post- session

174

Appendix J Detailed Protocol for Facilitators

DETAILED PROTOCOL FOR FACILITATORS (Please read parts 1 and 2)Part 1: Facilitator tasks only (summary)

Prior to STEPPS group (optional)

If invitation letters are being sent to group members, the Participant Information Sheet could be included to give potential participants time to read about the study details.

Every session It would be helpful if group members are given time to complete the Skills Monitoring Card on a weekly basis as per the STEPPS protocol; group members could be prompted to do this when they complete the BEST form in each session.

Session 1 - Book the room for an extra 15 minutes, so that researchers will have time to talk to the group- Mention to the group that researchers will be coming at the end of the session to talk about a research study; it is up to group members if they want to stay.

Session 2 (T1) - Book the room for an extra 30 minutes, so that researchers will have time to collect consent forms and T1 measures- At the end of this session, the researchers will leave a cardboard box at the service base. The box will contain blank T2 and T3 measures for each study participant. Note: this box will only contain the additional measures that are not part of STEPPS. The standard STEPPS measures (BEST, Skills Monitoring Card, Filter Questionnaire) will be copied by the researchers so that the service or participants can keep the originals.

Session 9 (this may not be applicable to all groups)

- If participants have been keeping their own BEST forms and Skills Monitoring Cards (as opposed to the facilitators collecting them), please remind them to bring them all (weeks 2-9) to the session next week. - Remind group that the researchers will be attending the next session to collect T2 measures.

Session 10 (T2)

- Book the room for an extra 15 minutes, so that researchers will have time to collect the measures- Put the cardboard box in the group room at the start of the session. The box will contain envelopes clearly labelled with the participants’ names and “T2” – participants will need to complete the measures in the envelope at the end of this session or during the break. The researchers will collect these measures and the standard STEPPS measures when they come at the end of the session. - If the researchers are unable to attend on this date, please keep the completed measures in a secure place (e.g. with other group material) until the researchers are able to collect them.

Session 19 (this may not be applicable to all groups)

- If participants have been keeping their own BEST forms and Skills Monitoring Cards (as opposed to the facilitators collecting them), please remind them to bring them all (weeks 11-19) to the session next week. - Remind group that the researchers will be attending the next session.

Session 20 (T3)

- Book the room for an extra 15 minutes, so that researchers will have time to collect the measures- Put the cardboard box in the group room at the start of the session. The box will contain envelopes clearly labelled with the participants’ names and “T3” – participants will need to complete the measures in the

175

envelope at the end of this session or during the break. The researchers will collect these measures and the standard STEPPS measures when they come at the end of the session. - If the researchers are unable to attend on this date, please keep the completed measures in a secure place (e.g. with other group material) until the researchers are able to collect them. Part 2: Full study protocol including researcher activities

Prior to STEPPS Group

FACILITATORS: Preliminary information about the study can be given to STEPPS group members before the start of treatment (e.g. at screening assessments or introductory sessions). It may be possible to include the participant information sheet for the study when sending out the group invitation letters to patients. (NB this step is optional and will differ depending on service protocol)

STEPPS session 1

FACILITATORS: At the start of the session, group facilitators will mention to the group that a researcher(s) will be coming to the end of the session to talk about the research study (this may take 10-15 minutes - please extend room bookings to allow for this). It is up to group members if they want to stay.

NB: it is important for the research that all STEPPS members are encouraged to complete a BEST form and a Skills Monitoring Card weekly (as per the STEPPS protocol) – Please encourage all group members to do this.

RESEARCHERS: At the end of the session a researcher(s) will arrive and give the group verbal information about the study as well as a participant information sheet. Participants can ask questions about the study, and may seek guidance from their care team or group facilitator about their participation after this session.

STEPPS session 2 (T1)

RESEARCHERS: Researcher(s) will attend the end of STEPPS session 2 (again, group members can decide whether to stay after the group and room bookings will need to be extended for approx. 15-30 minutes). Signed informed consent will be obtained from group members who wish to participate in the study. These individuals will also be asked to complete the following additional T1 measures:

- Survey – demographics and service use - Quality of Life questionnaire- Group Session Rating Scale- Zanarini (self-report)

RESEARCHERS: Participants will be coded as they provide informed consent and each participant will be allocated an envelope labelled with their code (T1 measures can be put straight into these envelopes and should not contain any identifying information). Data

176

from routine questionnaires will be extracted onto record forms which are also anonymised using this system (this may involve photocopying measures with identifying information omitted e.g. folded over). Anonymised data will be taken and stored at the University of Surrey. The routine data collected will be:

- BEST- Filter questionnaire- Skills monitoring card

FACILITATORS: At the end of STEPPS session 2, facilitators will be given a box containing additional measures for T2 and T3. The box will be labelled with a site code. Inside the box will be individual envelopes labelled with each participants name (i.e. those who have provided informed consent). This box will need to be stored on site until T2 (STEPPS session 10)

STEPPS session 9

FACILITATORS: It would be helpful if facilitators could remind participants to bring all their BEST questionnaires and skills monitoring cards from the course so far to the next session (assuming group members normally keep these measures themselves)

STEPPS session 10 (T2)

FACILITATORS: At the start of STEPPS session 10, facilitators will put out the box of additional measures they were provided with and remind participants to complete these, either during the break or after the group. Participants can find their envelope (it will have their name written on it) in the box and should return the completed questionnaires to the box in the same envelope. Participants will be asked to stay for approx. 10 minutes at the end of the session so that the researcher(s) can extract the data from the routine questionnaires (please adjust room booking times to allow for this)

RESEARCHERS: A researcher(s) will arrive at the end of this session to collect additional and routine measures. All measures will be anonymised on site; no identifyable data will be taken off site. Additional measures will be taken out of the named envelopes and placed into the corresponding coded envelopes. The additional measures collected at T2 are:

- Survey - service use- Zanarini (self-report)- Group Session Rating Scale- Quality of Life scale

The routine measures that will be copied are:- BEST (from weeks 2 – 10)- Skills monitoring card (from weeks 2 – 10)

Collected data will be taken to the University of Surrey to be stored.

STEPPS session 19

FACILITATORS: It would be helpful if facilitators could remind participants to bring all their BEST questionnaires and skills monitoring cards from the course so far to the next session (assuming group members normally keep these measures themselves), in addition to the filter questionnaire completed during this session.

177

STEPPS session 20 (T3)

FACILITATORS: At the start of STEPPS session 20, facilitators will put out the box of additional measures they were provided with and remind participants to complete these, either during the break or after the group. Participants can find their envelope (it will have their name written on it) in the box and should return the completed questionnaires to the box in the same envelope. Participants will be asked to stay for approx. 10 minutes at the end of the session so that the researcher(s) can extract the data from the routine questionnaires (please adjust room booking times to allow for this)RESEARCHERS: A researcher(s) will arrive at the end of this session to collect additional and routine measures. All measures will be anonymised on site; no identifyable data will be taken off site. Additional measures will be taken out of the named envelopes and placed into the corresponding coded envelopes. The additional measures collected at T3 are:

- Survey - service use and qualitative questions- Zanarini (self-report)- Group Session Rating Scale- Quality of Life scale

The routine measures that will be copied are:- BEST (from weeks 11 – 20)- Skills monitoring card (from weeks 11 – 20) - Filter questionnaire (routinely completed in session 19)

Collected data will be taken to the University of Surrey to be stored.

6-7 weeks after the end of the STEPPS group

RESEARCHERS: If participants requested to be contacted by post for the follow-up measures, these can be sent out six weeks after the end of the STEPPS group (i.e. two weeks before T4) with a letter indicating the date the measures should be completed. If participants requested to be contact by email to complete the follow-up measures online, an email can be sent out seven weeks after the end of the STEPPS group (i.e. one week before T4) reminding them to do so.

2 months after the end of the STEPPS group (T4)

PARTICIPANTS: Having been prompted by researchers, participants will be responsible for completing the follow-up measures at this time. If completing them by post they will need to send them back to the researchers (a stamped addressed envelope will have been provided with the measures). The follow-up measures to be completed are:

- BEST- Skills monitoring card- Measure of service use- Quality of Life scale

December 2016

This is the estimated end of the data collection period. After this time participants will no longer be able to withdraw from the study. At this time we will select winners for the prize draw and contact winning participants directly.

178

Appendix K: Information sheet provided to participants prior to consent to treatment

Participant Information Sheet (Version 4: 19/1/16)

Project title: Predictors of outcomes, predictors of quality of life and mechanisms of change in STEPPS  

Introduction

We are three Trainee Clinical Psychologists studying at the University of Surrey. We would like to invite you to take part in a research project, which we are undertaking as part of our training (PsychD). Before you decide whether to participate, you need to understand why the research is being done and what it will involve for you. Please take the time to read the following information carefully and talk to others about the study if you wish (e.g.) your care coordinator.

What is the purpose of the study?

The NHS offer STEPPS (Systems Training for Emotional Predictability and Problem Solving) as a treatment option for people with a diagnosis of (or symptoms of) Borderline Personality Disorder (BPD). It is important that the NHS understands how this treatment works, who it works for, and whether any benefits are meaningful for those who receive it. This study aims to explore those questions by looking at group participants’ experiences throughout the treatment.

Why have I been invited to take part in the study?

Everyone who has been offered the STEPPS treatment program on which you are enrolled has been invited to take part in our research.

Do I have to take part?

No, you do not have to participate. There will be no adverse consequences in terms of your care or treatment if you decide not to participate. You can withdraw from this study at any time during the STEPPS treatment program without giving a reason and you can ask for your data to be withdrawn at any point before the end of data collection (we estimate this will be 30th December 2016). The data collection end date means that after that, the data will be entered into an analysis, and it will not be possible to withdraw your data. If you drop out of the research study but do not request your data to be withdrawn before 30th December 2016, the data may still be used.

179

What will my involvement require?

Participation in this research will not involve much input over and above what you would normally do as part of the STEPPS group treatment. This is because, if you consent, we will use answers to questionnaires that you would be filling in anyway. In addition, we have provided three additional questionnaires, which you will be asked to fill in at three time points (sessions 2, 10 and 20). You will be able to fill these when you attend the group session. We will collect follow-up questionnaires two months after the end of the group, to see whether there has been any further change in this time. On the consent form, there is space to provide either an email address or a postal address to receive these. One of the additional questionnaires asks for your feedback on your experience at the end of the treatment programme; it is possible that anonymised quotes from some questionnaires may be used in the write up of the research to illustrate feedback that was commonly given.

If you wish, we will also enter you into an optional prize draw – all participants will have the chance to win one of the following:

1x £25 Amazon voucher OR 2x £10 Amazon vouchers

Winners will be drawn in December 2016. If you have won we will contact you using the details you give us on the consent form.

What are the possible disadvantages or risks of taking part?

We do not anticipate any negative effects of participating in this research. However, we understand that some individuals may find it stressful to complete questionnaires, and may worry about the consequences of expressing negative feedback or criticism. If you experience any discomfort as a result of participating, you are encouraged to speak to your care coordinator or one of your group facilitators about your concerns.

What are the possible benefits of taking part?

Participating in this research may give you the opportunity to reflect on your progress through the group. In addition, we hope that your participation will contribute to improving how we deliver STEPPS in future.

Will my taking part in the study be kept confidential?

Yes. All of the information you give will be anonymised so that those reading reports from the research will not know who has contributed to it. Data will be stored securely in line with University of Surrey policy and personal data will be handled in accordance with the Data Protection Act (1998).

180

What happens when the research study stops?

We aim to complete data collection by December 2016, the results of the study will then be written up by December 2017. After this date, if you would like to receive a summary of the findings, please indicate this on the consent form. All anonymised data will be stored securely for a minimum of 10 years at the University of Surrey. Personal data (e.g. email or postal addresses provided on the consent form) will be destroyed when the study ends, in September 2017. As this research is being conducted as part of three students’ academic degrees (PsychD qualifications), anonymised results from the study may also be presented in other formats e.g. conferences, presentations, and research articles published in academic journals. No data would be published that could identify any individual.

Who is organising and funding the research?

This research is organised by the University of Surrey. It is not funded as it is part of the requirements of our doctoral qualification to complete a research project.

What if there is a problem? (Contact details of researcher)

Any complaint or concern about any aspect of the way you have been dealt with during the course of the study will be addressed. You may contact any of the following people (we cannot guarantee immediate response):

NAME (student): EMAIL NAME (supervisor): EMAIL

If you wish to contact someone outside the research team:

NAME (JOB DESCRIPTION) EMAIL:

If you do not have access to email you can leave us a message on the following mobile number:

STEPPS research team mobile:

Who has reviewed the project?

This study has been reviewed and received a favourable ethical opinion from South East Coast - Surrey Research Ethics Committee.

Thank you for taking the time to read this Information Sheet.

Appendix L: Consent form

181

Consent Form (Version 3: 4/12/2015)

Title of project: Predictors of outcomes, predictors of quality of life and mechanisms of change in STEPPS

I have read and understood the Information Sheet provided (version 4, 19/1/2016). I have been given a full explanation by the researchers of the nature, purpose, location and likely duration of the study, and of what I will be expected to do. I have been given the opportunity to ask questions on all aspects of the study.

I agree to comply with any instruction given to me during the study and to co-operate fully with the researchers. I shall inform them (or a member of my care team) immediately if I experience any negative effects to my wellbeing as a result of my participation.

I consent to my questionnaire responses, as outlined in the accompanying information sheet, being used for this research, which will be written up as part of PsychD Clinical Psychology qualifications for three students; this may include quotes from my feedback sheet at the end of the STEPPS treatment. I understand that all personal data relating to volunteers is held and processed in the strictest confidence, and in accordance with the Data Protection Act (1998). All data will be anonymised and no identifiable data will be shared with anyone outside my care team.

I understand that I am free to withdraw from the study at any time, and can request for my data to be withdrawn from the study, until 30 th December 20164, without needing to justify my decision and without prejudice.

I agree to complete follow-up questionnaires two months after the end of the group (please provide either an email address or a postal address on the next page to receive these)

I confirm that I have read and understood the above and freely consent to participating in this study. I have been given adequate time to consider my participation and agree to comply with the instructions and restrictions of the study.

I would like to be entered into the prize draw (if you win, we will use the contact details you have provided on the next page)

I would like a summary of the results of the study (if yes, we will use the contact details you have provided on the next page)

4 Please note this is an estimated date, which represents the proposed end of data collection and the start of data analysis.

182

Name of participant (BLOCK CAPITALS) ......................................................

Signed...................................................... Date......................................................

Name of researcher/person taking consent………………………………………….

Signed.................................................... Date……………………………………..

Contact details for follow up questionnaires (also for prize draw and summary of results if you have opted in for these)

Please provide either an email address or a postal address (If you provide an email address we will ask you to complete the follow up questionnaires online. If you provide a postal address we will send you paper copies of the follow up questionnaires with a stamped addressed envelope to return them to us):

Name………………………………………………………………………………………

…………

Email address:

……………………………………………………………………………………..

Postal address:

…………………………………………………………………………………………

Appendix M: Copy of letter of Ethical Approval

183

184

185

186

187

188

189

190

191

192

193

Appendix N: Copies of R&D Approvals

194

195

196

197

198

199

200

Appendix O: Independent Variables: Test of normality, Boxplots and Histograms

ZAN-BPD measure of affect instability was significantly negatively skewed, demonstrating that scores clustered around the high end. All other Z scores demonstrated no significant skew. Self-harm and suicidal behaviours and efforts to avoid abandonment subscales indicated significant kurtosis demonstrating a relatively flat distribution of scores.

Ghasemi and Zahediasl (2012) argue that the Shapiro-Wilk test of normality is a more accurate measure of normality as it has better power than the Kolomogorov-Smirnov. Due to the relatively low sample size in this study, the Shapiro-Wilk test of normality is reported. Only the ZAN-BPD total score (p=0.705) was normally distributed. Boxplots identified some outliers, however these were not identified as extreme outliers (more than three box lengths from the edge of the box) and 5% trimmed means did not vary largely from the full mean value suggesting these scores were not biased by outliers.

Table 13: Z scores for skewness and kurtosis

Skewness SE of Skew

Skew Z score

Kurtosis

SE of Kurtosis

Kurtosis Z score

ZanTotal-0.192 0.285

-0.673684211 -0.128 0.563 -0.22735

ZanAnger-0.37 0.285

-1.298245614 -0.197 0.563 -0.34991

ZanAffect-0.916 0.285

-3.214035088 0.48 0.563 0.852575

ZanEmpti -0.368 0.285 -1.29122807 -0.588 0.563 -1.0444ZanIdenti

-0.304 0.285-

1.066666667 -0.831 0.563 -1.47602ZanDSH 0.273 0.285 0.957894737 -1.271 0.563 -2.25755ZanImpulsi

-0.106 0.285-

0.371929825 -1.017 0.563 -1.80639ZanDissoc -0.316 0.285 -1.10877193 -0.992 0.563 -1.76199ZanAbandon 0.08 0.285 0.280701754 -1.174 0.563 -2.08526ZanRelation 0.163 0.285 0.571929825 -1.03 0.563 -1.82948ZanDisturbRelate

-0.017 0.285-

0.059649123 -0.648 0.563 -1.15098ZanBehDysreg 0.056 0.285 0.196491228 -1.022 0.563 -1.81528ZanAffectDysreg

-0.618 0.285-

2.168421053 0.61 0.563 1.083481

201

202

203

204

205

206

Appendix P: Dependent Variables: Test of normality, Boxplots and Histograms

Dependent Variables: Skewness and Kurtosis

48 participants had a BEST Change score, demonstrating that they attended at least one session in both the first and second half of treatment. Z scores were calculated from Skewness and Kurtosis values and compared to a 1.96 criteria to establish whether these values were significant (at 0.05 level).

BEST Change score was significantly negatively skewed, demonstrating that change scores were clustered around the high end. Z scores for BEST Change also indicated significant positive kurtosis demonstrating a tail-heavy distribution of scores. The significant Shapiro-Wilk value for the BEST Change score demonstrated that this score was not normally distributed (p=0.02), however BEST Median 2 score was normally distributed (p=.20).

Despite some skew and kurtosis in these variables, assumptions for regression analyses rely to residuals being normally distributed rather than raw scores. Histograms were examined and they suggested a reasonably normal distribution of data. Boxplots identified some outliers in BEST scores, however these were not identified as extreme outliers (more than three box lengths from the edge of the box) and 5% trimmed means did not vary largely from the full mean value suggesting these scores were not biased by outliers.

Table 14. Test of Normality

207

Tests of Normality

Kolmogorov-Smirnova Shapiro-Wilk

Statistic df Sig. Statistic df Sig.

BESTMedian1 .072 71 .200* .989 71 .797

BESTMedian2 .088 48 .200* .968 48 .205

BESTChange .129 48 .044 .940 48 .016

Table 15: Z scores of skewness and kurtosis

  Skewness

SE of Skewness

Z score of Skew Kurtosis

SE of Kurtosis

Z score of Kurtosis

BESTMedian1

-0.185 0.285

-0.6491228

07 0.317 0.563 0.563055062BESTMedian2

-0.556 0.343

-1.6209912

54 -0.047 0.674 -0.069732938BESTChange

-0.857 0.343

-2.4985422

74 2.018 0.674 2.994065282Histograms and Boxplots

Figure 11. Histogram of BEST Change score

208

Figure 12. Boxplot of BEST Change score

Figure 13. Boxplot of BEST Median 2

209

Appendix Q: Comparisons of completers and non-completers

Alternative cut-off (session 18)

STEPPS criteria for dropout includes missing a consecutive three sessions, by this definition anyone who attended up to session 18 would be classified as completing treatment (unfortunately many groups did not stick to this criteria and allowed participants to attend following three consecutively missed sessions). 40 participants attended to, at least, session 18, compared to 31 who dropped out before session 18. Comparisons of these groups were examined to explore any significant between group differences.

Completers were compared to non-completers on each of the baseline measures using t-tests (except DSH which was compared using Mann Whitney U test). Using this criterion for drop-out, the groups did not significantly differ on any of the baseline measures tested.

Figure 14. Histogram of BEST Median 2

210

Appendix R: Preliminary Investigations of Relationships with dependent variables

Table 16: Confounding Covariates

Pairwise deletion sample N=48

BEST Median 2 BEST Change

Confounding variables

Age -0.050 (p=0.741) -0.104 (p=0.486)

Years since diagnosis -0.017 (p=0.910) -0.074 (p=0.627)

No. STEPPS groups started 0.200 (p=0.177) -0.154 (p=0.300)

No. STEPPS groups completed 0.038 (p=0.799) -0.109 (p=0.467)

Service usage: Crisis team 0.039 (p=0.798) 0.158 (p=0.300)

Service usage: A&E 0.108 (p=0.480) 0.093 (p=0.543)

Service usage: Home treatment 0.061 (p=0.693) 0.122 (p=0.425)

211

team

Service usage: Psychiatric inpatient 0.108 (p=0.479) -0.391* (p=0.008)

Service usage: General inpatient -0.038 (p=0.803) 0.170 (p=0.265)

No. DSH 0.134 (p=0.373) 0.080 (p=0.597)

Estimated full sample N=71

BEST Median 2 BEST Change

Confounding variables

Age -0.019 (p=0.877) -0.036 (p=0.769)

Years since diagnosis -0.070 (p=0.570) 0.026 (p=0.832)

No. STEPPS groups started 0.127 (p=0.293) -0.067 (p=0.580)

No. STEPPS groups

completed

0.015 (p=0.900) -0.079 (p=0.518)

Service usage: Crisis team 0.048 (p=0.704) 0.064 (p=0.615)

Service usage: A&E 0.085 (p=0.505) 0.032 (p=0.803)

Service usage: Home

treatment team

0.047 (p=0.714) 0.054 (p=0.670)

Service usage: Psychiatric

inpatient

0.082 (p=0.519) -0.325* (p=0.009)

Service usage: General

inpatient

0.018 (p=0.890) 0.033 (p=0.795)

No. DSH -0.009 (p=0.940) 0.233 (p=0.058)

Continuous confounding covariates were correlated with BEST Change score and BEST Median 2 score (BEST dependent variables) to examine any relationship between these variables prior to entering them into regression analyses. There was very little variation in participant’s service usage at baseline and scores were not normally distributed. Scatterplots were generated to examine possible relationships between baseline variables and BEST dependent variables. Only baseline admission to psychiatric hospital at baseline was significantly related to dependent variables. However only one participant had this kind of admission and it was later statistically identified as an outlier.

Categorical baseline variables were also examined to explore their relationship with the BEST dependent variables. T-tests were used to explore differences in BEST dependent variables between the following baseline variable groups; presence of diagnosis, engagement in other treatment and previous engagement in STEPPS. None of these showed significant group differences. However, it is worth noting that there were very unequal numbers between these group. Very few participants were

212

without a BPD diagnosis, only 6 participants were partaking in current additional therapy and only two participants had previously engaged in STEPPS. None of these variables were entered into the regression analyses.

A one-way analysis of variance was conducted to examine differences between the individual STEPPS groups. A total of 14 different groups were included into the final sample, sites 1 and 11 were removed from the analysis as each only had one participant with a BEST Change score. Levene’s test for homogeneity of variance was significant indicating that this assumption was violated, for this reason Welch’s F is reported. Welch’s F= 1.869 (p=0.17) demonstrating that BEST Change score did not significantly differ between groups.

When the estimated full sample (N=71) was examined, there was a significant negative correlation between the number of inpatient psychiatric treatment admissions and BEST Change score (r= -.391, p=0.008).

Predictor variables

Each predictor variable was individually correlated with BEST dependent variables. Scatter plots were generated to examine relationships between variables.

The relationship between BPD severity (ZAN-BPD total score) and BEST dependent variables were investigated using Pearson product-moment correlation coefficient. There was a medium, positive correlation between BPD severity and BEST Change score, r=0.371, n=48, p= 0.009. There was also a medium, positive correlation between BPD severity and BEST Median 2, r=0.330, n=48, p= 0.022. Both of these correlations demonstrated that more severe BPD was associated with greater improvements following treatment. Positive correlations were also observed with the estimated full sample were used (BPD severity and BEST Change score: r=0.247, n=71, p= 0.038. BPD severity and BEST Median 2: r=0.343, n=71, p= 0.003).

Relationships between individual BPD symptoms and BEST dependent variables were investigated using Spearman’s rho. Identity disturbance demonstrated a medium positive correlation, rho= 0.346, n=48, p=0.016, with BEST Change score, with higher scores in identity disturbance associated with larger improvements on the BEST following treatment. Relationship instability also reached statistical significance when correlated with BEST Change score, rho= 0.287, n=48, p=0.048. This was a small to medium positive correlation demonstrating that higher relationship instability scores were associated with larger changes in BEST score. Dissociation demonstrated a medium positive correlation, rho= 0.423, n=48, p=0.003, with BEST Median 2 score, with higher scores in dissociation associated with larger BEST Median 2 score (at the end of treatment). High baseline dissociation was related to higher BPD scores at the end of treatment.

213

No other symptom level measure was significantly correlated with BEST dependent variables. None of the symptom level predictors correlated with each other with a value>0.7 so these variables could be entered into the same regression model without violating assumptions.

When the estimated full sample was examined, emptiness demonstrated a small positive correlation, rho= 0.268, n=71, p=0.024, with BEST Median 2, with higher scores in emptiness associated with higher scores at the end of treatment. Dissociation demonstrated a medium positive correlation, rho= 0.383, n=71, p=0.001, with BEST Median 2, with higher scores on dissociation associated with higher scores at the end of treatment. When symptoms were correlated with BEST Change score, relationship instability also reached statistical significance, rho= 0.246, n=71, p=0.039. This was a small positive correlation demonstrating that higher relationship instability scores were associated with larger changes in BEST score.

Appendix S: Regression analysis additional information

Preliminary analyses were conducted to ensure no violation of the assumptions of normality, linearity, and homoscedasticity.

An outlier was identified by examining a regression scatterplot (below). One participant indicated an inpatient admission at baseline. This was further explored using Mahalanobis distance values, the value for this participant was 62.02 which greatly exceeded the critical value of 13.82 for a regression with two independent variables. For this reason, this participant’s data was removed from all subsequent regression analyses.

214

Collinearity diagnostics

Tolerance values below 0.10 and VIF values above 10.0 are considered a violation of assumptions of multicollinearity.

Figure 15. Scatterplot of Regression with BEST Change score

Table 17. Collinearity diagnostics of regression modelsPairwise Deletion Sample

Model Collinearity StatisticsSeverity Model Tolerance VIF

ZanTotal 1.000 1.000Symptom Model

BEST Median 1 1.000 1.000

BEST Median 1ZanIdenti

ZanDissocZanRelation

.719

.833

.783

.859

1.3091.2001.2771.164

EM full SampleModel Collinearity Statistics

Severity Model Tolerance VIF

ZanTotal 1.000 1.000Symptom Model

BEST Median 1 1.000 1.000BEST Median 1

ZanDissocZanRelation

.783

.816

.861

1.2771.2251.162

215

Assumptions of normality, linearity, and homoscedasticity

Severity Model using pairwise deletion

No cases had standardised residuals over 3.0 or below -3.0. Mahalanobis distance values did not exceed criteria.

216

Severity Model using EM full sample

No cases had standardised residuals over 3.0 or below -3.0. Mahalanobis distance values did not exceed criteria.

Figure 16. PP Plot of severity regression standardised residuals of BEST Change score from pairwise deletion sample

Figure 17. Scatterplot of severity predicotors of BEST Change score with pairwise deletion sample

217

Symptom regression using pairwise deletion sample

One case had standardised residuals below -3.0 however Cook’s maximum distance value (.535) does not suggest this value would have an undue influence on the results based on Tabachnick and Fidell’s criteria of score over 1 being problematic. Mahalanobis distance values did not exceed criteria.

Figure 19. Scatterplot of severity predictors for BEST Change score with EM sample

Figure 18. PP Plot of severity regression standardised residuals of BEST Change score from EM sample

218

Symptom regression using EM full sample

One case had standardised residuals over 3.0 however Cook’s maximum distance value (.198) does not suggest this value would have an undue influence on the results. Mahalanobis distance values did not exceed criteria.

Figure 20. PP Plot of symptom regression standardised residuals of BEST Median 2 from pairwise deletion sample

Figure 21. Scatterplot of symptom predictors with BEST Median 2 from pairwise deletion sample

219

Appendix T: Exploring regression with all 9 symptom measures.

An alternative approach to the symptom level regression analysis would have been to enter all 9 variables into a regression model and work through, removing those which did not significantly contribute to the model. However, this method would have

Figure 23. Scatterplot of symptom predictors with BEST Median 2 from EM sample

Figure 22. PP Plot of symptom regression standardised residuals of BEST Median 2 from EM sample

220

statistically required a larger number of participants to enter all 9 nine variables into a single model. Despite this limitation, this approach was also explored, the approach produced a model with the same variables. When all 9 symptoms are entered into a regression model it remained that only identity disturbance and dissociation made significant contributions to the model as demonstrated in the table below.

Table 18: Regression using all 9 symptom level predictors

Model

Unstandardized

Coefficients

Standardized

Coefficients

t Sig.

95.0% Confidence

Interval for B

B

Std.

Error Beta

Lower

Bound

Upper

Bound

1 (Constant) 3.267 5.764 .567 .574 -8.342 14.877

BESTMedian1 .782 .127 .676 6.154 .000 .526 1.038

2 (Constant) 6.147 6.234 .986 .331 -6.497 18.790

BESTMedian1 .893 .155 .772 5.744 .000 .578 1.208

ZanAnger -1.722 1.438 -.161 -1.198 .239 -4.639 1.194

ZanAffect -1.420 1.396 -.122 -1.017 .316 -4.252 1.412

ZanEmpti .188 1.409 .017 .133 .895 -2.669 3.045

ZanIdenti -3.399 1.203 -.360 -2.826 .008 -5.839 -.960

ZanDSH -.768 .985 -.099 -.780 .440 -2.766 1.229

ZanImpulsi .680 1.050 .079 .648 .521 -1.449 2.809

ZanDissoc 2.891 1.196 .287 2.418 .021 .466 5.315

ZanAbandon .680 .998 .080 .681 .500 -1.345 2.705

ZanRelation -.327 1.218 -.036 -.268 .790 -2.796 2.143

Appendix U: Eigen values of symptom level predictors

Table 19: Eigen values for regression model with symptom level predictors using the pairwise deletion sample

221

Collinearity Diagnosticsa

Mode

l

Dimensio

n

Eigenvalu

e

Conditio

n Index

Variance Proportions

(Constant

)

BESTMedian

1

ZanIden

ti

ZanDisso

c

ZanRelatio

n

1 1 1.977 1.000 .01 .01

2 .023 9.335 .99 .99

2 1 4.517 1.000 .00 .00 .01 .01 .01

2 .266 4.121 .00 .00 .12 .00 .77

3 .110 6.396 .02 .01 .73 .34 .18

4 .086 7.267 .17 .05 .11 .63 .01

5 .021 14.694 .80 .94 .03 .02 .03

a. Dependent Variable: BESTMedian2

Table 20: Eigen values for regression model with symptom level predictors using the EM full sample

Collinearity Diagnosticsa

Model Dimension Eigenvalue

Condition

Index

Variance Proportions

(Constant) BESTMedian1 ZanDissoc ZanRelation

1 1 1.980 1.000 .01 .01

2 .020 10.024 .99 .99

2 1 3.667 1.000 .00 .00 .01 .02

2 .226 4.032 .02 .01 .04 .95

3 .089 6.402 .09 .04 .92 .00

4 .018 14.141 .89 .95 .03 .04

a. Dependent Variable: BESTMedian2

Appendix V: Content Analysis

Qualitative data was gathered from participants to gather information on their perceptions of their suitability for STEPPS treatment.

222

Description of qualitative data collection

Quantitative and qualitative data collection occurred in parallel but qualitative data was only collected at T3. It was intended that a convergent mixed methods framework (Fetters, Curry & Creswell, 2013) would be used to integrate quantitative data from questionnaire measures and qualitative data from open ended feedback questions on participants’ views on their suitability for STEPPS treatment. Although the design would not have been strictly a mixed method as both quantitative and qualitative data were analysed using quantitative analyses, the mixed methods framework was considered a helpful framework for integrating these findings. Qualitative data was gathered alongside quantitative in order to elaborate and contextualise quantitative findings in relation to the participants’ experiences. Qualitative data was gathered through written feedback questions asking about participants’ views about their suitability to treatment.

Measure

Qualitative feedback

An additional qualitative feedback form was administered at T3 to gather supportive qualitative data which could act to disambiguate quantitative results. These questions focused on participants views of their suitability for STEPPS treatment and would be subjected to quantitative content analysis.

Plan for analysis

Separate analyses of the quantitative and qualitative data sets were conducted. Integration of qualitative and quantitative data was to occur through merging these results. Findings were intended to be integrated within the discussion through narrative using a weaving approach for reporting the findings with a commentary on how these data sets relate.

To incorporate and examine participant’s perceptions about their membership in STEPPS treatment qualitative data was gathered through questionnaires asking participants about their views on their own suitability for STEPPS and its relevance to them. Quantitative content analysis was used to reduce the qualitative data gathered from the feedback questions given to participants into themes and categories. An inductive approach was taken with a procedure following that outlined by Elo & Kyngäs (2008). Through examination of the manifest content and substantive features of the text, units of code were extracted and refined. The first stage of the analysis involved open coding of the data, where responses were very broad and varied this information was then collapsed into distinct categories of related content. As additional themes emerged, codes were added to capture these until an exhausted list of categorise were created.

Code book for Content Analysis

223

1. Suitability to treatment (1: Yes, 2: No, 3: Partially/mixed)2. Why suited (1: readiness/stage of change, 2: structured and organised format,

3:long time frame, 4: valuing shared experience, 5: increased understanding, 6:helping others, 7: facilitators, 8:tracking progress, 9: relevant content, 10: skills and techniques (e.g. filters), 11: in-depth, 12: made improvements/ found helpful)

3. Why not suited (1: difficult hearing others’ problems, 2:relaxation, 3: group setting, 4: felt different to others, 5:relationships with other groups members, 6: too short 7: some content irrelevant)

4. Was all content relevant to you (1: Yes, 2: No)5. Aspects not relevant (1: insufficient relaxation, 2:relationships with others, 3:

unspecified, 4: drugs and alcohol, 5: goal setting, 6: work/leisure balance, 7: lifestyle, 8: abuse in relationships, 9: other people’s experiences)

Results

Qualitative data was collected from 42 participants who completed treatment, responses were varied in content and the amount of information provided. As qualitative data was limited, themes were formed around the questions participants answered. Small combinations of words and short phrases were clustered into content categories. A second coder rated 48% of the data, inter-coder reliability was established using Krippendorff's alpha and is presented for each item below each table. Table 1 presents the answers from a closed question and data from open-ended questions are presented in tables 2, 3 and 4. The code book for scoring during the content analysis and sample scoring can be found below.

Table 21. Suitability to treatmentUnit of code Number of responsesYes 34No 2Partly/ mixed 6Krippendorff's alpha coefficient from inter-rating: 1.00

81% of participants who completed treatment and completed the qualitative questionnaires perceived themselves as being suited to STEPPS treatment. Only two participants who completed treatment felt that they were not suited to STEPPS.

Table 22. Why suitedCategories Including Number of responsesReadiness to change Felt ready/ right time 3Program factors Structure/ organisation/ format

Long time frame/ In-depth/ Tracking progress

7

Facilitators 3

224

Group Valuing other group members/ shared experience/ Helping others

7

Content Understanding diagnosis/ experiences/ self Relevant content/ made sense Skills based/ techniques

15

Unspecified 5Krippendorff's alpha: 0.71

There were a number of varied reasons why people felt they were suited to STEPPS, these fell largely into five broad categories; readiness to change, program factors, facilitators, group and content. A final category was included to capture answers which did not specify a reason for suitability. The most popular reasons for feeling suited to STEPPS related to relevant content which helped people understand their diagnosis better. Other popular reasons included feeling suited to the structure and benefiting from working within a group with shared experiences.

Table 23. Why not suitedCategories Including Number of responsesContent Content irrelevant 1Experience of group Difficult hearing others’

problems/ Felt different to others/ No relationships with other group members/ Group setting

3

Format Too short /Disruptive group 1Krippendorff's alpha: 1.00

There were much fewer answers to why participants felt treatment was not suited to them reflecting the fewer participants who felt unsuited to treatment. These answers largely focused on participant’s struggling with attending therapy in a group.

Table 24. Aspects not relevantCategories Including Number of responsesSpecific aspect of group content

Including drug and alcohol misuse/ abuse etc

9

Group Other people’s experiences 1Unspecified 2None 19Krippendorff's alpha: 1.00

45% of participants who completed the questionnaires felt that all aspects of STEPPS were relevant to them. This is fewer than the 81% of participants that felt that STEPPS was well suited to them demonstrating that some participants still experienced STEPPS as well suited despite not relating to all aspects of the treatment

225

program. The aspects which participants identified as not relevant to them largely related to a specific aspect of treatment such as work on addiction or coping with abuse. This is to be expected from a group which is designed to cover a wide array of topics which address different participants’ difficulties.

Discussion

Data from the content analysis came only from those who attended treatment to session 20. The majority of these participants felt that STEPPS was suited to them. Future research which gathered qualitative feedback data from both completers and participants who dropped out of treatment early would be interesting to compare the views of these two groups. It is possible that this may generate further hypotheses about participant’s suitability to treatment

A number of the reasons why participants felt STEPPS was suited to them included having a shared experience with other groups members, understanding more about their diagnosis and experiences and feeling the content was relevant to them.

Appendix W: Descriptive statistics of follow-up data

Follow-up questionnaires were sent out to all participants who had attended past session 15 and who had indicated on their consent form that they were happy to

226

receive follow-up questionnaires. Questionnaires were sent either in the post or electronically via Qualtrics (2017). 51 participants were sent follow-up questionnaires, 47.06% responded, providing 24 completed and returned measures. Due to this low response rate the follow-up data was not used in the main analysis.

Of these 24, 91.7% were female. Ages ranged from 19-55. The mean BEST score at follow-up was 37.25 (SD= 15.32) and ranged from 14.0 to 67.0. This follow-up BEST mean was lower than the mean BEST score at the end of treatment from the pairwise deletion sample (38.10) however this sample only represents a small sub-sample who were motivated to complete follow-up measures. It is possible that this sample may be positively biased with participants with positive experiences more likely to complete follow-up measures. Examination of the BEST histogram (below) shows a normal distribution with a clustering of scores around the lower end. This could demonstrate how participant’s scores remained low following treatment, however any firm conclusions would require a larger sample size for confirmation.

Part 3 – Summary of Clinical Experience

Summary of clinical placement experience from November 2014 – September 2017

227

YEAR 1: Adult Mental Health (November 2014 – September 2015): Split placement

Setting: Split placement: Rehabilitation and Recovery Service and Improving Access to Psychological Therapies

Clients and presenting difficulties: Working age adults (aged 18 – 65 years). Within the Rehab and Recovery team I worked with severe and enduring mental health problems. I largely worked with clients with psychosis but Comorbid diagnoses included generalised anxiety, hoarding disorder and depression, borderline personality disorder and bipolar disorder. Within the IAPT placements I worked with working age adults (aged 18 – 65 years) with mild to moderate mental health problems. Presentations of clients I worked with included depression, social anxiety, post-traumatic stress disorder and trichotillomania.

Main models used: CBT, Compassion focused therapy, narrative approach.

Modes and types of work: Direct individual work, indirect work, consultation, joint working with other professionals, assessment, group formulation with staff team and teaching to staff.

YEAR 2: Children and Adolescents (October 2015 – March 2016): Split placement

Setting: Split placement across Child and Adolescent Mental Health Service: Access Team and The Youth Support Team

Clients and presenting difficulties: Children and adolescents (aged up to 18 years) with moderate to severe mental health problems. Presentations of clients included depression, social anxiety, ASD, challenging behavior, OCD and emerging psychosis.

Main models: CBT, systemic and IPT.

Modes and types of work: Direct individual work, family work, neuropsychological assessments, consultation, joint working with other professionals, presentation and training to staff.

YEAR 2: Older Adults (April 2016 – September 2016)

228

Setting: Split placement across a Dementia Assessment Service, Living Well with Dementia team and an Older Adults Assessment and Treatment team

Clients and presenting difficulties: Older adults (aged 65 + years plus one working age adult with early onset dementia) affected by dementia and/or mental health problems. Presentations included depression, anxiety, early onset dementia and difficulties associated with dementia.

Main models: CBT, systemic, psychodynamic and neuropsychological models.

Modes and types of work: Direct work with individuals and families, indirect work with and staff teams, group facilitation, neuropsychological assessment, case presentation and teaching, joint working and neuropsychological assessments.

YEAR 3: Specialist placement: Eating Disorders (October 2016 – March 2017)

Setting: Split placement across Adult Outpatient Eating Disorders and Adolescent Inpatient Eating Disorders.

Clients and presenting difficulties: Adults (aged 18 + years) with eating disorders. I worked with individuals with bulimia nervosa and anorexia nervosa. Comorobid diagnoses included anxiety, depression, agoraphobia and bipolar disorder as well as emerging personality disorder in adolescent services.

Main models: CBT, Compassion focused therapy and systemic.

Modes and types of work: Direct work with adults and adolescents, assessment, group facilitations with adults and adolescents, supervision of undergraduate psychologists work and teaching presentation.

YEAR 3: Learning Disabilities (April 2017 – September 2017)

Setting: Community Team for People with Learning Disabilities

Clients and presenting difficulties: Adults (aged 18+ years) with learning disabilities. I worked with individuals with ASC, physical health difficulties, challenging behavior and difficulties with self-esteem.

229

Main models: CBT, psychodynamic, systemic and neuropsychological

Modes and types of work: Direct work with individuals, indirect work with family and staff members, consultation, neuropsychological assessment and PBS.

Part 4; PSYCHD CLINICAL PROGAMMETABLE OF ASSESSMENTS COMPLETED DURING TRAINING

Year I AssessmentsASSESSMENT TITLE

WAIS WAIS Interpretation (online assessment)Practice Report of Clinical Activity

Compassion Focused Therapy assessment & formulation of a male in his forties with anxiety and hoarding behaviours

Audio Recording of Clinical Activity with Critical Appraisal

Critical Appraisal of Audio Recording of an intervention session of CBT for depression with a male in his late twenties.

Report of Clinical Activity N=1

Case Report of a Cognitive Behavioural assessment, formulation and intervention for depression with a male in his late twenties.

Major Research Project Literature Survey

Literature Survey of Individual Characteristics Predictive of Outcome in Short-Term Therapy for Borderline Personality Disorder.

Major Research Project Proposal

Effect of severity and subtypes of Borderline Personality Disorder on response to STEPPS treatment

Service-Related Project Service evaluation of former inpatients’ perspectives on contributors to a positive inpatient ward environment

Year II AssessmentsASSESSMENT TITLE

Report of Clinical Activity

Case Report of an Interpersonal Therapy assessment, formulation and intervention for an adolescent female with depression.

PPLD Process Account Personal and Professional Learning Groups Process AccountPresentation of Clinical Activity

Case presentation of a psychodynamically informed formulation, intervention and consultation of early onset dementia in a lady in her mid fifties.

Year III Assessments ASSESSMENT TITLE

Major Research Project Literature Review

A systematic review evaluating severity, symptoms and subgroups of borderline personality disorder as predictors of psychological therapy outcomes.

Major Research Project Empirical Paper

Predictors of response to Systems Training for Emotional Predictability and Problem Solving (STEPPS) for the treatment of Borderline Personality Disorder

Final Reflective Account On becoming a clinical psychologist: A retrospective, developmental, reflective account of the experience of training:

230

The person and the psychologistReport of Clinical Activity – Formal Assessment

A neuropsychological assessment for dementia with a lady in her late 70s