the validity of a clinical test for the diagnosis of lumbar spinal ......lumbar spinal stenosis,...

Accepted Manuscript

The validity of a clinical test for the diagnosis of lumbar spinal stenosis

Robert Dobbs, Stephen May, Philip Hope

PII: S1356-689X(16)30357-5

DOI: 10.1016/j.math.2016.05.332

Reference: YMATH 1860

To appear in: Manual Therapy

Received Date: 11 January 2016

Revised Date: 23 May 2016

Accepted Date: 25 May 2016

Please cite this article as: Dobbs R, May S, Hope P, The validity of a clinical test for the diagnosis oflumbar spinal stenosis, Manual Therapy (2016), doi: 10.1016/j.math.2016.05.332.

This is a PDF file of an unedited manuscript that has been accepted for publication. As a service toour customers we are providing this early version of the manuscript. The manuscript will undergocopyediting, typesetting, and review of the resulting proof before it is published in its final form. Pleasenote that during the production process errors may be discovered which could affect the content, and alllegal disclaimers that apply to the journal pertain.

http://dx.doi.org/10.1016/j.math.2016.05.332

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

Title page


Robert Dobbs1, Stephen May2*, Philip Hope1

1 Physiotherapy Department, Chesterfield Royal Hospital, Chesterfield, Derbyshire

2 38 Collegiate Crescent, Faculty of Health and Wellbeing, Sheffield Hallam University, Sheffield, S10 2BP

[email protected]

0114 2252370

*Corresponding author

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPTThe validity of a clinical test for the diagnosis of lumbar spinal stenosis

Abstract

Background: The diagnosis and management of acquired lumbar spinal stenosis (ALSS) is

an area of growing interest with an increase in its prevalence and detection in the older

population.

Objectives: To investigate the diagnostic accuracy of a modified extension test (MExT) for

diagnosing ALSS in subjects aged fifty or over.

Methods: Symptomatic response of the bi-component MExT was evaluated and

compared against MRI findings in 30 subjects. Estimates of sensitivity, specificity, likelihood

ratios and post-test probabilities were all calculated, and the capability of the test to

discriminate between grade and location of stenosis was also appraised.

Results: MExT sensitivity was high at 92% (95% CI, 72-99%) leading to a significant

negative likelihood ratio at -LR 0.2 (95% CI, 0.03-1.36); conversely, specificity was low at

40% (95% CI, 7-82%) with only a small positive likelihood ratio of +LR 1.53 (95% CI, 0.74-

3.16). All correlations between the MExT and concurrent grade, or location of stenosis

appeared weak and insignificant.

Conclusions: The MExT was found to demonstrate acceptable criterion validity in relation to

ruling-out a diagnosis when a negative result was observed; however, further validation

appears warranted.

Words 184

Keywords: Lumbar, Spinal stenosis, Extension Test, Diagnosis, Validity

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

1


1. INTRODUCTION

World epidemiological projections indicate that the number of people within the

population aged sixty-five or over will have increased from sixteen to twenty five

per cent by the year 2040 (Population Reference Bureau, 2010; Maloney-

Backstrom et al., 2011). In response to such demands, health research agendas

have increasingly shifted focus towards the management of degenerative

conditions proposing direct efficiency savings through enhanced examination and

treatment rigour (IAGG, 2007). One such condition gaining attention is acquired

lumbar spinal stenosis (ALSS). Point prevalence estimates suggest degenerative

lumbar conditions may afflict up to twenty per cent of the older-aged population

(Keller et al., 2003; Lyle et al., 2005). Of those individuals referred to an

orthopaedic spinal specialist, approximately fourteen per cent are found to exhibit

severe stenotic change requiring surgical decompression (Aalto et al., 2006; Slatis

et al., 2007; Watters et al., 2008). As such, ALSS has fast become the leading

cause of spinal surgery in this population (Mannion et al., 2010; Steurer 2010),

whilst its deleterious influence on locomotor capability and psychosocial aspects of

health renders it a precursor for falls, depression and cardiovascular disease

(Middleton and Fish, 2009; Kim et al., 2011).

The clinical syndrome of ALSS develops as a consequence of incremental

damage to spinal tissues. Narrowing of the intervertebral disc reduces the

distance between adjacent vertebrae, whilst an alteration to biomechanical force

initiates arthritic change and ligamentous laxity (Fredrickson et al., 2001; Vo et al.,

2005; Papadakis et al., 2011). Subsequent thickening of the ligamentum flavum,

pedicles or vertebral facets diminish the space available for neural and vascular

structures, with ischaemic change ensuing within the spinal canal (central

stenosis), nerve root canal, or intervertebral foramina (foraminal stenosis) (Vo et

al., 2005). Sufferers of ALSS present with symptoms of lower extremity pain or

paraesthesia, occurring with or without back pain especially with positions of

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

2

extension, which limit walking capacity secondary to neurogenic claudication (Bal

et al. 2006; Malmivaara et al., 2007; Ogikubo et al., 2007).

1.1 Diagnostic methods

However no widely accepted diagnostic criterion for ALSS currently exist (Goh et

al., 2004; Haig et al., 2006; Genevay et al., 2010; Genevay and Atlas, 2010).

Recent reviews have cited heterogeneity and unsatisfactory research standards,

which prevent conclusions on the performance of diagnostic tests being drawn

(De Graaf, 2006; Genevay and Atlas, 2010). Moreover, appraisal of these tests

employed found many to be grounded in anecdotal evidence rather than robust

measurement (Haig et al., 2006; Browne and Roberts, 2008; Deyo et al., 2009).

For this reason supplementary analysis of all diagnostic procedures is needed. A

recent review reported postural effects on symptoms to have the most useful

diagnostic criteria, whereas clinical tests were less consistently useful (de

Schepper et al. 2013).

With a distinctive clinical presentation initial diagnosis of ALSS is made in relation

to patient report alone; however, clinical prediction rules based solely on patient

report have only been done on small populations (Sugioka et al., 2008). Accurate

findings from physical testing would help to validate patient report, however

selection of these tests has proven problematic. Neurological examination is

commonly observed to be normal (Bassewitz and Herkowitz, 2001; Genevay and

Atlas, 2010), thus the most pertinent physical findings typically relate to

symptomatic change in accordance with lumbar movement. Although both

treadmill and bike tests have been postulated as useful when differentiating ALSS

from claudication arising from peripheral vascular disease (Fritz et al., 1997;

Tenhula et al., 2000; Deen et al., 2000; Yukawa et al., 2002), research findings

appear inconsistent (Dong and Porter, 1989; Moon et al., 2005).

Thus postural tests gauging the symptomatic response to lumbar extension

appears pertinent to the initial physical examination. Studies have exhibited the

extension movement to narrow spinal space by up to twenty per cent (Panjabi et

al., 1983; Inufusa et al., 1996; Fritz et al., 1998), with percentages substantially

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

3

increasing when coupled with degenerative changes (Schronstrom and Willen,

2001; Westergaard et al., 2009; Kishner et al., 2010). Regardless of a strong

theoretical foundation, only one study by Katz et al., (1995) is frequently cited by

other authors when appraising validity (Fritz et al., 1998; Lurie, 2005; Vo et al.,

2005; Westergaard et al., 2009). Evaluating a range of physical components this

study observed a small positive likelihood ratio of 1.6 with thirty-seconds of lumbar

extension and provocation of at least thigh pain. Positive findings on coupled

quadrant movements of extension and side-flexion is recognised as ‘Kemps sign’

(Jenis and Howard, 2000), and is stated to occur frequently in foraminal stenosis

(Watanabe et al., 2010; Eguchi et al., 2010). Thus further evaluation of 'Kemps

sign' or combined extension and side-flexion movement seems warranted, as it

would be a simple and straightforward test to employ in the examination of those

with suspected ALSS, because of age and findings from the history.

1.2 Aims

The aim of this study was to prospectively evaluate the validity of the ‘Modified

Extension Test’ (MExT) an extension-based test proposed by the authors. This

test gauges the symptomatic response to lumbar movement, and is postulated to

support the diagnosis of ALSS whether located centrally or foraminally. Moreover,

no consistent information currently exists regarding whether symptoms occur more

rapidly following the commencement of extension in those with severe stenotic

change. Such discernments would support the identification of those individuals

more likely to require surgical consideration (Atlas et al., 1996; Amundsen et al.,

2000; Benoist, 2002; Slatis, 2007; Watters et al., 2008), and hence, was selected

for sub-analysis despite the inconsistencies that there was such a link in previous

literature (Wang et al., 2008; Haig and Tomkins, 2010; Kishner et al., 2010).

2. METHODS

2.1 Ethical approval

This was a prospective blinded concurrent-criterion related validity design to

establish MExT sensitivity, specificity and predictive capability in diagnosing

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

4

ALSS. Ethical approval was granted by the Nottingham Two proportionate review

sub-committee, with site specific permission and insurance obtained in conjunction

with the local CRH NHS trust research and development team. Institutional

clearance was further established on behalf of Sheffield Hallam University Health

and Care Ethics Committee. All patients provided signed, informed consent prior

to participation.

2.2 Sample

Subjects aged fifty or over were recruited from Chesterfield Royal Hospital (CRH)

over a six-month period. Consecutive patients attending either the Extended

Scope Practitioner (ESP) or musculoskeletal (MSK) physiotherapy clinic were

screened, and where deemed eligible were invited to participate. Recruitment was

based on presenting symptoms, with inclusion criterion requiring subjective report

of unilateral or bilateral pain or paraesthesia radiating below the gluteal fold. In

accordance with guidelines for MRI scan referral, symptoms were required to have

been present for a minimum of six weeks (Bussieres et al 2008). Patients who

had a lack of mental capacity to provide informed consent, or were unable to

understand English, or there was a pre-existing diagnosis of peripheral vascular

disease, or of Ankylosing Spondylitis were excluded. A STARD flowchart is

provided in Figure 1.

In the absence of a defined ‘true’ diagnostic gold standard, Magnetic Resonance

Imaging (MRI) was selected as the reference test upon which to evaluate the

index test’s (MExT) validity. Whilst concern has been documented regarding the

relative inconsistency between clinical symptoms and degree of observed

radiographic change (Wang et al., 2008; Haig and Tomkins, 2010; Kishner et al.,

2010), clinical guidelines recommend its use as a primary tool in ALSS diagnosis

(Watters et al., 2008). MRI has demonstrated both significant positive likelihood

ratios, (between 8.1 and 16.2), and negative likelihood ratios (between 0.3-0.19)

establishing its usefulness for ruling in or out a diagnosis (Fritz et al., 1998).

Moreover, the reliability of image interpretation for ALSS has been established

with Lurie et al., (2008) observing Kappa scores of (0.82) and (0.83) for inter-rater

and intra-rater reliability respectively. The definitions used are in tables 1 and 2.

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

5

Table 1: Definitions of central ALSS severity based upon cross-sectional area of

the canal and presence of cerebrospinal fluid.

Severity Definition

Severe ALSS >50% narrowing with absence of cerebrospinal fluid

Moderate ALSS >50% narrowing with more nerve root than cerebrospinal fluid

Mild ALSS

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

6

conversely, when a red ball was selected component two, (extension/side-flexion),

was the initial test component.

Each subject received the MExT (Figure 2), whilst an independent rater timed

each component up to the sixty second cut-off point. The result of the MExT was

documented against each component, and when positive, the associated time for

symptoms to occur was recorded. A positive test was deemed to be when pain

below the gluteal fold was exacerbated or produced with one or both components

of the test; exacerbation or production of back pain alone did not count. The

participant was subsequently monitored for ten minutes and advised on what

measures should be undertaken in the event of adverse effects. In order to limit

partial verification bias each participant was referred for an MRI scan irrespective

of MExT result (Pewsner et al., 2004; Whiting et al., 2004). The subsequent

lumbar MRI scan was undertaken on a 1.5 Tesla, Siemens, Avanto scanner within

a week of the index test, before being analysed by a consultant radiologist blinded

to clinical findings and index test result. The results from both the index and

reference tests were returned to the principal investigator and evaluated, whilst

each subject received an ESP follow-up appointment to discuss further

management options.

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

7

Figure 2. Index Test ‘MExT’ Procedure

Initial set up

Standing position

S Component 1 Limb Placement

The participant adopts the neutral standing position, with the therapist situated to the side of the individual. The therapist assumes a comfortable step stance posture using the medial aspect of their knee (closest to the participant) to maintain the knee in extension and provide support to the participant (see limb placement image below). The therapist then assumes an appropriate hold upon which to facilitate the test. The participant is asked to place their arm closest to the therapist across their chest with their hand close to the contralateral shoulder. The therapist places one hand on the lower lumbar spine and the other is located across the participants arm. The adopted position allows appropriate comfort for participant and therapist. Prior to test commencement, the participant is advised that they should inform the therapist if any symptomatic change occurs.

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

8

Component 1

Component 2

The therapist asks for the timer to commence on the command ‘Go’ as they gently manoeuvre the participant into their full available extension, remaining in that position for up to one minute. The procedure is terminated prior to one minute if either the leg symptoms have been reproduced (a positive test), or if the participant reports excessive lumbar spine discomfort to continue, (an inconclusive test). If the participant fails to report reproduction of their leg symptoms during this time period the test is deemed negative. The therapist terminates the test with the command ‘stop’ before the participant is returned to the neutral position and provided with a one minute rest to allow symptoms to return to normal prior to initiating stage three.

The participant is once again advised to inform the therapist should any symptomatic change occur. The therapist asks for the timer to commence as the participant is manoeuvred into full extension, and then side-flexion towards the symptomatic side. On the occasion that symptoms are bilateral the examiner selects only the most symptomatic side to test, with results only determined positive if symptoms are reproduced on this same side. If symptoms are reproduced on the contralateral side the test is deemed negative. The procedure is terminated in relation to the predefined criteria (as in component one) with all findings documented. The researcher asks the participant to rest for up to ten minutes following the procedure, taking the opportunity to answer any additional questions the participant may have and to ensure no immediate adverse effects have occurred. The physiotherapist only performs the test once to mirror clinical practice, whilst additionally reducing the likelihood of developing post-test soreness.

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

9

2.4 Pilot study

Preceding commencement of the trial, a pilot study of five subjects was completed

with results not included in the final analysis. Study procedures were appraised

and adapted accordingly, with inter-rater reliability of both stopwatch recordings

and MRI interpretation found to be above the limit set for acceptable

reliability >0.75 (Streiner and Norman, 2003). The minimal detectable change was

at a 95% confidence level.

One person performed each component of the test while two other people

simultaneously did the stopwatch timing. The intra-class correlation coefficient

(ICC) was 0.99 whilst the standard error of measurement (SEM) was 1.72 and the

minimal detectable change was 4.76. Moreover, MRI interpretation evaluation

was based upon agreement of stenotic grade, with a kappa coefficient value of

0.83 (95% CI 0.48, 1.0).

2.5 Data Analysis

Data was analysed using SPSS (Statistical Package for the Social Sciences)

version 16.0. Construction of a two by two contingency table allowed for initial

identification of test validity, with point-estimates of sensitivity and specificity

ascertained using a calculator alongside 95% confidence intervals (Bland, 2000).

Sensitivity and specificity are measures of test performance, discriminating

between subjects with or without a specific disease. In conjunction with statistical

and clinical recommendation acceptable levels were duly fixed at > 80% (Streiner

and Norman, 2003; Riegelman, 2005).

Positive and negative likelihood ratios were further calculated alongside 95%

confidence intervals to provide primary analysis of test validity. Likelihood ratios

estimate the extent to which an individual with ALSS is more likely to test positive

or negative when compared to someone without the disease. Evaluation utilising

likelihood ratios is preferable to that of the more commonly used positive and

negative predictive values, as likelihood ratios are not dependent upon disease

prevalence (McGee, 2002). As such, a positive likelihood ratio of > 2.0 was

perceived to highlight an important increase in ALSS likelihood (McGee, 2002),

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

10

conversely, a negative likelihood ratio of < 0.5 was adjudged to propose a reduced

ALSS likelihood. Post-test probabilities of a subject having ALSS in relation to a

specific MExT result were established from the likelihood ratios, and confirmed in

association with Fagan’s nomogram (Akobeng, 2007; Campo et al., 2010).

Sub-analysis of the relationship between symptom onset following MExT

commencement and the grading of ALSS whether mild, moderate or severe was

undertaken using Spearman’s rank correlation coefficient. Similarly, the Phi

coefficient was utilised to establish whether an association existed between

positive findings on a specific MExT component and a diagnosis of central or

foraminal ALSS. Significant findings were deemed to be p

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

11

Myotome lossb (%)

Reflex lossc (%)

Positive SLRd (%)

Intermittent Claudication (%)

1 (3)

3 (10).

3 (10)

25 (82) a = areas of sensory abnormality; b = weakness in muscle compared to the other side; c = diminished or absent; d =

reduced range compared to the other side

In relation to index and reference tests, eighty-seven per cent of the sample tested

positive to at least one component of the MExT. In accordance with reference test

findings sample prevalence of ALSS was calculated as eighty-three per cent, with

thirty per cent found to have central stenosis, thirty-three per cent foraminal

stenosis, and seventeen per cent both central and foraminal forms. Three per cent

were positive for stenosis at the lateral recess with ten per cent additionally found

to have an associated degenerative spondylolithesis at either the L4/5 or L5/S1

level.

3.2 Sensitivity, specificity and likelihood ratios

Sensitivity, specificity, and positive and negative likelihood ratios are reported in

table 4. Pre and post-test probabilities were calculated from pre and post-test odds

and confirmed against Fagan’s nomogram. Whilst the pre-test probability of a

subject within the sample having ALSS was 83% the post-test probability

marginally increased to 88% when a positive MExT was observed. Likewise post-

test probabilities based on the negative likelihood ratio observed a decrease in the

likelihood of ALSS to 49% when a negative result was detected.

Sub-analysis of correlations between grade of stenosis and symptom onset

following MExT commencement (Table 5), failed to demonstrate a significant

correlation for either central or foraminal elements in conjunction with Spearman’s

correlation coefficient. In relation to central stenosis a negative correlation of R =

- 0.147 was observed with symptom onset more rapid in severe stenosis, but this

correlation was not statistically significant (p = 0.684). Similar outcomes were

observed in relation to foraminal stenosis with a negative correlation of R = -

0.211, which was not statistically significant (p = 0.418).

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

12

Consideration of MExT selectivity in relation to stenosis location failed to identify

notable correlations between the matched components (Table 6). Findings of

central stenosis and a positive finding on the on the extension component of the

test observed a statistically non-significant negligible association (r = 0 .11, p =

0.54). There was a statistically non-significant association identified between the

extension / lateral flexion component of the test and foraminal stenosis (r = 0.09,

p = 0.62). The extension component of the test and foraminal stenosis had an

association of (r = 0.24, p =0.19) whilst the extension / lateral flexion component of

the test and central stenosis had an association of (r = 0.15, p = 0.41), but neither

was significant. The lack of statistical significance could be due to small sample

size and lack of adequate statistical power.

Table 4: MRI compared to MExT (in participant numbers)

MRI MExT

+ve scan findings -ve scan findings +ve test 23 3

-ve test

2 2

Sensitivity (95% CI) 92% (72-99%) Specificity (95% CI) 40% (07-82% + LR (95% CI) 1.53 (0.74-3.16) - LR (95% CI) 0.2 (0.003-1.36) +ve = one or both components were provocative of symptoms; -ve = neither

component provoked symptoms; CI = confidence intervals; LR = likelihood ratios

Table 5: Analysis of time to onset of symptoms (in seconds), and correlation with

stenosis severity

Stenosis grade (N)*

Mean time to symptom onset when positive on component 1 for central stenosis

Mean time to symptom onset when positive on component 2 for central stenosis

Mean time to symptom onset when positive on component 1 for foraminal stenosis

Mean time to symptom onset when positive on component 2 for foraminal stenosis

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

13

Mild (1)

05.41 S.D. *N/A

16.69 S.D. *N/A

18.24 S.D. *N/A

28.03 S.D. *N/A

Moderate (11)

19.59 S.D. 15.29

28.25 S.D. 21.24

16.14 SD 18.72

15.89 SD 10.50

Severe (13)

05.07 SD 3.31

13.78 SD 13.32

14.63 SD 13.03

18.13 SD 25.51

* five patients had a mixed picture moderate in the spinal canal, but severe in the lateral canal or vice versa

Table 6: Relationship between site of stenosis and MExt test (in participant

numbers)

Central stenosis

Foraminal stenosis

Central and foraminal stenosis

No stenosis

MExT extension positive / lateral flexion negative

1 1

MExt extension negative / lateral flexion positive

1 1 1 1

MExT extension / lateral flexion positive

6 8 4 2

MExT extension / lateral flexion negative

1 1 2

4. DISCUSSION

To the authors' knowledge, the current investigation is the first to propose and

evaluate the validity of the MExT, a bi-component movement test, postulated to

support the diagnosis of ALSS through physical examination of individuals who

were 50 years or older, and had unilateral or bilateral symptoms in the lower

extremity radiating distal to the gluteal fold. The current evaluation observed the

MExT to demonstrate high sensitivity (92%) with a clinically significant negative

likelihood ratio (-LR 0.2). In accordance with negative post-test probabilities, the

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

14

likelihood of a subject within our cohort having ALSS reduced from 83 % to 49 %

when a negative result was observed. Such findings would support the

prospective validity of the MExT to assist in ruling-out a diagnosis when a negative

result is observed. In relation to the test’s proficiency to assist in ruling-in a

diagnosis, the converse appeared true. Specificity point estimates were low (40%),

and positive likelihood ratios were only small, (+LR 1.53). Neither value achieved

the desired level for clinical acceptability, (>80% or > 2.0 respectively) with only a

five per cent increase in subject post-test probability when a positive result was

observed.

Analysis of whether a relationship existed between stenosis grade and symptom

onset following MExT commencement failed to demonstrate a significant

correlation for either central or foraminal elements. Correlations were statistically

non-significant irrespective of whether stenotic types were stratified by the

theoretical preferential component or calculated in relation to the component of

quickest onset of symptoms. Analysis of alternate MExT components in relation to

stenotic location additionally failed to identify a clear association. Whilst ‘Kemps

sign’ has been endorsed as an important clinical observation for foraminal

stenosis (Lyle et al., 2005; Watanabe et al., 2010), findings from this current study

would alternatively promote the evaluation of symptomatic response to both

extension alone and extension with side-flexion, irrespective of the stenotic

location suspected. As such, inclusion of both movement components would

allow for greater test sensitivity than any one movement alone.

As identified in an updated systematic review of the accuracy of diagnostic tests

(de Schepper et al. 2013) items from the history had the highest diagnostic value:

radiating leg pain worse on standing, absence of pain when sitting, easing of pain

when bending, and a wide-based gait. MRI was the most promising imaging test;

and about 50 clinical tests were also examined. These included abnormal gait,

weakness, reflexes, a combination of signs and symptoms, a questionnaire, pain

drawings, and walking on a treadmill. On the whole clinical tests had either good

levels of sensitivity or good levels of specificity, but not both; similar to the findings

in this study.

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

15

Whilst the current cohort was consistent with other studies of this nature in regards

to age and gender (Katz et al., 1995; Lyle et al., 2005; Konno et al., 2007);

appraisal of post-test validity against previous extension tests varied. Katz et al.

(1995) reported a small positive likelihood ratio (+LR 1.6) on ‘thigh pain with thirty

seconds extension’; however, point estimates were notably different to the current

study with reduced sensitivity (51%), and greater specificity (69%). The increase in

test sensitivity may have related to longer test duration. Subjects in the MExT

maintained the extended posture for up to thirty seconds longer than previous

studies. Whilst test modifications may assist identification of ALSS with greater

frequency than traditional extensions tests, such alterations may additionally

account for the noted drop in specificity.

In relation to alternative clinical diagnostic tools the MExT observed a comparative

capability to rule out ALSS diagnosis. Sensitivity estimates and negative

likelihood ratios were similar to those of the prediction-rule grounded, ‘ALSS

diagnostic support tool’ (Konno et al., 2007). However, in relation to overall

properties the MExT appeared subordinate, failing to achieve the same level for

specificity (72 %) or positive likelihood ratios (+LR 3.31). Likewise greater

discriminative capability to rule-in a diagnosis was observed during a two-stage

treadmill test with all positive likelihood ratios exceeding those of the MExT

(+LR >2.5) (Fritz et al., 1997).

4.1 Limitations

The findings of this study should be considered in relation to a number of study

limitations. Through recruitment of only a small sample size (n = 30), the

evaluation had an increased likelihood of type II error. The small sample size and

lack of statistical power may have had the effect of producing non-significant

results. Confidence intervals around the primary data appeared wide, and hence,

estimations of test validity must be interpreted with caution. Prevalence of ALSS

within the cohort was high as a result of the inclusion criterion employed.

Specificity estimates may have suffered due to inadequate evaluation against

subjects with an alternative lumbar, neurogenic or vascular pathology.

Recruitment methods through secondary care may have inadvertently biased the

sample towards more advanced pathology. A large proportion of the cohort was

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

16

observed to exhibit moderate to severe ALSS. As such, it was not possible to fully

evaluate the MExT against more mild variants limiting its external validity to

primary care settings. Furthermore in such settings there would be a higher

degree of diagnostic uncertainty, but in such a context tests could have the

greatest ability to influence clinical decision making if they perform well. Whilst

MRI was selected as the reference standard due to its objectivity, recognised

discriminative capability and consistent clinical application, it must be

acknowledged that inconsistency between clinical symptoms and degree of

radiographic change exists. Another limitation was the lack of assessment of the

reliability between testers on decision making regarding whether there was a

positive or negative response to the tests.

4.2 Clinical and research implications

In relation to clinical practice it is not currently possible to support or refute the

MExT’s diagnostic validity due to the limitations discussed. Whilst preliminary

estimates suggest the MExT may demonstrate part criterion validity, further

evaluation is required. As such, the current evaluation should act as a feasibility

study upon which to base further investigation into extension test validity. Future

studies should aim to reproduce test procedures within larger cohorts across a

range of primary and secondary care settings. Replication of current findings

would support the value of the MExT in ruling out a diagnosis of ALSS, and hence,

would form a valuable part of the physical examination process.

Until such validation has been completed, clinician’s should continue to evaluate

all available subjective and objective findings in order to support or refute a

potential diagnosis. Moreover, in clinical practice clinician’s should continue to

use subjective report of radicular symptoms and the impact upon the individual’s

functionality as the pre-requisite for onward referral.

5. CONCLUSION

Preliminary examination of a modified extension test’s validity to diagnose lumbar

spinal stenosis demonstrated conflicting results. Whilst the test demonstrated a

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

17

clinically significant capability to rule-out diagnosis, concurrent capability to rule-in

diagnosis was unacceptable. All aspects of the test were not capable of

determining the severity of stenosis or of differentiating between central and

foraminal types. Supplementary research on larger cohorts is required to fully

evaluate test validity.

Word count 4,231 including figures

References

Aalto T, Malmivaara, A, Kovacs F, Herno A, Alen M, Salmi L, et al. Preoperative predictors for postoperative clinical outcome in lumbar spinal stenosis: systematic review. Spine 2006;31(18):E648-E663. Akobeng A. Understanding diagnostic tests1: sensitivity, specificity and predictive values. Acta Paediatrica 2007;96(3):338-341. Amundsen, T., Weber, H., Nordal, H., Magnaes, B., Abdelnoor, M., Lilleas, F. Lumbar spinal stenosis: conservative or surgical management: A prospective 10-year study. Spine 2000;25 (11):1424-1436. Bal S, Celiker R, Palaoglu S, Cila A. F wave studies of neurogenic intermittent claudication in lumbar spinal stenosis. Am J Phys Med Rehab 2006;85 (2):35–40. Bassewitz H, Herkowitz H. Lumbar stenosis with spondylolithesis: current concepts of surgical treatment. Clin Orth Rel Res 2001;384:54-60. Bland M. An Introduction to Medical Statistics (3rd edition). Oxford: Oxford University Press, 2000. Browne J, Roberts L. Extended scope practitioners management of lumbar spinal stenosis. J Bone Joint Surg 90(B);Supp III:488. Bussieres A, Taylor J, Peterson C. Diagnostic imaging practice guidelines for musculoskeletal complaints in adults-an evidence-based approach-part 3: spinal disorders. J Manip Physio Thera 2008;31(1):33-38. Campo M, Shiyko M, Lichtman S. Sensitivity and specificity: A review of related statistics and controversies in the context of physical therapist education. J Phys Thera Educ 2008;24(3):69-89. Deen H, Zimmerman R, Lyons M, McPhee M, Verheijde J, Lemens S. Test-retest reproducibility of the exercise treadmill examination in lumbar spinal stenosis. 2000;75 (10):1002-1007. De Graaf, Prak I, Bierma-Zeinstra S, Thomas S, Peul B, Koes B. Diagnosis of lumbar spinal stenosis: A systematic review of the accuracy of diagnostic tests. 2006;31(10):1168-1176.

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

18

De Schepper EIT, Overdevest GM, Suri P, Peul WC, Oei EHG, Koes BW, Bierma-Zeinstra SMA, Luijsterburg PAJ. Diagnosis of lumbar spinal stenosis. An updated systematic review of the accuracy of diagnostic tests. Spine 2013;38:E469-E481. Deyo R, Mirza S, Turner S. Overtreating chronic back pain: time to back off?’ J Board Fam Med 2009;22:62-68. Dong G, Porter RW. Walking and cycling tests in neurogenic and intermittent claudication. Spine 1989;14(9):965–969. Eguchi Y, Ohtori S, Yamashita M, Yamauchi K, Susuki M, Orita S, et al. Diagnosis of lumbar foraminal stenosis. Chiba Med J 2010;86 (2):43. Fredrickson M, Lee S, Welsh J. Changes in posterior disc bulging and intervertebral foraminal size associated with flexion-extension movement: a comparison between L4-5 and L5-S1 levels. Spine 2001;26(1):10-17. Fritz J, Erhard R, Delitto A, Welch W, Nowakowski PO. Preliminary results of the use of a two-stage treadmill test as a clinical diagnostic tool in the differential diagnosis of lumbar spinal stenosis. J Spinal Dis 1997;10:410-416. Fritz J, Delitto A, Welch W, Erhard E. Lumbar spinal stenosis: A review current concepts in evaluation, management, and outcome measurements. Arch Phys Med Rehab 1998;79:700-708. Ganz J. Lumbar spinal stenosis: postoperative results in terms of preoperative posture-related pain. J Neurosurg 1990;72 (1):71-74. Genevay, S., Atlas, S. Lumbar spinal stenosis. Best Pract Res Clin Rheum 2010;24:253-265. Genevay S, Atlas S, Katz J. Variation in eligibility criteria from studies of radiculopathy due to a herniated disc and of neurogenic claudication due to lumbar spinal stenosis. Spine 2010;35 (7):803-811. Goh K, Khalifa W, Anslow P, Cadoux T, Donaghy M. The clinical syndrome associated with lumbar spinal stenosis. Euro Neurol 2004;52(4):242–249. Haig A, Tong H, Yamakawa K, Parres C, Quint D, Chiodo A et al. Predictors of pain and function in persons with spinal stenosis, low back pain, and no back pain. Spine 2006;31(25):2950–2957. Haig A, Tomkins C. Diagnosis and management of lumbar spinal stenosis. J Am Med Assoc 2010;303(1):71-72. IAGG. International Association of Gerontology and Geriatrics. Research agenda on ageing for the 21st century. 2007. Available at http://www.iagg.com.br/webforms/iaggNewsletter.aspx Inufusa A, An H, Lim T, Hasegawa T, Haughton V, Nowicki B. Anatomic changes of the spinal canal and intervertebral foramen associated with flexion-extension movement. Spine 1996;21(21):2412–2420. Jenis L, Howard S. Spine update: Lumbar foraminal stenosis. Spine 2000;25(3):389-394. Johnson O. Information theory and central limit theorem. Imperial College Press, 2004. Katz J, Dalgas M, Stucki G. Degenerative lumbar spinal stenosis. Diagnostic value of the history and physical examination. Arth Rheum 1995;38:1236-1241.

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

19

Keller T, Szpalski M, Gunzburg R, Spratt K. Assessment of trunk function in single and multi-level spinal stenosis: a prospective clinical trial. Clin Biomech 2003;18:173-180. Kim H, Chun H, Han C, Moon S, Kang K, Kim H, et al. The risk assessment of a fall in patients with lumbar spinal stenosis. Spine 2011;36(9):E588-E592. Kinnear P, Gray C. SPSS 15 Made Simple. New York Psychology Press, 2008. Kishner S, Gunduz O, Munshi S, Gupta S, Goyeneche N. Electrodiagnosis in lumbar spinal stenosis: A review. Turkish J Phys Med Rehab 2010;56:75-80. Kranzler J. Statistics for the Terrified (4th edition). Pearson Education, London, 1997. Konno S, Hayashino Y, Fukuhara S, Kikuchi S, Kaneda K, Chiba K, et al. Development of a clinical diagnosis support tool to identify patients with lumbar spinal stenosis. Eur Spine J 1997;16(11):1951-1957. Lurie D. What diagnostic tests are useful for low back pain? Best Pract Res Clin Rheum 2005;19(4):557-575. Lurie J, Tosteson A, Tosteson T, Carragee E, Carrino J, Kaiser J,et al. Reliability of readings of magnetic resonance imaging features of lumbar spinal stenosis. Spine 2008;33(14):1605-1610. Lyle M, Manes S, Mcguiness M, Ziaei S, Iversen M. Relationship of physical exam findings and self-reported symptom severity & physical function in patients with degenerative lumbar conditions. Phys Ther 2005;85(2):120-133. Malmivaara A, Slatis P, Heliavaara M, Sainio P, Hirvonen N, Kankare J. Surgical or non-operative treatment for lumbar spinal stenosis? A randomized controlled trial. Spine 2007;32(1):1-8. Maloney-Backstrom K, Whitman J, Flynn T. Lumbar spinal stenosis – diagnosis and management of the aging spine. Man Ther 2011;16:308-317. Mannion A, Denzler R, Dvorak J, Grob D. Five-year outcome of surgical decompression of the lumbar spine without fusion. Eur Spine J 2010;19(11):1883-1891. McGee S. Simplifying likelihood ratios. J Gen Int Med 2010;17(8):647-650. Middleton K, Fish D. Lumbar spondylosis: clinical presentation and treatment approaches. Curr Rev Musc Med 2009;2(2):94-104. Moon E, Kim H, Park J, Shin D, Ha J, Shim D, et al. Comparisons of the predictive value of myelography, Computer tomography and MRI on the treadmill test in lumbar spinal stenosis. Yonsei Med J 2005;46:806-811. Ogikubo O, Forsberg L, Hansson T. The relationship between the cross-sectional area of the cauda equine and the preoperative symptoms in central lumbar spinal stenosis. Spine 2007;32(13):1423-1428. Papadakis M, Sapkas G, Papadopoulos E, Katonis P. Pathophysiology and biomechanics of the aging spine. Open Ortho J 2011;5:335-342. Panjabi M, Takata K, Goel V. Kinematics of the lumbar intervertebral foramen. Spine 1983;8:348-357. Petty NJ. Neuromusculoskeletal Examination and Assessment. A Handbook for Therapists (3rd Edition). Elsevier, Churchill Livingstone, Edinburgh, 2006.

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

20

Pewsner D, Battaglia M, Minder C, Marx A, Bucher H, Egger M. Ruling a diagnosis in or out with SpPin and SnNout a note of caution. Br Med J 2004;329:209-213. Population Reference Bureau http://www.pssrb.org/publications/graphicsbank/PopulationTrends.aspx;2010. Porter R. Spinal stenosis and neurogenic claudication. Spine 1996;21(17):2046–2052. Riegelman R. Studying a study and testing a test: how to read the medical evidence. Lippincot-Williams and Wilkins, 2005 (5th Edition). Rigby A, Summerton N. Statistical methods in epidemiology. VIII. On the use of likelihood ratios for diagnostic testing with an application to general practice. Dis Rehab 2005;27(9):475-480. Schronstrom N, Willen J. Imaging lumbar spinal stenosis. Radiolog Clin Nth Am 2011;39:31-53. Slatis P, Malmivaara A, Heliovaara M, Saino P, Seitsalo S, Hurri H. Surgical or nonoperative treatment for lumbar spinal stenosis? A six-year follow-up of a randomized controlled trial. J Bone Joint Surg 2007;91B: Supp II, 290-281. Streiner D, Norman G. Health Measurement Scales (3rd edition). Oxford University Press, Oxford, 2003. Steurer J, Nydegger A, Held U, Brunner F, Hodler J, Porchet F et al. Lumbar spinal stenosis outcome study.’ BMC Musculoskeletal disorders. 2010;11:254. Sugioka T, Hayashino Y, Konno S, Kikuchi S, Fukuhara S. Predictive value of self-reported patient information for the identification of lumbar spinal stenosis. Family Pract 2008;25(4):237-244. Tenhula J, Lenke L, Bridwell K, Gupta P, Riew, D. Prospective functional evaluation of the surgical treatment of neurogenic claudication in patients with lumbar spinal stenosis. J Spine Dis 2000;13(4):276-282. Vo A, Kamen L, Shih V, Bitar A, Stitik T, Kaplan R. Rehabilitation of orthopaedic and rheumatologic disorders, 5. Lumbar spinal stenosis. Arch Phys Med Rehab 2005;86(3): Supp 1:69-76. Wang Y, Jeng C, Wu C, Chang H, Huang Y, Wang Y, et al. Dynamic effects of axial loading on the lumbar spine during MRI in patients with suspected spinal stenosis. J Formosan Med Assoc 2008;107(4):334-339. Watanabe K, Yamazaki A, Morita O, Sano A, Katsumi K, Ohashi M. Clinical outcomes of posterior lumbar interbody fusion for lumbar foraminal stenosis: preoperative diagnosis and surgical strategy. Indian J Orth 2014;48:107-10. Watters W, Baisden J, Gilbert T, Kreiner S, Resnik D, Bono C, et al. Degenerative lumbar spinal stenosis: an evidence-based clinical guideline for the diagnosis and treatment of degenerative lumbar spinal stenosis. Spine J 2008;8:305-310. Westergaard L, Hauerberg J, Springborg J. Outcome after surgical treatment for lumbar spinal stenosis: the lumbar extension test is not a predictive factor. Spine 2009;34(25):E930-E935.

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

21

Whiting P, Rutjes A, Dinnes J, Reitsma J, Bossuyt P, Kleijnen, J. Development and validation of methods for assessing the quality of diagnostic accuracy studies. Health Technology Assessment 2004;8:25. Yukawa Y, Lenke L, Tenhula J, Bridwell K, Riew D, Blanke K. A comprehensive study of patients with surgically treated lumbar spinal stenosis with neurogenic claudication. J Bone Joint Surg 2002;84(A):1954-1959.

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

Acknowledgements

The authors would like to thank Pieter Meiring and other staff at Chesterfield Royal

Hospital for facilitating the study, and the first author’s family for on-going

encouragement throughout the process.

Conflict of interest

The authors declare no conflict of interest.

Funding

No funding was received for this study.

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

Figure 1. STARD Flow Diagram

Excluded patients Reasons n= 1 - declined MRI

MeXT test n= 30

Positive MeXT n= 26

Negative MeXT n= 4

Inconclusive result n= 0

No MRI n= 0

No MRI n= 0

No MRI n= 0

MRI n= 4

MRI n= 26

MRI n= 0

Inconclusive n= 0

Inconclusive n= 0

Inconclusive n= 0

ALSS Present n= 23

ALSS absent n= 3

ALSS Present

n= 2

ALSS absent n= 2

ALSS Present

n= 0

ALSS absent n= 0

Eligible patients n= 31

Potentially eligible patients n= 276

Exclusion reason (N = 245):

Upper quadrant symptoms (109) ≤ 50 years (60)

No referred symptoms (76) No informed consent (0)

Peripheral vascular disease (0) Ankylosing Spondylitis (0)

MAN

USCR

IPT

ACCE

PTED

ACCEPTED MANUSCRIPT

Highlights

• Spinal stenosis is likely to be more commonly encountered in clinical practice. • The validity of a simple clinical test was compared to diagnosis by MRI. • The test had a reasonable sensitivity of 92%, but a poor specificity of 40%. • The test may be useful to rule out a diagnosis with a negative test result.

the validity of a clinical test for the diagnosis of lumbar spinal ......lumbar spinal stenosis,...

Documents