uva-dare (digital academic repository) systematic reviews ... › ws › files › 1652292 ›...

UvA-DARE is a service provided by the library of the University of Amsterdam (http://dare.uva.nl)

UvA-DARE (Digital Academic Repository)

Systematic reviews of imaging gynecological and gastrointestinal malignancies for developingevidence-based guidelines

Bipat, S.

Link to publication

Citation for published version (APA):Bipat, S. (2007). Systematic reviews of imaging gynecological and gastrointestinal malignancies for developingevidence-based guidelines. Amsterdam: Shandra Bipat.

General rightsIt is not permitted to download or to forward/distribute the text or part of it without the consent of the author(s) and/or copyright holder(s),other than for strictly personal, individual use, unless the work is under an open content license (like Creative Commons).

Disclaimer/Complaints regulationsIf you believe that digital publication of certain material infringes any of your rights or (privacy) interests, please let the Library know, statingyour reasons. In case of a legitimate complaint, the Library will make the material inaccessible and/or remove it from the website. Please Askthe Library: https://uba.uva.nl/en/contact, or a letter to: Library of the University of Amsterdam, Secretariat, Singel 425, 1012 WP Amsterdam,The Netherlands. You will be contacted as soon as possible.

Download date: 13 Jul 2020

Systematic reviews of imaging gynecological

and gastrointestinal malignancies for

developing evidence-based guidelines

� ��

Thesis: Systematic reviews of imaging gynecological and gastrointestinal malignancies for developing evidence-based guidelinesCopyright: 2007, Shandra Bipat, Amsterdam, The Netherlands

This thesis was prepared at the Department of Radiology, Academic Medical Center, University of Amsterdam, The Netherlands. Part of the research was financially supported by the Dutch Order of Medical Specialists.

Cover by: Shandra Bipat and Jaap StokerLayout: Chris Bor, Medical Photography and Illustration, Academic Medical Center, Amsterdam and Buijten & Schipperheijn, Amsterdam. Printed by: Buijten & Schipperheijn, Amsterdam, The NetherlandsISBN: 978-90-9021715-4

� ��

Systematic reviews of imaging gynecological

and gastrointestinal malignancies for

developing evidence-based guidelines

ACADEMISCH PROEFSCHRIFT

ter verkrijging van de graad van doctor

aan de Universiteit van Amsterdam

op gezag van de Rector Magnificus

prof.dr. J.W. Zwemmer

ten overstaan van een door het college voor promoties ingestelde

commissie, in het openbaar te verdedigen in de Aula der Universiteit

op vrijdag 30 maart 2007, te 10:00 uur

Shandra Bipat

geboren te Nickerie, Suriname

� ��

Promotiecommissie

Promotoren: Prof. dr. J.Stoker

Prof. dr. P.M.M. Bossuyt

Co-promotor: Prof. dr. A.H. Zwinderman

Overige leden: Prof. dr. J.S. Laméris

Prof. dr. M.P.M. Burger

Prof. dr. D.J. Gouma

Prof. dr. M.G.M. Hunink

Prof. dr. W.P.Th. M. Mali

Prof. dr. H.C.W. de Vet

Faculteit der Geneeskunde

� ��

Table of ConTenTs

Chapter 1 General introduction and Outline of the thesis 6

Chapter 2 Is there a role for magnetic resonance imaging in the evaluation of 14

inguinal lymph node metastases in patients with vulva carcinoma?

Chapter 3 Computed tomography and magnetic resonance imaging in staging of 28

uterine cervical carcinoma: a systematic review

Chapter 4 Rectal cancer: local staging and assessment of lymph node involvement 42

with endoluminal US, CT, and MR imaging: a meta-analysis

Chapter 5 Ultrasonography, computed tomography and magnetic resonance imaging 68

for diagnosis and determining resectability of pancreatic adenocarcinoma:

a meta-analysis

Chapter 6 Imaging and treatment of patients with colorectal liver metastases in the 86

Netherlands: a survey

Chapter 7 Colorectal liver metastases: CT, MR imaging, and PET for diagnosis. 98

Meta-analysis

Chapter 8 Evidence-based guideline on management of colorectal liver metastases 120

in the Netherlands

Chapter 9 Multivariate random-effects approach: for meta-analysis of cancer staging 142

studies

Summary and Conclusions 166

Samenvatting en Conclusies 176

List of Publications 188

Dankwoord en Curriculum Vitae 192

1C h a p t e r

General introduction and Outline of the thesis

Chapter 1

GeneRal InTRoDUCTIon

Radiological modalities are playing a major role in the management of patients with abdominal

and pelvic malignancies. Improvements in radiological techniques during the last fifteen years, such

a launching of spiral computed tomography, magnetic resonance imaging, and contrast agents,

have led to an increasing use of these modalities for accurate diagnosis and staging of pelvic and

abdominal malignancies [1-6]. For several areas in diagnosis and staging of abdominal and pelvic

malignancies, a large body of scientific evidence is available. The findings in all these studies are not

always univocal, which may account for some of the substantial practice variation in the use of these

techniques.

Clinical guidelines are systematically developed statements to support decision-making. They

can reduce variability in practice, improve the process and outcomes of health care, and optimize

resource utilization. For the development of guidelines for diagnosis and staging, the evidence-

based approach has been put forward, which relies on a combination of the best available evidence

with clinical expertise [7-10]. The more recent development of methods for performing systematic

reviews of diagnostic accuracy studies has further increased the ability to synthesize the available

evidence.

All systematic reviews involve a number of key stages that set them apart from the more tradi-

tional narrative reviews: they start from a well defined clinical question, they rely on comprehensive

literature searching to identify all potentially relevant studies, use explicit selection criteria to include

eligible studies, assess the methodological quality of included studies, explore heterogeneity and its

likely sources and, if possible, to synthesize study results, and arrive at valid and precise summary

estimates (meta-analysis) [11-14].

Meta-analysis is a set of statistical techniques to quantitatively summarize the results obtained at a

systematic review [15-17]. The benefits of meta-analysis over individual trials include increased preci-

sion and statistical power and the ability to identify and explore heterogeneity in the results from

individual studies. The results of well-conducted systematic reviews and meta-analysis can provide

guideline developers and decision makers the highest quality of scientific information.

The overall goal of a systematic review is to combine results of previous studies to arrive at sum-

mary estimates for the relevant study question. In radiology, systematic reviews and meta-analyses

are most often used to estimate the summary diagnostic accuracy of an imaging test, such as

ultrasonography, computed tomography or magnetic resonance imaging [18-25].

Another possible aim of a systematic review can be the identification and explanation of inconsis-

tencies in the results of primary studies [26]. Systematic reviews can help practitioners keep abreast

of the medical literature by summarizing large bodies of evidence and helping to explain differences

among studies on the same question. Systematic reviews can also help in identifying gaps in the

evidence and may provide a quantitative basis for new research initiatives [27].

Systematic reviews are usually time consuming and can therefore be expensive. The use of meta-

analyses to synthesize the evidence from randomized controlled trials is growing in popularity and

the methodology has well advanced. In contrast, the statistical methodology for meta-analyses of

diagnostic accuracy studies is still developing but important steps have been made [28-31].

General introduction and outline of the thesis

oUTlIne of THe THesIs

Diagnostic accuracy studies express the level of agreement between the results of a test and the

results of the reference standard. For several pelvic and abdominal malignancies, including rectal

cancer, colorectal liver metastases, pancreatic cancer, uterine cervical cancer, a substantial body of

evidence is available. It may therefore be very relevant to summarize and explore this in a systematic

way, and when feasible, to perform a meta-analysis.

For some clinical questions, the evidence is more limited, to the extent that performing systematic

reviews or meta-analysis will not be helpful for developing clinical guidelines. In these cases, appropri-

ate diagnostic studies of sufficient methodological quality can be helpful in providing the evidence to

support decision making.

This thesis summarizes the diagnostic accuracies of several imaging modalities, such as ultraso-

nography (US), computed tomography (CT), magnetic resonance imaging (MRI), and positron emis-

sion tomography with 18-fluorodeoxyglucose (FDG-PET) in the evaluation of rectal cancer, colorectal

liver metastases, pancreatic cancer, uterine cervical cancer and vulva carcinoma either by means of

systematic reviews or by performing an appropriate diagnostic study. In addition, several parts are

used to develop guideline and this is also presented in this thesis.

Chapter 2 of this thesis focuses on the impact of MRI for lymph node detection in patients with

vulva carcinoma. The goal of the study was to retrospectively determine the accuracy of MRI for

lymph node detection in patients with vulva carcinoma by comparing the MRI findings with histo-

pathological findings obtained by sentinel node procedure or surgery.

In chapter 3 we systematically review the available evidence on the diagnostic accuracy of CT

and MRI in staging uterine cervical carcinoma. The aim of this study was to obtain precise and valid

estimates of the diagnostic performance of CT and MRI in the evaluation of parametrial invasion,

bladder and rectum invasion, and lymph node involvement.

Chapter 4 reports results of a meta-analysis comparing endoluminal US (EUS), CT, and MRI in

local staging (T-staging) and assessment of lymph node involvement in patients with rectal cancer.

The aim of this study was to obtain summary estimates of the accuracy of EUS, CT, and MRI for

correct identification of the T-stages and malignant lymph nodes.

In chapter 5 the results of a meta-analysis comparing US, CT, and MRI for the diagnosis and for

determining resectability of pancreatic adenocarcinoma are reported. Our aim was to obtain sum-

mary estimates of the accuracy of conventional CT, helical CT, MRI, and US.

In chapter 6 we summarize the findings of a survey performed on the management of patients

with colorectal liver metastases in the Netherlands. The survey documented the extent of variation

in the diagnosis and treatment strategies. A second aim was to obtain relevant background informa-

tion for developing and implementing evidence-based guidelines.

In chapter 7 the results of a meta-analysis on the diagnostic accuracy of CT, MRI, and FDG-PET for

the detection of colorectal liver metastases are summarized. In the systematic review we collected

studies on the diagnostic accuracy of CT, MRI, and FDG-PET for the detection of colorectal liver

metastases on a per-patient and on a per-lesion basis.

Chapter 1

A large section of the results summarized in this chapter was used to develop a Dutch evidence-

based guideline for diagnosis and treatment of patients with colorectal liver metastases. This evi-

dence-based guideline is summarized in chapter 8.

The clinical care of cancer patients relies not only on the differentiation between disease and

non-disease but also on correct staging, understaging and overstaging by imaging modalities. In the

meta-analytic approaches used in chapters 3, 4, 5, and 7, we needed thresholds to construct 2 x 2

tables for the calculation of sensitivity, specificity or the diagnostic odds ratio. By defining the thresh-

olds, inevitably information was lost. In chapter 9 an alternative approach is presented to deal with

this type of data sets. The aim of this approach is to report results on correct staging, understaging

and overstaging of tumor.

Finally, the results of this thesis are summarized and implications are made.

General introduction and outline of the thesis

��

References

1. Chang JM, Lee JM, Lee MW, et al. Superparamagnetic iron oxide-enhanced liver magnetic resonance im-aging: comparison of 1.5 T and 3.0 T imaging for detection of focal malignant liver lesions. Invest Radiol 2006;41:168-174.

2. Choi HJ, Roh JW, Seo SS, et al. Comparison of the accuracy of magnetic resonance imaging and positron emis-sion tomography/computed tomography in the presurgical detection of lymph node metastases in patients with uterine cervical carcinoma: a prospective study. Cancer 2006;106:914-922.

3. Gearhart SL, Frassica D, Rosen R, Choti M, Schulick R, Wahl R. Improved staging with pretreatment positron emission tomography/computed tomography in low rectal cancer. Ann Surg Oncol 2006;13:397-404.

4. Imbriaco M, Smeraldo D, Liuzzi R, et al. Multislice CT with single-phase technique in patients with suspected pancreatic cancer. Radiol Med (Torino) 2006;111:159-166

5. Karabulut N, Elmas N. Contrast agents used in MR imaging of the liver. Diagn Interv Radiol 2006;12:22-30

6. Tatli S, Mortele KJ, Breen EL, Bleday R, Silverman SG. Local staging of rectal cancer using combined pelvic phased-array and endorectal coil MRI. J Magn Reson Imaging 2006; 23:534-540

7. McArthur J. The systematic review: an essential element of an evidence based approach to nursing. Nurs Prax N Z 1997;12:10-15.

8. Craig JC, Irwig LM, Stockler MR. Evidence-based medicine: useful tools for decision making. Med J Aust 2001;174:248-253.

9. Manser R, Walters EH. What is evidence-based medicine and the role of the systematic review: the revolution coming your way. Monaldi Arch Chest Dis 2001;56:33-38.

10. Hess DR. What is evidence-based medicine and why should I care? Respir Care 2004;49:730-741.

11. Deville WL, Buntinx F, van der Windt DA, et al. Didactic Guidelines for conducting systematic reviews of stud-ies evaluating the accuracy of diagnostic tests. 2001. Knottnerus JA, editor. The evidence Base of diagnosis. London: BMJ Publishing Group.

12. Deville WL, Buntinx F, Bouter LM, et al. Conducting systematic reviews of diagnostic studies: didactic guide-lines. BMC Med Res Methodol 2002;2:9.

13. Khan KS, Dinnes J, Kleijnen J. Systematic reviews to evaluate diagnostic tests. Eur J Obstet Gynecol Reprod Biol 2001;95:6-11.

14. Khan KS. Systematic reviews of diagnostic tests: a guide to methods and application. Best Pract Res Clin Obstet Gynaecol 2005;19:37-46.

15. Acton GJ. Meta-analysis: a tool for evidence-based practice. AACN Clin Issues 2001; 12:539-545.

16. Gallagher EJ. Systematic reviews: a logical methodological extension of evidence- based medicine. Acad Emerg Med 1999;6:1255-1260.

17. Imperiale TF. Meta-analysis: when and how. Hepatology 1999;29:26S-31S.

18. Bipat S, Glas AS, Slors FJ, Zwinderman AH, Bossuyt PM, Stoker J. Rectal cancer: local staging and assess-ment of lymph node involvement with endoluminal US, CT, and MR imaging--a meta-analysis. Radiology 2004;232:773-783.

19. Koelemay MJW, Nederkoorn PJ, Reitsma JB, Majoie CB. Systematic review of computed tomographic angiog-raphy for assessment of carotid artery disease. Stroke 2004;35:2306-2312.

20. Terasawa T, Blackmore CC, Bent S, Kohlwes RJ. Systematic review: computed tomography and ultrasonogra-phy to detect acute appendicitis in adults and adolescents. Ann Intern Med 2004;141:537-546.

21. Visser K, Hunink MG. Peripheral arterial disease: gadolinium-enhanced MR angiography versus color-guided duplex US--a meta-analysis. Radiology 2000;216:67-77.

22. Oei EH, Nikken JJ, Verstijnen AC, Ginai AZ, Myriam Hunink MG. MR imaging of the menisci and cruciate liga-ments: a systematic review. Radiology 2003;226:837-848.

23. Midgette AS, Stukel TA, Littenberg B. A meta-analytic method for summarizing diagnostic test performances: receiver-operating-characteristic-summary point estimates. Med Decis Making 1993;13:253-257.

Chapter 1

��

24. Nelemans PJ, Leiner T, de Vet HC, van Engelshoven JM. Peripheral arterial disease: meta-analysis of the diag-nostic performance of MR angiography. Radiology 2000;217:105-114

25. Fraquelli M, Colli A, Casazza G, et al. Role of US in detection of Crohn disease: meta-analysis. Radiology 2005; 236:95-101.

26. Nelson EA. The value of systematic reviews in research. Prof Nurse 1998;14:24-28

27. Cooper NJ, Jones DR, Sutton AJ. The use of systematic reviews when designing studies. Clin Trials 2005;2:260-264.

28. Gatsonis CA. Random-effects models for diagnostic accuracy data. Acad Radiol 1995; 2 Suppl 1:S14-S21

29. Moses LE, Shapiro D, Littenberg B. Combining independent studies of a diagnostic test into a summary ROC curve: data-analytic approaches and some additional considerations. Stat Med 1993;12:1293-1316.

30. Reitsma JB, Glas AS, Rutjes AW, Scholten RJ, Bossuyt PM, Zwinderman AH. Bivariate analysis of sensitivity and specificity produces informative summary measures in diagnostic reviews. J Clin Epidemiol 2005;58:982-990.

31. Walter SD. Properties of the summary receiver operating characteristic (SROC) curve for diagnostic test data. Stat Med 2002;21:1237-1256

��

2C h a p t e r

Is there a role for magnetic resonance imaging in the evaluation of inguinal lymph node metastases in patients with vulva carcinoma?

Shandra BipatGerwin A. FransenAnje M. SpijkerboerJacobus van der VeldenPatrick M. M. BossuytAeilko H. ZwindermanJaap Stoker

Gynecologic Oncology 2006;103:1001-1006

Chapter 2

absTRaCT

Objective: To study the accuracy of magnetic resonance imaging (MRI) in lymph node detection in

patients with vulva carcinoma.

Methods: Sixty patients with diagnosed vulva carcinoma underwent MRI examination for preop-

erative evaluation of lymph nodes. MR images were read independently and retrospectively by

two radiologists, both unaware of physical examination and surgery findings. The following char-

acteristics of each lymph node with a short-axis diameter of ≥ 8 mm were recorded: size (axial,

sagittal, and coronal); aspect (homogeneous, with fatty center or partial fat); margin (smooth,

lobulated/speculated or indistinct); shape (round, ovoid or elongated). Based on these charac-

teristics, each lymph node was classified as malignant or benign, and subsequently each groin

was classified as malignant or benign. Histopathology obtained at sentinel node procedure or by

inguinofemoral lymphadenectomy was used as reference standard. Per groin sensitivity, specific-

ity, positive and negative predictive values were calculated. Kappa statistics on per groin basis

were calculated to express interobserver agreement.

Results: Onehundred nineteen groins were examined either by sentinel node procedure or surgery,

of which 23 groins were malignant. Sensitivity, specificity, positive, and negative predictive values

were 52%, 85%, 46%, and 87% for observer 1 and 52% 89%, 52% and 89% for observer 2. The

interobserver agreement was 104/119 (kappa 0.62), representing good agreement.

Conclusion: At this stage there is no role for standard MRI in evaluating lymph node involvement

in patients with vulva carcinoma.

Role of MRI in evaluating lymph nodes in patients with vulva carcinoma

InTRoDUCTIon

Cancer of the vulva accounts for 3 to 5% of primary gynecologic malignancies [1, 2]. For

several decades, the standard treatment has been radical vulvectomy with unilateral or bilateral

inguinofemoral lymphadenectomy through separate incisions. Despite modifications of the surgical

treatment, this operation still results in considerable morbidity. Wound breakdown (17%) and/or

infection (39%) of the groin, lymphatic cyst formation (40%), and lymph edema (28%) all can occur

after standard treatment [3]. Moreover, 10 to 25% of patients in early stage disease have node

involvement and thus between 75% and 90% of patients have unnecessary nodal surgery and the

associated morbidity [4].

Over the last decades investigators tried to identify the patients in which they could predict the

status of the lymph nodes in order to plan groin surgery. Unfortunately it has been proven very

difficult to define such groups. One of the main problems is the lack of the clinical assessment of

nodal disease. In a recent study the negative predictive value of groin palpation was found to be only

77% [5].

The sentinel node procedure is a promising technique that has recently emerged with the poten-

tial to identify those patients with groin node metastases. This technique is becoming increasingly

accepted in the management of cutaneous melanoma and breast cancer [6-9]. Several studies have

shown promising results in vulva cancer [10-12] reporting a negative predictive value close to 100%.

A disadvantage of this technique is that the drainage pathway can sometimes be blocked because of

lymph nodes totally replaced by tumor. In these cases metastatic lymph nodes will not be recognized

[13, 14].

Magnetic Resonance Imaging (MRI) is a non-invasive method, being used more often in the

diagnostic work up of gynecologic malignancy to stage both the primary tumor and regional lymph

nodes [15, 16]. The role of imaging in staging the primary vulva tumor is limited, as the manage-

ment primary depends on the size of the primary tumor which can be accurately assessed clinically.

Patients with tumor less than 4 cm in diameter and clinically non-suspicious groin nodes (cN0) will

undergo radical excision of the tumor in combination with sentinel node procedure. Patients with

tumor larger than 4 cm in diameter and with suspected groin nodes will undergo radical excision

and lymphadenectomy. Only malignant lymph nodes will alter the management to perform either

sentinel node procedure or lymphadenectomy.

In two pilot studies, MRI has been evaluated for lymph node detection in patients with vulva

carcinoma. In a study of Sohaib et al [17], evaluating 22 patients, sensitivity and specificity of respec-

tively 50% and 100% were obtained, while in a study of Hawnaur et al [18], evaluating 10 patients,

sensitivity and specificity of 89% and 91% were obtained.

As lymphatic spread remains an important prognostic factor in patients with vulva carcinoma

and no uniform results are described in the literature, concerning the value of MRI in lymph node

detection, we retrospectively analyzed the accuracy of MRI in lymph node detection in 60 patients

with vulva carcinoma. The results of MRI were compared with the pathological results of the sentinel

node or lymphadenectomy specimen. In addition, we evaluated interobserver variability of MRI for

this purpose.

��

Chapter 2

MaTeRIals anD MeTHoDs

PatientsBetween January 2002 and October 2005, 63 patients were referred in the Academic Medical

Center in Amsterdam for MRI examination for the preoperative evaluation of lymph nodes in pa-

tients with diagnosed vulva carcinoma. All patients were diagnosed as having vulva cancer based

on signs and symptoms, including vulvar pain, itching, bleeding and detection of a palpable mass

at physical examination by gynecological oncologists and by histopathology (squamous carcinoma,

melanoma).

Magnetic Resonance Imaging MRI was performed on a 1.5 T unit (Signa Horizon EchoSpeed, General Electric Medical Systems,

Milwaukee, Wisconsin, USA) using a phased array multicoil. All patients had fasted (4 hr) prior to

examination. To reduce bowel peristalsis, 20 mg butyl scopalaminebromide (Buscopan, Boehringer,

Ingelheim, Germany) was administrated intramuscularly. The following sequences were performed:

(a) Coronal three-dimensional T1-weighted gradient sequence (TR/TE: 11.5/4.2 ms) with 2.8 mm

slice thickness, field of view (FOV) of 35 cm and matrix of 192 x 256; (b) Axial and sagittal fast

spin echo 2500/70 ms T2-weighted with 4 mm section thickness, 4 mm slice gap, 30 cm FOV and

512 x 256 matrix; (c) Coronal fast spin echo 600/10 ms T1-weighted with fat saturation and after

administration of Magnevist® (Schering AG, Berlin, Germany), with 4 mm section thickness, 4 mm

slice gap, 30 cm FOV and 256 x 256 matrix; and (d) For the upper abdomen, an axial Half-Fourier

acquisition single shot turbo SE (HASTE) (TR/TE;964/60 ms) with 7.0 mm slice thickness, FOV of 40

cm and matrix of 256 x 160.

Image analysisMR images were read independently and retrospectively by an experienced abdominal radiologist

(10 year experience CT and MRI of the abdomen) and independently by a radiologist (6 year experi-

ence of which five years as resident). Both were unaware of the findings at previous reading, of the

results of the physical examination (except for the presence of a vulva tumor) and the findings from

surgery and histopathology.

MRI examination quality

The following characteristics of MRI examination were recorded: (a) the quality of MRI examination:

classified as good, moderate or poor (non-diagnostic); (b) presence and type (e.g. movement and

peristalsis) of artifacts; and (c) whether all regions (inguinal right, inguinal left, iliacal right, iliacal left)

could be evaluated. In addition, both observers recorded the review times.

Lymph node assessment

Each observer recorded independently the following characteristics of all lymph nodes with a

minimal short-axis diameter of at least 8 mm depicted on MR images: (a) size: in axial, sagittal,

and coronal plane; (b) localization: lymph nodes at the groin were subdivided into two groups:

��

superficial inguinal lymph nodes and femoral lymph nodes. Superficial inguinal lymph nodes are

located in the superficial fascia parallel to the inguinal ligament and along the terminal part of the

greater saphenous vein. Femoral lymph nodes are located along the medial side of the femoral

vessels (figures 1 and 2); (c) aspect: homogeneous lymph node, lymph node with fatty center or

partial fat in nodes; (d) margin: smooth, lobulated/speculated or indistinct as defined by Kim et

al [19]; (e) shape: round (ratio of long/short-axis diameter of 1); ovoid (ratio of long/short-axis

diameter between 1 and 1.5) or elongated (ratio of long/short-axis diameter larger than 1.5);

and (f) diagnosis: an overall expression based on the above mentioned characteristics was used

to classify each lymph node as malignant or benign.

Reference standardHistopathology was used as the reference standard, either obtained at sentinel node procedure or

at operation (inguinofemoral lymphadenectomy). Patients with tumors less than 4 cm in diameter

and clinically non-suspicious groin nodes (cN0) were included in a sentinel node study after giving

consent [20]. The details of the sentinel lymph node procedure are published elsewhere [10, 21-23].

After removal of the first sentinel lymph node, the groin was re-examined for radioactivity and if

radioactivity was detected at a level greater than 10% of the first excised sentinel lymph node, the

dissection was continued in search of additional sentinel lymph nodes. The removed sentinel lymph

nodes were examined by the pathologist at frozen sections.

Patients with a negative sentinel node had no further surgery of groin nodes. All other patients,

including patients with positive frozen sentinel nodes, were treated by a local radical excision and an

ipsilateral (in case of a unilateral tumor) or bilateral inguinofemoral lymph node dissection. Both the

primary tumor and lymphadenectomy specimens were sent to the pathologist separately for routine

histopathologic examination.

Figure 1. A 78-year old woman with vulva carcinoma and left superficial lymph node metastases.(a) Axial T2-weighted fast spin echo MRI image shows enlarged lymph node with a short-axis of 10 mm node (arrow). (b) Coronal T1-weighted gradient echo image showing the same lymph node (arrow).

Chapter 2

StatisticsQuality assessment

Data on the quality of MRI examination, on the presence of artifacts, and on the evaluation of the

different regions (inguinal right, inguinal left, iliacal right, and iliacal left) were summarized for

each observer. Differences between observers with respect to quality of MRI examinations, pres-

ence of artifacts and non-evaluable regions were analyzed with the McNemar test. Differences in

review times between observers were assessed by Wilcoxon signed ranks test.

Quantity assessment: per groin or per lymph node

Per groin analysis: Each groin was classified as either negative or positive based on MRI data and

separately based on pathology report. Sensitivity, specificity, positive predictive and negative

predictive values per groin were calculated for each observer. For this purpose both superficial

inguinal and femoral lymph nodes were analyzed jointly, due to the low prevalence of malignant

lymph nodes. Kappa statistics on per groin basis to express interobserver agreement were calcu-

lated.

Per lymph node analysis: The number of malignant and benign nodes (superficial inguinal and femo-

ral nodes) depicted on MRI were summarized for each observer. In addition association between the

type of nodes (malignant vs. benign) and characteristics were evaluated by means of Chi-square test

statistic. Short-axis diameters between malignant and benign nodes were compared by independent

Student t-test.

Finally, the number of malignant nodes depicted on histology per groin was also compared

with the number of malignant lymph nodes depicted on MR images in the corresponding groin by

both observers. This was done to obtain an expression of either under or overestimation of MRI.

Figure 2. A 76-year old woman with vulva carcinoma and right femoral lymph node metastases. (a) Axial T2-weighted fast spin echo MRI image shows adjacent to femoral vessels a 14 mm (short-axis) lymph node (arrow). (b) Coronal T1-weighted gradient echo image showing the same lymph node (arrow). The lymph node is hypo-intense at T1 indicating that the hyper-intensity on the T2 image does not represent fat.

��

Data were analyzed using SPSS software (Version 9.0 for Windows, SPSS, Inc, Chicago, IL). P

values of less than 0.05 were considered to indicate statistical significance results.

ResUlTs

PatientsThree of the 63 patients, did not undergo either sentinel node procedure or inguinofemoral lymph-

adenectomy and were excluded form the study; one patient due to cardiac problems, one did

undergo vulvectomy only and the third patient was planned for radiation therapy. The demographic

characteristics of patients are listed in table 1. Fifty-seven patients had squamous cell carcinoma and

the other three had melanoma.

Table 1. Demographic characteristics of patients undergoing MRI

Mean age (range) 69 (36-92)

Clinical stage

Squamous cell carcinomas (57)

T1N0M0

T2N0M0

T3N0M0

T1N1Mx

T2N1Mx

T3N1Mx

Melanoma (3)

Breslow II

Breslow III

Size of tumor (mm) a

≤ 20 mm

20-40 mm

≥ 40 mm

Site of tumor

Median

Unilateral

Bilateral

Final pathological stage

Malignant lymph nodes present

Malignant lymph nodes absent

x: M-stage unknown. a 7 patients had two lesions

��

Chapter 2

Reference standardA sentinel node procedure was performed in 36 patients. Of the 72 groins examined, 61 showed

radioactivity. A total of 108 sentinel lymph nodes were excised (33 groins with 1 node, 15 with 2, 11

with 3, and 2 groins with 6 nodes) as well as 8 non-sentinel nodes. Frozen section showed metastatic

disease in 7 groins in 4 patients. These patients and the remaining 24 patients, in whom no sentinel

procedure was performed, did undergo either bilateral or unilateral resection of inguinal lymph

nodes (27 bilateral and 1 unilateral resection). A total of 119 groins were examined either by sentinel

node procedure or surgery of which 23 groins (16 patients) were positive and 96 were negative.

MRI findings Quality assessment

The results of the quality assessment for both observers are listed in table 2. Both observers

agreed on the quality of MRI examinations in 53 (88%) patients (52 good, 1 moderate); in the

other 7 MRI examinations discrepancies were found. Artifact were reported in 4 and 10 MRI ex-

aminations by observer 1 and observer 2, respectively (P = 0.031) (table 2). Two (2/240) regions

in 2 patients and 21 regions (21/240) in 11 patients were not evaluable according to observer

1 and observer 2 (P < 0.0001) respectively. Observer 2 scored more regions as non-evaluable,

due to incomplete imaging in 10 patients; however no influence on lymph node evaluation was

recorded.

The review time for observer 1 ranged between 4 and 25 minutes (modus: 7 minutes) and for

observer 2 between 5 and 20 minutes (modus 5 minutes), with significantly lower review times

for observer 2 (P = 0.02).

Table 2. Results of the quality assessment for both observers

Quality of assessment Observer 1 Observer 2

Quality MRI examinations

Moderate, either

Presence of artifacts

Movements/Peristalsis

Hip prosthesis

Inadequate imaging

a 1 patient had also movement/peristalsis artifacts and 1 patient had a hip prosthesis.

Per groin analysis

Of the 23 positive groins, both observers detected 12 (sensitivity 52%). Of the 96 negative groins, 14

and 11 were scored as positive by observers 1 and 2, respectively (specificity of 85% and 89% respec-

��

tively). Positive and negative predictive values for observer 1 were 46% and 87% and for observer 2,

52% and 89%, respectively. Sensitivity, specificity, positive predictive and negative predictive values

for both observers were comparable. The interobserver agreement was 104/119, producing a kappa

of 0.62, and therefore representing good agreement.

Per lymph node analysis

In total 33 and 28 malignant lymph nodes and 100 and 86 benign lymph nodes were identified on

MRI, by observer 1 and 2, respectively. Table 3 presents the MRI characteristics of the malignant

nodes and benign nodes as scored by both observers, independent of the histological findings.

Malignant nodes had larger short-axis compared to benign nodes, according to both observers.

However, only observer 2 showed a significant difference in the short-axes between malignant and

benign nodes, respectively 10.6 and 9.1 mm (P < 0.0001). According to both observers, malignant

nodes were associated with homogenous aspect, while benign nodes were associated with fatty

center or partial fat in the nodes (P < 0.0001). Smooth margin was associated with benign nodes

(P < 0.0001). Finally oval and elongated shapes were associated with benign nodes (P values of

respectively 0.004 and 0.01 for observer 1 and 2).

On histopathology, 38 malignant lymph nodes were found in 23 groins. The number of malignant

lymph nodes depicted on MR images in the corresponding 23 groins is presented in figure 3. Ob-

Table 3. Characteristics of malignant and benign lymph nodes assessed by both observers based on MRI features

MRI characteristics Observer 1 Observer 2

33 malignant 100 benign 28 malignant 86 benign

Short-axis diameter (mm) (mean ± SE)

10.1 ± 0.38 9.8 ± 0.16 10.6 ± 0.36 9.1 ± 0.17

Aspect

Homogeneous 19 1 14 8

Fatty center 1 70 0 46

Partial fat 8 23 10 32

Other 5 0 4 0

Margin

Smooth 4 91 5 66

Lobulated/speculated 25 9 16 20

Indistinct 4 0 7 0

Ratio long to short- axis

Round (ratio 1) 4 3 8 6

Oval (ratio1-1.5) 26 62 11 42

Elongated (ratio > 1.5) 3 35 9 38

��

Chapter 2

server 1 underestimated the number of positive lymph nodes in 14 groins and overestimated the

number of positive lymph nodes in 3 groins, while observer 2 underestimated the number of positive

lymph nodes in 15 groins and overestimated the number of lymph nodes in 1 groin.

In addition, observer 1 falsely identified 17 lymph nodes as malignant in 14 groins and observer

2 falsely identified 13 lymph nodes as malignant in 11 groins.

DIsCUssIon

This is the largest to date study evaluating a large number of groins in patients with vulvar cancer

undergoing MRI for preoperative work-up. On per groin analysis, MRI has low sensitivity and positive

predictive values compared to the reference standard (sentinel node procedure or inguinofemoral

lymph node dissection and subsequent histopathology). Specificity values were 85% and 88% for

observer 1 and observer 2 respectively. Positive predictive values for observer 1 were 88% and 89%

respectively. On per lymph node analysis, the observers either underestimated or overestimated

(false positives) the number of malignant lymph nodes.

Although several associations between nodes characteristics and the type of nodes obtained on

MRI were found. Malignant nodes had larger short-axis compared to benign nodes and were associ-

ated with homogenous aspect, while benign nodes were associated with fatty center or partial fat in

the nodes. However, no recommendation could be made, as the diagnostic value of MRI was limited

(sensitivity and positive predictive value).

A previous study of Sohaib et al, evaluating 22 patients (9 positive groins) and reported similar

results: high specificity values for the identification of superficial inguinal nodes (97%) and deep

inguinal lymph nodes (100%), low sensitivity of 40% and 50%, respectively [17]. In their study

each lymph node was assessed as either benign or malignant, without predefined criteria. A study

of Hawnaur et al [18], evaluating 10 patients reported a sensitivity of 89% (8 positive groins) and

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23

des Reference standard

MRI by observer 1

MRI by observer 2

Malignant lymph nodes

Figure 3. Number of malignant nodes observed by both observers on MRI compared with the number of malignant nodes obtained by histopathology in the positive groins.

��

specificity of 91%. In the latter study the following criteria were considered as malignant: long axis

diameter > 21 mm, short-axis > 10 mm, long/short-axis > 1.5, irregular margin, cystic aspect of lymph

nodes. The prevalence (9/10) of positive lymph nodes in the groins in this study was higher than in a

general patient population with vulvar cancer; in addition 5 of the 10 patients had locally advanced

diseases (Stage III or IV).

The study of Sohaib et al reported a sensitivity of 40% and a specificity of 97% for the de-

tection of malignant superficial inguinal nodes, if short-axis exceeded 10 mm. For detection of

malignant deep inguinal nodes the sensitivity and specificity were 50% and 100% respectively if

short-axis > 8 mm. In our study therefore all lymph nodes with a minimal short-axis of at least 8

mm depicted on MR images were recorded by both observers independently. We attempted to

study superficial inguinal lymph nodes and the femoral lymph nodes separately as performed by

Sohaib et al; yet the low number of malignant lymph nodes limited the accuracy and relevance of

such separate analysis.

Although the sensitivity and prevalence of malignant groins were low, the observers showed

good agreement in assessing lymph nodes (kappa 0.62).

One limitation of this study was the retrospective design, however, all imaging protocols were

comparable and the management (sentinel node or lymphadenectomy) was not based on MRI find-

ings, but on the clinical findings and therefore this retrospective design could not influence clinical

management and thus the reference standard. Another limitation of this study is the lack of a node

by node analysis. However, given the poor results per groin, these results can be expected even to

be inferior to the per groin results.

In the light of high negative predictive values (approaching 100%) of the sentinel node procedure

it is clear that MRI with 85% and 89% negative predictive values does not play a major role in the

decision to perform lymphadenectomy in patients with vulvar cancer. Theoretically, MRI could play

a role in triaging patients for sentinel node procedures; when a positive lymph node is suspected on

MRI and confirmed by fine-needle aspiration (FNA), these patients could be saved from a sentinel

node procedure. However due to the low positive predictive values (approximately 50%), half of the

patients will undergo unnecessary FNA procedures.

Moreover, ultrasonography combined with fine-needle aspiration (FNA) is an alternative imaging

technique to assess inguinal lymph nodes with sensitivity and specificity values up to 93% and 100%

respectively [24, 25]. When a positive lymph node is found with the combination of ultrasound and

FNA this may prevent unnecessary sentinel node procedure in a much cost effective way compared

to MRI.

Recent advantages on lymph node specific MRI contrast agents have shown improvements in the

differentiation between benign and malignant lymph nodes in pelvic cancer. Sensitivity values for the

detection of malignant pelvic lymph nodes are reported up to 100% [26-31]. To our knowledge, no

research has been performed or is being performed on the evaluation of ultrasmall paramagnetic

iron oxide (USPIO) in patients with vulva carcinoma.

At this stage there is no role for routine MRI in evaluating lymph node involvement in patients

with vulva carcinoma. Assessing lymph node involvement remains a major challenge for current im-

aging modalities and needs to be improved as lymphatic spread is an important prognostic factor.

Chapter 2

References

1. Podratz KC, Symmonds RE, Taylor WF, Williams TJ. Carcinoma of the vulva: analysis of treatment and survival. Obstet Gynecol 1983; 61:63-74.

2. Wharton JT, Gallager S, Rutledge FN. Microinvasive carcinoma of the vulva. Am J Obstet Gynecol 1974; 118:159-162.

3. Gaarenstroom KN, Kenter GG, Trimbos JB, et al. Postoperative complications after vulvectomy and inguino-femoral lymphadenectomy using separate groin incisions. Int J Gynecol Cancer 2003; 13:522-527.

4. Cavanagh D, Hoffman MS. Controversies in the management of vulvar carcinoma. Br J Obstet Gynaecol 1996; 103:293-300.

5. Bosquet JG, Kinney WK, Russell AH, Gaffey TA, Magrina JF, Podratz KC. Risk of occult inguinofemoral lymph node metastasis from squamous carcinoma of the vulva. International Journal of Radiation Oncology Biology Physics 2003; 57:419-424.

6. Essner R. The role of lymphoscintigraphy and sentinel node mapping in assessing patient risk in melanoma. Semin Oncol 1997; 24:S8-10.

7. Giuliano AE, Kirgan DM, Guenther JM, Morton DL. Lymphatic mapping and sentinel lymphadenectomy for breast cancer. Ann Surg 1994; 220:391-398.

8. Krag DN, Weaver DL, Alex JC, Fairbank JT. Surgical resection and radiolocalization of the sentinel lymph node in breast cancer using a gamma probe. Surg Oncol 1993; 2:335-339.

9. Nieweg OE, Jansen L, Valdes Olmos RA, et al. Lymphatic mapping and sentinel lymph node biopsy in breast cancer. Eur J Nucl Med 1999; 26:S11-S16.

10. de Hullu JA, Hollema H, Piers DA, et al. Sentinel lymph node procedure is highly accurate in squamous cell carcinoma of the vulva. J Clin Oncol 2000; 18:2811-2816.

11. Decesare SL, Fiorica JV, Roberts WS, et al. A pilot study utilizing intraoperative lymphoscintigraphy for identi-fication of the sentinel lymph nodes in vulvar cancer. Gynecol Oncol 1997; 66:425-428.

12. Levenback C, Coleman RL, Burke TW, Bodurka-Bevers D, Wolf JK, Gershenson DM. Intraoperative lymphatic mapping and sentinel node identification with blue dye in patients with vulvar cancer. Gynecol Oncol 2001; 83:276-281.

13. de Hullu JA, Oonk MH, Ansink AC, Hollema H, Jager PL, van der Zee AG. Pitfalls in the sentinel lymph node procedure in vulvar cancer. Gynecol Oncol 2004; 94:10-15.

14. Fons G, ter RB, Sloof G, de Hullu J, van der Velden J. Failure in the detection of the sentinel lymph node with a combined technique of radioactive tracer and blue dye in a patient with cancer of the vulva and a single positive lymph node. Gynecol Oncol 2004; 92:981-984.

15. Ascher SM, Imaoka I, Hricak H. Diagnostic imaging techniques in gynecologic oncology. In Hoskins WJ, Perez CA, Young RC, eds. Principles of Gynecologic Oncology, Philadelphia: Lippincott Williams & Wilkins, 2000: 629.

16. Togashi K, Nishimura K, Sagoh T, et al. Carcinoma of the cervix: staging with MR imaging. Radiology 1989; 171:245-251.

17. Sohaib SA, Richards PS, Ind T, et al. MR imaging of carcinoma of the vulva. AJR Am J Roentgenol 2002; 178:373-377.

18. Hawnaur JM, Reynolds K, Wilson G, Hillier V, Kitchener HC. Identification of inguinal lymph node metastases from vulval carcinoma by magnetic resonance imaging: an initial report. Clin Radiol 2002; 57:995-1000.

19. Kim JY, Harisinghani MG. MR imaging staging of pelvic lymph nodes. Magn Reson Imaging Clin N Am 2004; 12:581-586.

20. de Hullu JA, Oonk MH, van der Zee AG. Modern management of vulvar cancer. Curr Opin Obstet Gynecol 2004; 16:65-72.

21. de Hullu JA, Doting E, Piers DA, et al. Sentinel lymph node identification with technetium-99m-labeled nanocol-loid in squamous cell cancer of the vulva. J Nucl Med 1998; 39:1381-1385.

22. de Hullu JA, van der Zee AG. Sentinel node techniques in cancer of the vulva. Curr Womens Health Rep 2003; 3:19-26.

23. de Hullu JA, van der Zee AG. Groin surgery and the sentinel lymph node. Best Pract Res Clin Obstet Gynaecol 2003; 17:571-589.

24. Hall TB, Barton DP, Trott PA, et al. The role of ultrasound-guided cytology of groin lymph nodes in the manage-ment of squamous cell carcinoma of the vulva: 5-year experience in 44 patients. Clin Radiol 2003; 58:367-371.

25. Moskovic EC, Shepherd JH, Barton DP, Trott PA, Nasiri N, Thomas JM. The role of high resolution ultrasound with guided cytology of groin lymph nodes in the management of squamous cell carcinoma of the vulva: a pilot study. Br J Obstet Gynaecol 1999; 106:863-867.

26. Bellin MF, Lebleu L, Meric JB. Evaluation of retroperitoneal and pelvic lymph node metastases with MRI and MR lymphangiography. Abdom Imaging 2003; 28:155-163.

27. Harisinghani MG, Saini S, Weissleder R, et al. MR lymphangiography using ultrasmall superparamagnetic iron oxide in patients with primary abdominal and pelvic malignancies: radiographic-pathologic correlation. AJR Am J Roentgenol 1999; 172:1347-1351.

28. Keller TM, Michel SC, Frohlich J, et al. USPIO-enhanced MRI for preoperative staging of gynecological pelvic tumors: preliminary results. Eur Radiol 2004; 14:937-944.

29. Kim JH, Beets GL, Kim MJ, Kessels AG, Beets-Tan RG. High-resolution MR imaging for nodal staging in rectal cancer: are there any criteria in addition to the size? Eur J Radiol 2004; 52:78-83.

30. Koh DM, Brown G, Temple L, et al. Rectal cancer: mesorectal lymph nodes at MR imaging with USPIO versus histopathologic findings--initial observations. Radiology 2004; 231:91-99.

31. Rockall AG, Sohaib SA, Harisinghani MG, et al. Diagnostic performance of nanoparticle-enhanced magnetic resonance imaging in the diagnosis of lymph node metastases in patients with endometrial and cervical cancer. J Clin Oncol 2005; 23:2813-2821.

3C h a p t e r

Computed tomography and magnetic resonance imaging in staging of uterine cervical carcinoma: a systematic review

Shandra BipatAfina S. GlasJacobus van der VeldenAeilko H. ZwindermanPatrick M. M. BossuytJaap Stoker

Gynecologic Oncology 2003;91:59-66.

�0 ��

Chapter 3

�0 ��

absTRaCT

Objective: The goal of this article is to systematically review the available evidence on the diag-

nostic performance of computed tomography (CT) and magnetic resonance imaging (MRI) in

staging of cervical carcinoma.

Methods: A comprehensive computer literature search was performed in MEDLINE and EMBASE

databases from January 1985 to May 2002. Two reviewers independently scored method-

ological quality of included studies and extracted relevant data for data analysis. A bivariate

random-effects approach was used to summarize estimates of sensitivity and specificity values.

Covariates were added to this model to study the influence of sample size, publication year,

methodological criteria, and MRI techniques on summary estimates.

Results: Fifty-seven articles were included. In 49 articles one imaging modality was evaluated

(MRI, 38; CT, 11), and in 8 articles, both. Inclusion criteria were: minimum of 10 patients includ-

ed, histopathology as reference standard, sufficient data presented to construct 2 × 2 tables.

The exclusion criterion was: data reported elsewhere in more detail. Sensitivity estimates for

parametrial invasion were 74% (95% CI: 68%-79%) for MRI and 55% (95% CI: 44%–66%) for

CT and for lymph node involvement 60% (95% CI: 52%-68%), and 43% (95% CI: 37%-57%),

respectively. MRI and CT had comparable specificities for parametrial invasion and lymph node

involvement. For bladder invasion and rectum invasion the sensitivities for MRI were respec-

tively 75% (95% CI: 66%-83%) and 71% (95% CI: 53%-83%), higher compared with CT. The

specificity in evaluating bladder invasion for MRI was significantly higher compared with CT:

91% (95% CI: 83%-95%) for MRI and 73% (95% CI: 52%-87%) for CT. The specificities for

rectum invasion were comparable. Differences in patient sample size, publication year, meth-

odological criteria and MRI techniques had no effect on the summary estimates.

Conclusion: For overall staging of cervical carcinoma, MRI is more accurate than CT.

�0 ��

CT and MRI in staging uterine cervical carcinoma: systematic review

�0 ��

InTRoDUCTIon

Cervical carcinoma is one of the most common cancers in developing countries, accounting for

6% of all malignancies in women (estimated 470,000 cases in 2000) and is associated with a high

mortality (approximately 233,000 worldwide) [1]. Most patients with cervical cancer are detected

through symptoms, such as abnormal vaginal bleeding and vaginal discharge. Investigation of an

abnormal cervical smear followed by histopathology usually establishes the diagnosis.

The Federation Internationale de Gynecologie et d’Obstetrique (FIGO) staging system is used

worldwide for the clinical staging of cervical carcinoma and does not include evaluation of lymph

node involvement. However, the prognosis of patients with cervical cancer depends heavily on

lymph node involvement [2].

The choice to perform surgery (radical hysterectomy with bilateral pelvic lymphadenectomy

for stages IA, IB, and IIA) or (chemo) radiotherapy (intracavitary and external-beam radiotherapy

for stages IIB, III, and IV) depends on both the FIGO stage of the disease and the lymph node

involvement. The clinical staging system relies primarily on findings at physical examination (gyne-

cological examination) and basic imaging techniques such as chest film, intravenous urography,

barium enema, cystoscopy, and sigmoidoscopy. Discrepancies of approximately 25% (in early

stage ≤ IIA) and 65–90% (in advanced stages ≥ IIB) between clinical and surgical staging have

been reported [3-6]. Furthermore little or no information on lymph node involvement is obtained

by clinical examination and basic imaging techniques [7].

In the search for more accurate diagnostic tools, cross-sectional imaging modalities such as

computed tomography (CT) and magnetic resonance imaging (MRI) [8-18] have been proposed.

Despite extensive research efforts on the diagnostic performance of these modalities in staging of

cervical carcinoma, no uniform approach has emerged. The decision to use one of these imaging

modalities should be based on reliable evidence of their diagnostic performance.

The purpose of the study was to summarize the available evidence and to obtain precise and

valid estimates of the diagnostic performance of CT and MRI in the evaluation of parametrial inva-

sion, bladder and rectum invasion, and lymph node involvement. For this purpose, we conducted

a systematic review in which we also studied to what degree differences in techniques and differ-

ences in study design could account for the variability in results on diagnostic accuracy.

Data sources A comprehensive computer literature search of English and German language studies in human

subjects was performed to identify articles on the diagnostic performance of CT and MRI in stag-

ing of cervical carcinoma compared with histopathology as reference standard.

The MEDLINE and EMBASE databases from January 1985 to May 2002 were searched for

the following terms: “cervix neoplasms, tomography/X-ray computed/or magnetic resonance

��

Chapter 3

��

imaging/or nuclear magnetic resonance” as medical subject headings (MeSH) and “specificity/or

false negative/or accuracy” as text words [19-23].

The list of articles was supplemented by extensive cross-checking of the reference lists.

Review articles, letters, comments, case reports, and articles not presenting raw data were not

selected.

Study selectionStudies were included when all of the following inclusion criteria were met: (1) minimum sample

size of 10 patients; (2) histopathology of specimens obtained by surgery, laparotomy, postmor-

tem, biopsy, or fine-needle aspiration as reference standard: (3) sufficient data to construct a 2

× 2 contingency table in which the findings from the imaging technique can be compared with

those from the reference standard (cells labeled as true positives, false positives, true negatives,

and false negatives). The exclusion criterion was data reported elsewhere in more detail (dupli-

cate publication).

Data extractionTwo observers independently performed data extraction using a standard form. Discrepancies in

judgment were solved by the independent judgment of a third reviewer. For each study sensitivity

and specificity were calculated from the 2 × 2 tables for detection of parametrial invasion, blad-

der invasion, rectal invasion, and lymph node involvement. We also recorded whether data were

analyzed per patient or per site analysis (e.g. per parametrium for parametrial invasion and per

site/node for lymph node metastases).

In addition, the following items were extracted: (a) year of publication; (b) CT technique: type

of scanner (conventional, spiral/helical CT), use of contrast (oral, rectal, intravenous), and slice

thickness; (c) MRI technique: magnetic field, type of coil (body, phased array, surface, endorec-

tal coil), type of sequences, and use of intravenous contrast medium (gadolinium); and (d) total

sample size.

The following methodological design criteria were scored for all included studies [23-26]: (a)

patient selection (consecutive or non-consecutive); (b) interpretation of results (blinded or not

blinded); (c) method of verification (partial verification or complete); (d) method of data collec-

tion (prospective, retrospective and unknown); and (e) description of study population, diagnostic

test(s), and reference test (sufficient or insufficient).

The reference test (histopathology of specimens obtained by surgery, laparotomy, post-

mortem, biopsy, or fine-needle aspiration) and the diagnostic test, CT and/or MRI, had to be

described with sufficient detail to allow for replication, validation, and generalization of the study.

Descriptions of these tests were scored as sufficient if clear definitions of positive and negative

test results were mentioned in the text. Description of the study population was judged to be suf-

ficient if at least the following characteristics were described: age of participants and distribution

of symptoms (FIGO classification).

��

Data analysisBecause of the anticipated heterogeneity of sensitivity and specificity between studies we used a

bivariate random-effects approach to analyze the data [27, 28]. We assumed a bivariate normal

distribution for the logit-transformed sensitivity and specificity values, taking into account both

the estimation error of the sensitivity and specificity values in each study as well as heterogeneity

between studies due to differences in patient population and study design.

Using this model we obtained estimates of the mean logit-transformed sensitivity and specific-

ity values across studies. Summary estimates of sensitivity and specificity with confidence intervals

were calculated after antilogarithm transformation of these logit estimates.

Covariate adjustmentTo determine whether results were significantly affected by heterogeneity between individual

studies, the influence of the following covariates was analyzed: sample size (> 50 vs. ≤ 50)

and publication period (1985–1991 vs. 1992–1997 vs. 1998–2002). We also evaluated whether

shortcomings in methodology (patient selection, unblinded interpretation of test results, verifica-

tion bias, and retrospective collection of data), had an effect on diagnostic accuracy.

For this purpose we adjusted for these criteria by adding covariates simultaneously to the

bivariate approach.

Subgroup analysisSubgroup analysis compared MRI techniques: (a) T2-weighted with Gd-enhanced T1-weighted

sequences; (b) Body coil with body and additional coil (e.g., surface, endorectal coil, or phased

array coil); and (c) Low to medium magnetic field strength (< 1.5 T) with high field magnetic field

strength (≥ 1.5 T).

Subgroup analysis was not possible for different MRI sequences (e.g., fast spin echo, gradient

echo, turbo spin echo) and technical CT factors (e.g. scan thickness, use of contrast, and type of

scanner) due to the diversity, and small number of data.

The z test was used to test for differences between subgroups. A P value of 0.05 or less was

considered statistically significant. Calculations and analysis were performed with Microsoft Excel

2000 (Microsoft, Seattle, WA, USA), SPSS 10.0 for Windows (SSPS, Chicago, IL, USA) and the SAS

statistical software version 8.02 (SAS Institute Inc., NC, USA).

ResUlTs

Literature search and study selectionWith the computer search and extensive cross-checking of the reference lists, 111 articles could

be identified. In this set 87 papers were found to be eligible after reading the abstract. Of the 87

potentially eligible articles, 61 fulfilled all inclusion criteria. Four studies had to be excluded due to

publication in more detail elsewhere, resulting in 57 articles. In 49 articles, one imaging modality

��

Chapter 3

�� Figure 1. Result of individual studies plotted in Receiver Operating Characteristics (ROC) spaces

Methodological quality elements Score N*

Patient selection Consecutive 25

Non-consecutive 32

Interpretation of test results Blinded 29

Not blinded 28

Method of verification Complete 43

Partial 14

Description of study population Sufficient 34

Insufficient 23

Description of diagnostic test(s) Sufficient 54

Insufficient 3

Description of reference test Sufficient 25

Insufficient 32Method of data collection Prospective 30

Retrospective 14

Unknown 13* Number of articles

Table 1. Results of the assessment of methodological quality elements

��

alone was evaluated (MRI: 38, CT: 11), and in 8 articles both imaging modalities were evaluated.

Table 1 lists the results of the methodological quality assessment of these articles. Many

studies suffered from selective patient sampling, suboptimal interpretation of results, incomplete

verification methods, and poor description of study population and of reference test. A list of all

included articles with relevant characteristics is available on request from the authors.

Data analysisFigure 1 shows the distribution of sensitivity and specificity between studies. Sensitivity as well

as specificity in the evaluation of parametrial invasion was heterogeneous for MRI; for CT a small

number of data sets were included. For lymph node involvement, heterogeneity was observed

mainly in the sensitivity values, for both CT and MRI. Only a small number of studies evaluated

invasion into the bladder and rectum.

Results of the bivariate data analysis on the diagnostic values of CT and MRI in the detection

of parametrial, bladder and rectum invasion, and lymph node involvement are summarized in

figure 2.

The results of the bivariate approach show a significantly higher sensitivity in the evaluation of

parametrial invasion by MRI compared with CT: 74% (95% CI: 68%–79%) for MRI and 55% (95%

CI: 44%–66%) for CT (P = 0.0027). The sensitivity for lymph node involvement by MRI was also

found to be significantly higher compared with CT: 60% (95% CI: 52%–68%) for MRI and 43%

(95% CI: 37%–57%) for CT (P = 0.047). The specificities in the evaluation of parametrial invasion

and lymph node involvement for MRI and CT were comparable.

Figure 2. Sensitivity and specificity estimates (with confidence intervals) of CT and MRI for staging of cervix carcinoma. n = number of data sets included. *Significant difference when compared with CT (sensitivity esti-mates), P < 0.05. **Significant difference when compared with CT (specificity estimates).

�6 �7

Chapter 3

�6 �7

Figure 4. Results of the bivariate approach with covariates for evaluation of lymph node involvement by MRI. Summary sensitivity and specificity values are represented with confidence intervals.

Figure 3. Results of the bivariate approach with covariates for evaluation of parametrial invasion by MRI. Summary sensitivity and specificity values are represented with confidence intervals.

�6 �7

The sensitivity for bladder invasion and rectum invasion by MRI were higher compared with CT:

respectively 75% (95% CI: 66%–83%) and 71% (95% CI: 53%–83%) for MRI compared with

64% (95% CI: 39%–82%) and 45% (95% CI: 20%–73%) for CT. These differences were not sta-

tistically significant. The specificity in the evaluation of bladder invasion for MRI was significantly

higher compared with CT: 91% (95% CI: 83%–95%) for MRI and 73% (95% CI: 52%–87%) for

CT (P = 0.0324). The specificities for rectum invasion were comparable.

Covariate adjustmentCovariate adjustment (size of patient population and year of publication) and subgroup analysis

(for MRI techniques) was possible only in the MRI data sets obtained for parametrial invasion

and lymph node involvement. Population size (>50 vs. ≤ 50) and publication period (1985–1991

vs. 1992–1997 vs. 1998–2002) had no influence on both the sensitivity and specificity estimates

(Figure 3 and Figure 4).

Shortcomings in methodology (patient selection, unblinded interpretation of test results, verifi-

cation bias, and retrospective collection of data) also had no effect on diagnostic accuracy.

Subgroup analysisNo differences were observed in subgroups comparing MRI techniques; T2-weighted vs. Gd-

enhanced T1-weighted sequences, body coil vs. body and additional coil, and low to medium

magnetic field strength (< 1.5 T) vs. high field magnetic field strength (≥ 1.5 T) (Figure 3 and

Figure 4).

DIsCUssIon

The aim of this systematic review was twofold: (1) to summarize the available evidence and to

obtain valid and precise summary estimates of the diagnostic performance of CT and MRI for

staging of cervical carcinoma, and (2) to explore potential sources of heterogeneity in findings

between studies.

In this analysis of 17 years of published literature we found many studies, with MRI being the

most extensively evaluated procedure. We found a significantly higher sensitivity for the evalua-

tion of parametrial invasion and lymph node involvement by MRI compared with CT. Sensitivity

for MRI in the evaluation of bladder invasion and rectum invasion was also higher compared with

values for CT, but these differences were not statistically significant. The specificity estimates in

the evaluation of parametrial invasion, lymph node involvement, and rectum invasion for MRI and

CT were comparable. The specificity estimate for bladder invasion by MRI was significantly higher

than that by CT. For the staging of cervical carcinoma, the clinical (FIGO) staging is used world-

wide and proposed to be the official staging method. The inherent limitations of this staging

system concerning invasion outside the cervix and presence of lymph node metastases are espe-

cially prominent in advanced disease (higher prevalence).

��

Chapter 3

��

Because of these limitations of the clinical staging system, we studied the two currently used cross-

sectional imaging modalities, CT and MRI, in evaluating all factors influencing treatment modality

and prognosis: parametrial, bladder and rectum invasion, and lymph node involvement.

Additionally, we scored methodological quality features to test whether shortcomings in meth-

odology affected diagnostic accuracy. Most studies suffered from methodological weaknesses

(see table 1) such as insufficient description of the reference standard, non-consecutive patient

selection, unblinded interpretation of data, verification bias, and retrospective collection of data,

which can cause over- and underestimation of diagnostic accuracies [25]. Arrive et al. [24], using

another scale for the assessment of methodological criteria in radiological studies, also showed

the methodological weaknesses of most radiological studies. These results imply that only a small

minority of the available data can be used in clinical practice, as shortcomings in study design

characteristics can affect the diagnostic accuracy [25, 29-31]. We feel that investigators in the

field of radiology should be aware of optimal designs of their studies and readers should be aware

of weaknesses in design when interpreting the results.

Furthermore, we made an attempt to minimize some of the well-known limitations of meta-

analysis. If primary studies suffer from inadequate scientific quality, combining them in an analytic

approach will yield poor results. We attempted to minimize this problem by (1) applying inclusion

criteria, (2) combining results in a bivariate random-effects approach, which accounts for varia-

tion in results, and (3) using covariate adjustment to explore heterogeneity based on study design

characteristics.

This bivariate random-effects approach [27, 28] is more convenient than the model described

by Moses and colleagues [32, 33], which focuses on the summary receiver operating characteris-

tic (SROC) curve. The bivariate approach produces summary estimates of sensitivity and specificity

as outcomes, which are more familiar to clinicians. Summary estimates of sensitivity and specific-

ity can also be estimated from the SROC curve but this is not clear-cut, since many different pairs

of sensitivity and specificity can be chosen along the SROC curve. One other advantage of this

bivariate random is that both the error of estimation of the sensitivity and specificity values in

each study and the heterogeneity between studies due to different population or threshold set-

tings are taken into account. With this model it is also possible to evaluate the effects of study

characteristics on sensitivity and specificity separately.

One meta-analysis [34] evaluated CT and MRI in the detection of lymph node involvement. In

that study, CT and MRI were found to be comparable in the detection of lymph node metastases

from cervical carcinoma. However, the analysis addressed only a subset of methodological quality

criteria and did not include more recent studies with current technology and data were combined

in a SROC. Moreover, the analysis evaluated only lymph node involvement, whereas the present

analysis focused on the overall staging (FIGO and lymph node involvement).

What are the shortcomings of our systematic review? By including only published data, we

cannot exclude publication bias, which tends to cause overestimation of diagnostic performance

because of the greater likelihood of publication of positive rather than negative results [35].

Another limitation of this study is that we were not able to categorize patients for subgroup

analysis, due to variations in the staging obtained by pathology.

��

Despite our attempt to explore the heterogeneity, a considerable variation in accuracy between

the included studies remained unexplained. It cannot be excluded that other factors such as dif-

ferences in CT and MRI equipment, institutions, or patient population (spectrum of disease) have

a systematic impact on the diagnostic accuracy.

CT and MRI are limited as it is impossible to differentiate metastatic nodes from non-meta-

static hyperplastic nodes of similar size and shape; the only CT and MRI criteria that are generally

accepted in the evaluation of pelvic node metastases are the size and shape of the node. In the

past decade, a 1-cm diameter has become the preferred criterion, as either the maximum or

minimum transverse diameter. By using this criterion, the reported sensitivity values for CT and

MRI were low, whereas the specificity values were high.

On the basis of our meta-analysis of imaging findings, the following clinical practice guidelines

can be formulated for staging cervical carcinoma. In clinically early-stage cancer, the prevalence

of spread of disease outside the cervix is low and therefore the additional value of MRI is limited.

In more advanced disease, MRI can play an important role as clinical staging has significant limita-

tions in advanced disease. Costs of CT and MRI are difficult to evaluate.

Preoperative MRI has been shown to be cost minimizing, as using MRI as initial imaging exami-

nation requires fewer tests and fewer procedures compared with the standard workup including

CT [36]. However, more MRI studies satisfying all methodological criteria are needed to fully

evaluate MRI techniques and protocols, and to obtain a uniform MRI strategy.

�0 ��

Chapter 3

�0 ��

References

1. World Health Organisation Globoscan 2000, Cancer incidence, mortality and prevalence worldwide, Version 1.0. IARC Cancer Base No. 5. , IARC Press, Lyon (2001).

2. Tanaka Y, Sawada S and Murata T. Relationship between lymph node metastases and prognosis in patients irradiated postoperatively for carcinoma of the uterine cervix. Acta Radiol Oncol 1994;23:455–459.

3. Subak LL, Hricak H, Powell CB, Azizi L, Stern JL. Cervical carcinoma: computed tomography and magnetic resonance imaging for preoperative staging. Obstet Gynecol 1995;86: 43–50.

4. Averette HE, Ford, JH Jr, Dudan RC, Girtanner RE, Hoskins WJ, Lutz MH. Staging of cervical cancer. Clin Obstet Gynecol 1975;18:215–232.

5. Lagasse LD, Creasman WT, Shingleton HM, Ford JH, Blessing JA. Results and complications of operative staging in cervical cancer: experience of the Gynecologic Oncology Group. Gynecol Oncol 1980;9:90–98.

6. Van NJ Jr, Roddick JW, Lowin DM. The staging of cervical cancer: inevitable discrepancies between clinical staging and pathologic findings. Am J Obstet Gynecol 1971;110: 973–978.

7. Lanza A, Re A, D’Addato F, Morino M, Wierdis T, Caldarola B, Ferraris G. Lymph nodal metastases and the clinical stage of cervix carcinoma. Eur J Gynaecol Oncol 1987;8: 61–67.

8. Bandy LC, Clarke-Pearson DL, Silverman PM, Creasman WT. Computed tomography in evaluation of extra-pelvic lymphadenopathy in carcinoma of the cervix. Obstet Gynecol 1985;65:73–76.

9. Angel C, Beecham JB, Rubens DJ, Thornbury JR, Stoler MH. Magnetic resonance imaging and pathologic correlation in stage IB cervix cancers. Gynecol Oncol 1987;27: 357–367.

10. Botsis D, Gregoriou O, Kalovidouris A, Tsarouchis K, Zourlas PA. The value of computed tomography in staging cervical carcinoma. Int J Gynaecol Obstet 1988;27:213–218.

11. Abe Y, Yamashita Y, Namimoto T, Takahashi M, Katabuchi H, Tanaka N, Okamura H. Carcinoma of the uterine cervix: high-resolution turbo spin-echo MR imaging with contrast-enhanced dynamic scanning and T2-weighting. Acta Radiol 1998;39:322–326.

12. deSouza NM, Whittle M, Williams AD, et al. Magnetic resonance imaging of the primary site in stage I cervical carcinoma: a comparison of endovaginal coil with external phased array coil techniques at 0.5T. J Magn Reson Imaging 2000;12:1020–1026.

13. Fujiwara K, Yoden E, Asakawa T, et al. Role of magnetic resonance imaging (MRI) in early cervical cancer. Gan To Kagaku Ryoho 2000;27 suppl 2:576–581.

14. Oellinger JJ, Blohmer JU, Michniewicz K, et al. Pre-operative staging of cervical cancer: comparison of magnetic resonance imaging (MRI) and computed tomography (CT) with histologic results. Zentralbl Gynakol 2000;122:82–91.

15. Sheu MH, Chang CY, Wang JH, Yen MS. Cervical carcinoma: assessment of parametrial invasion and lymph node metastasis with magnetic resonance imaging. Chung Hua I Hsueh Tsa Chih (Taipei) 2000;63:634–640.

16. Yang WT, Lam WW, Yu MY, Cheung TH, Metreweli C. Comparison of dynamic helical CT and dynamic MR imaging in the evaluation of pelvic lymph nodes in cervical carcinoma. AJR Am J Roentgenol 2000;175:759–766.

17. Ascher SM, Takahama J, Jha RC. Staging of gynecologic malignancies. Top Magn Reson Imaging 2001;12:105–129.

18. Pannu HK, Corl FM, Fishman EK. CT evaluation of cervical cancer: spectrum of disease. Radiographics 2001;21:1155–1168.

19. Deville WL, Bezemer PD, Bouter LM. Publications on diagnostic test evaluation in family medicine journals: an optimal search strategy. J Clin Epidemiol 2000;53:65–69.

�0 ��

20. Deville WL, Buntinx F, van der Windt DA, et al. Didactic guidelines for conducting systematic reviews of studies evaluating the accuracy of diagnistic tests. In: J.A. Knottnerus, Editor, The evidence base of diag-nosis, BMJ Publishing Group, London (2001).

21. Deeks JJ. Systematic reviews in health care: systematic reviews of evaluations of diagnostic and screening tests. BMJ 2001;323:157–162.

22. Assendelft WJ, van Tulder MW, Scholten RJ, Bouter LM. [The practice of systematic reviews. II. Searching and selection of studies]. De praktijk van systematische reviews. II. Zoeken en selecteren van literatuur. Ned Tijdschr Geneeskd 1999;143: 656–661.

23. van der Windt DA, Zeegers MP, Kemper HC, Assendelft WJ, Scholten RJ. [Practice of systematic reviews. VI. Searching, selection and methodological evaluation of etiological research]. De praktijk van systema-tische reviews. VI. Zoeken, selecteren en methodologisch beoordelen van etiologisch onderzoek. Ned Tijdschr Geneeskd 2000;144:1210–1214.

24. Arrive L, Renard R, Carrat F, et al. A scale of methodological quality for clinical studies of radiologic exami-nations. Radiology 2000;217:69–74.

25. Lijmer JG, Mol BW, Heisterkamp S, et al. Empirical evidence of design-related bias in studies of diagnostic tests. JAMA 1999;282:1061–1066.

26. Assendelft WJ, Scholten RJ, van Eijk JT, Bouter LM. [The practice of systematic reviews. III. Evaluation of methodological quality of research studies] De praktijk van systematische reviews. III. Methodologische beoordeling van onderzoeken. Ned Tijdschr Geneeskd 1999;143:714–719.

27. Van Houwelingen HC, Zwinderman KH,. Stijnen T. A bivariate approach to meta-analysis. Stat Med 1993;12:2273–2284.

28. Van Houwelingen HC, Arends LR, Stijnen T. Advanced methods in meta-analysis: multivariate approach and meta-regression. Stat Med 2002;21:589–624.

29. Khan KS, Daya S, Jadad A. The importance of quality of primary studies in producing unbiased systematic reviews. Arch Intern Med 1996;156:661–666.

30. Panzer RJ, Suchman AL, Griner PF. Workup bias in prediction research. Med Decis Making 1987;7:115–119.

31. Ransohoff DF and Feinstein AR. Problems of spectrum and bias in evaluating the efficacy of diagnostic tests. N Engl J Med 1978;299:926–930.

32. Littenberg B and Moses LE. Estimating diagnostic accuracy from multiple conflicting reports: a new meta-analytic method. Med Decis Making 1993;13:313–321.

33. Moses LE, Shapiro D, Littenberg B. Combining independent studies of a diagnostic test into a summa-ry ROC curve: data-analytic approaches and some additional considerations. Stat Med 1993;12:1293–1316.

34. Scheidler J, Hricak H, Yu KK, Subak L, Segal MR. Radiological evaluation of lymph node metastases in patients with cervical cancer: a meta-analysis. JAMA 1997;278:1096–1101.

35. Irwig L, Macaskill P, Glasziou P, Fahey M. Meta-analytic methods for diagnostic test accuracy. J Clin Epide-miol 1995;48:119–130.

36. Hricak H, Powell CB, Yu KK, Washington E, Subak LL, Stern JL et al. Invasive cervical carcinoma: role of MR imaging in pretreatment workup: cost minimization and diagnostic efficacy analysis. Radiology 1996;198: 403–409.

4C h a p t e r

Rectal cancer: local staging and assessment of lymph node involvement with endoluminal US, CT and MR imaging: a meta-analysis

Shandra BipatAfina S. GlasFrederik J. M. SlorsAeilko H, ZwindermanPatrick M. M. BossuytJaap Stoker

Radiology 2004:232:773-783

��

Chapter 4

absTRaCT

Purpose: To perform a meta-analysis to compare endoluminal ultrasonography (EUS), computed

tomography (CT), and magnetic resonance imaging (MRI) in rectal cancer staging.

Methods: Relevant articles published between 1985 and 2002 were included if more than 20

patients were studied, histopathologic findings were the reference standard, and data were

presented for 2 x 2 tables; articles were excluded if data were reported elsewhere in more de-

tail. Two reviewers independently extracted data on study characteristics and results. Bivariate

random-effects approach was used to obtain summary estimates of sensitivity and specificity

for invasion of muscularis propria, perirectal tissue, and adjacent organs, and for lymph node

involvement. Summary receiver operating characteristic (ROC) curves were fitted for perirectal

tissue invasion and lymph node involvement.

Results: Ninety articles fulfilled all inclusion criteria. For muscularis propria invasion, EUS and

MRI had similar sensitivities; specificity of EUS (86% [95% CI: 80%-90%]) was significantly

higher than that of MRI (69% [95% CI: 52%-82%]) (P = 0.02). For perirectal tissue invasion,

sensitivity of EUS (90% [95% CI: 88%-92%]) was significantly higher than that of CT (79%

[95% CI: 74%-84%]) (P < 0.001) and MRI (82% [95% CI: 74%-87%]) (P= 0.003); specificities

were comparable. For adjacent organ invasion and lymph node involvement, estimates for

EUS, CT, and MRI were comparable. Summary ROC curve for EUS of perirectal tissue invasion

showed better diagnostic accuracy than that of CT and MRI. Summary ROC curves for lymph

node involvement showed no differences in accuracy.

Conclusion: For local invasion, EUS was most accurate and can be helpful in screening patients

for available therapeutic strategies.

staging of rectal cancer with endoluminal Us, CT and MRI: meta-analysis

��

InTRoDUCTIon

Rectal cancer is a common cancer and a major cause of mortality in Western countries. The diag-

nosis is usually established by means of clinical examination (rectal digital examination), endoscopy

(sigmoidoscopy and colonoscopy), double-contrast enema examination, and histologic confirma-

tion, supplemented by biochemistry (e.g. blood carcinoembryonic antigen measurement). All of

these techniques are poor indicators of the depth of invasion and lymph node involvement, which

are both important features for prognosis [1-5].

Accurate preoperative assessment of these prognostic factors is an important first step in

assigning patients to one of the available treatment strategies, which include transanal local

excision, transanal endoscopic microsurgery, total mesorectal excision, preoperative irradiation,

and preoperative chemotherapy. From the clinical point of view, it is important to select patients

for local therapy, such as transanal local excision or transanal endoscopic microsurgery (mainly

stage T1 or lower) [6-9]; total mesorectal excision (mainly stages T2 and T3); and a long course of

preoperative (chemotherapeutic) radiation therapy, aimed at downsizing and downstaging the

tumor(s) (mainly stage T4) [10-12].

In patients considered suitable for total mesorectal excision, the spread of tumor to the me-

sorectal fascia is the second important feature that needs to be assessed [13-15]. This relation

determines if a patient can be treated directly with or without a short course of preoperative

radiation therapy or whether the patient should be considered to have a locally advanced tumor

necessitating a long course of chemotherapeutic radiation therapy. This is an important next step

in the selection of patients for the proper treatment strategy. The identification and the role of

mesorectal fascia are still under investigation, however; therefore, the assessment of the depth

of cancer invasion (T stage) remains the primary and most important feature in the treatment

of patients with rectal cancer. The presence of lymph node involvement is relevant for clinical

decision making in two circumstances: (1) if local excision in the absence of lymphadenopathy

is performed and (2) if lymph node metastases are present outside the endopelvic envelope, in

which case the tumor is considered to be locally advanced.

Non-invasive radiologic modalities such as endoluminal ultrasonography (EUS), computed to-

mography (CT), and magnetic resonance imaging (MRI) have proved to be important and have

been widely used diagnostic tools in the assessment of depth of cancer invasion and/or lymph

node involvement. Extensive research on the diagnostic performance of these modalities in the

staging of rectal cancer has been performed [16-20], yet studies on the evaluation of all three

imaging modalities within the same patient population are limited.

Furthermore, a wide variation in study design, patient population, imaging techniques, and

results exists. These factors make it difficult for workers in this field to know the diagnostic per-

formance of these imaging modalities.

A meta-analysis of diagnostic tests represents a powerful tool to summarize findings in the

literature by taking into account and enabling analysis of differences between studies [21, 22].

Thus, the purpose of our study was to perform a meta-analysis to compare EUS, CT, and MRI in

the staging of rectal cancer.

Chapter 4

Literature SearchA comprehensive computerized systematic literature search [23] was performed (S.B.) to identify

abstracts of English-language articles from studies involving human subjects. Relevant studies on

the diagnostic performance of EUS, CT, and MRI in the staging of rectal cancer were identified.

The MEDLINE database from January 1985 to December 2002 was searched with the follow-

ing keywords: (a) “rectal neoplasms” (medical subject heading, or MeSH) and (b) “magnetic reso-

nance imaging” (MeSH) or “tomography, x-ray computed” (MeSH) or “ultrasonography” (MeSH)

and (c) “specificity” or “false negative” or “accuracy” as text words.

The EMBASE, Cochrane, and CANCERLIT databases were also checked for relevant articles by

using (a) “rectal cancer” and (b) “magnetic resonance imaging” or “computed tomography” or

“ultrasonography” and (c) “specificity” or “false negative” or “accuracy” as text words. To identify

additional relevant references, the reference lists of the articles retrieved were checked manually.

After reading the abstracts, one reviewer (S.B.) examined all potentially eligible articles in

which EUS, CT, and/or MRI were evaluated. Reviews, letters, comments, case reports, and articles

that did not present raw data were excluded.

Study SelectionStudies were selected if they fulfilled all of the following inclusion criteria: (1) more than 20

patients had histologically proved rectal adenocarcinoma or carcinoma and were not treated with

preoperative chemotherapy and/or radiation therapy; (2) histopathologic findings (specimens

obtained at surgery, laparoscopy, laparotomy, lymph node biopsy) were used as the reference

standard; (3) sufficient data were presented to construct a 2 x 2 contingency table (either raw 2 x

2 data or sensitivity and/or specificity with absolute numbers of positive and negative findings or

the standard errors) of the imaging modalities compared with the reference standard for invasion

of the submucosa, muscularis propria, perirectal tissue, or adjacent organs or lymph node involve-

ment (perirectal or distant lymph nodes).

Studies were excluded if data were reported elsewhere in more detail. When data were pub-

lished more than once, the study with the most details or the most patients was included.

Data ExtractionTwo reviewers (S.B., F.J.M.S.) independently extracted relevant data from each article, including

study characteristics and test results, by using a standardized data extraction form. The reviewers

were not blinded to authors, journal name, or year of publication. Both reviewers extracted data

from all articles. In cases of discrepancies, a third blinded reviewer assessed all discrepant items,

and majority opinion was used for analysis.

Study CharacteristicsThe following study design characteristics were scored: (a) patient selection: consecutive or non-

consecutive; (b) interpretation of test results: blinded or not blinded; (c) verification: complete or

partial—in cases in which more than 10% of the study group was not subjected to the reference

test, the study was scored as applying partial verification; all other cases were scored as complete

verification; (d) methods of data collection: prospective, retrospective, or unknown—data collec-

tion was categorized as either prospective or retrospective; in case of doubt, the method of data

collection was scored as unknown; (e) reporting of study population: sufficient or insufficient—a

description of the study population was judged to be sufficient if at least the age of participants

and male-to-female ratio were included; (f) reporting of diagnostic test(s): sufficient or insuf-

ficient; and (g) reporting of reference test: sufficient or insufficient.

In a study of diagnostic accuracy, both the reference test and the diagnostic test(s) should be

described with sufficient detail to allow for replication, validation, and generalization of the study.

Descriptions of the tests were scored as sufficient if clear definitions of positive and negative test

results were mentioned in the text.

Additionally, the following study characteristics were recorded for each article: (a) year of

publication; (b) sample size (number of patients); and (c) mean age of patients.

Examination ResultsThe following imaging techniques were recorded in the assessment of retrieved articles: for EUS,

type of probe and frequency of transducer; for CT, type of contrast material (oral, rectal, or

intravenous), section thickness, and use of spiral mode; and for MRI, magnetic field strength,

sequence, intravenous contrast material (used or not used), and type of coil used (body coil with

or without additional coil [e.g. phased-array or endorectal coil]).

For local staging, 2 x 2 tables were extracted or reconstructed from reported sensitivity and

specificity values and absolute numbers of positive and negative findings as follows:

(a) For invasion of muscularis propria, stage T2 or higher versus stage T1.

(b) For invasion of perirectal tissue, stage T3 or higher versus stage T2 or lower.

(c) For invasion of adjacent organs, stage T4 versus stage T3 or lower.

For invasion of the submucosa, no 2 x 2 tables could be extracted or reconstructed because of

the limited data on negative results (T0 and Tis). We extracted or reconstructed 2 x 2 tables for

lymph node involvement (perirectal, iliac, or mesenteric lymph nodes). Cut-off values for positive

lymph nodes were also extracted.

To avoid selection of data sets, in articles in which investigators tabulated the results for dif-

ferent readers (interobserver), for multiple observations per reader (intraobserver), for multiple

MRI systems, and for multiple MRI sequences, all tabulated results (2 x 2 tables) were considered

separate data sets.

Statistical AnalysisA bivariate random-effects model [24, 25] was used to obtain summary estimates of sensitivity

and specificity and to fit summary receiver operating characteristic (ROC) curves that correspond-

ed to the observed ranges of sensitivity and specificity values.

��

Chapter 4

Bivariate Random-Effects AnalysisIn this model, we assumed that the true values of sensitivity and specificity followed a bivariate

normal distribution around some common mean value of logit-transformed sensitivity (logit-sens)

and logit-transformed specificity (logit-spec) with a variance matrix Σ (σA and σB describe the vari-

ance among studies in logit-sens and logit-spec, respectively, and σAB is the covariance between

logit-sens and logit-spec).

Covariance

Logit-sens and logit-spec were calculated as follows: logit-sens = ln[sens/(1 – sens)] and logit-spec

= ln[spec/(1 – spec)], where ln is the natural logarithm, sens is sensitivity, and spec is specificity.

Because of this transformation of sensitivity and specificity into logit-sens and logit-spec, these

values will be approximately normally distributed with squared standard error 1/[nx(1 – x)], where

n is the number of cases or control subjects and x is sensitivity or specificity.

The random-effects model produces estimates of the mean logit-sens and logit-spec with their

standard errors. Sensitivity and specificity estimates with their 95% confidence intervals (CIs) were

calculated after anti–logit transformation of the mean logit-sens and logit-spec. The random-ef-

fects model also produces the associated variances (σA and σB) and the covariance (σAB).

To display summary ROC curves, we estimated the intercept (α) and slope (β) of the linear

regression line: logit-sens = α + (β*logit-spec) [26]. The slope (β) of this regression line equals the

covariance between logit-sens and logit-spec (σAB) divided by the variance of logit-spec (σB): β =

σAB / σB.

After calculation of the slope, the intercept (α) was calculated by solving the regression equa-

tion between the mean values of logit-sens and logit-spec. After anti–logit transformation of the

regression line, a summary ROC curve was obtained.

Bivariate Analysis with CovariatesTo determine whether results were significantly affected by heterogeneity between individual

studies, the following covariates were added to the bivariate random-effects model for modality:

year of publication (continuous variable: 2000 was set to 0, 1999 to –1, 1998 to –2, 1997 to

–3, etc), sample size (> 50 vs. ≤ 50 patients), interpretation of results (blinded vs. not blinded),

verification (complete vs. partial), patient selection (consecutive vs. non-consecutive), and method

of data collection (prospective vs. retrospective or unknown). Year 2000 was chosen as the refer-

ence year because of the low number of publications after 2000.

We considered variables to be explanatory if their regression coefficients were statistically

significant (P < 0.05). Subsequently, we performed bivariate regression analysis with multiple

covariates for each stage per modality. In this analysis, previously identified explanatory variables

were analyzed with a backward elimination procedure, where the variable with the highest P-

value was excluded first. Variables were considered statistically significant if P < 0.1.

��

Summary ROC CurvesFor each modality, a model was obtained that was adjusted for significant variables that were set

to 1, indicating the ideal design versus 0, as appropriate. The intercept and slope were estimated

for the regression line (logit-sens = α + [β*logit-spec]), and a summary ROC curve was fitted after

anti-logit transformation of this regression line.

The position of the summary ROC curve indicates the difference in diagnostic performance

among the imaging modalities. A summary ROC curve located near the upper left corner indicates

the better diagnostic modality.

Summary Estimates of Sensitivity and SpecificityTo compare the estimates for EUS, CT, and MRI, a final model was obtained that was adjusted for

variables that significantly affected the estimates of the imaging modalities (set to 1, indicating

the ideal design vs. 0), as appropriate.

Since tabulated results for different readers (interobserver), for multiple observations per read-

er (intraobserver), for multiple MRI systems, and for multiple MRI sequences were considered

to be separate data sets, correlations were taken into account. For this approach, the empirical

standard error calculated by means of the “sandwich estimator” was used, which is possible with

the SAS software (version 8.02; SAS Institute, Cary, NC) procedure mixed [27]. This approach

was also used for intramodality intrapatient correlation (in some studies, different modalities were

compared in the same patient population).

Studying the histograms of the residuals and the random-effects estimates confirmed the

goodness of fit of the model. To evaluate the difference between estimates for EUS, CT, and

MRI, we included in our model a factor that indicated type of diagnostic modality; a P value of

less than 0.05 of the regression coefficient of this factor was considered to indicate a significant

difference.

Subgroup AnalysisOnly the groups of studies on MRI and EUS contained sufficient numbers to allow subgroup

analysis of technical differences, although the numbers did not allow an analysis of the effects

of covariates. We compared the following MRI techniques: (a) use of body coil versus surface

and/or phased-array and/or endorectal coil; (b) unenhanced MRI (T1- and T2-weighted imaging)

versus gadolinium-enhanced T1-weighted MRI; and (c) magnetic field strength used (< 1.5 T vs.

≥ 1.5 T). In a subgroup analysis of the EUS techniques, low-frequency probes (< 7.5 MHz) versus

high-frequency probes (≥ 7.5 Mhz) were compared.

All analyses were performed by using Microsoft Excel 2000 (Microsoft, Seattle, Wash), SPSS

10.0 for Windows (SSPS, Chicago, Ill), and SAS statistical software (SAS Institute).

Chapter 4

ResUlTs

Search Results and Study SelectionAfter the computerized search was performed and reference lists were extensively cross-checked,

357 articles were identified. We found 146 articles to be potentially eligible after reading the

abstract, of which 90 fulfilled the criteria for inclusion. Reasons for not including studies were

the following: less than 20 patients studied (n = 31); lack of reference standard (histopathologic

findings obtained at surgery, laparotomy, laparoscopy, or lymph node biopsy) (n = 1); incomplete

or inconclusive data to allow calculation of true-positive, false-positive, true-negative, and false-

negative findings (n = 19); and data reported elsewhere in more detail (n = 5).

Table 1. Study and patient characteristics of included data sets for each cancer stage

Stage No of Data Sets

No of Patients/Prevalence (%)

Years of Publication

References

T2-stage

EUS 39 2881/ 73.1% 1985-2002 16, 17, 20, 28-57

CT 2 65/96.9% 1986,1994 31, 41

MRI 13 630/83.5% 1993-2002 16, 17, 20, 45, 58-65

T3-stage

EUS 61 3904/52.7% 1985-2002 16, 17, 20, 28-41, 44-57, 66-89

CT 18 994/61.1% 1985-2002 18, 19, 31, 42, 66, 67, 72, 74, 75 78, 84, 89-94

MRI 17 746/58.2% 1993-2002 16, 17, 20, 45, 58-65, 82, 83, 95, 96

T4-stage

EUS 37 2686/7.4% 1985-2002 17, 20, 28-41, 44, 47-53, 56, 57, 68, 69, 73, 75, 81, 83, 87, 97

CT 9 397/6.6% 1985-2002 18, 19, 31, 39, 75, 83, 98

MRI 11 537/8.4% 1993-2002 17, 20, 40, 58, 60-65

N-stage

EUS 55 3879/39.9% 1986-2002 16, 33, 35-38, 40, 42, 44-48, 51, 52, 53, 55, 67-69, 71, 72, 74, 76-79, 81, 82, 84-88, 97, 99-107

CT 18 1123/40.8% 1985-2002 18, 19, 67, 72, 74, 78, 83, 84, 89-91, 93, 94, 100, 106, 108

MRI 19 1003/32.5% 1986-2002 16, 20, 45, 58-60, 64, 65, 82, 83, 95, 96, 106, 108-110

��

Data ExtractionFrom the 90 articles included, 299 data sets were retrieved. Most data sets suffered from selec-

tive patient sampling (64%), suboptimal interpretation of results (77%), and poor description of

the reference standard (73%). The other study design characteristics were distributed as follows:

complete verification of results (90%), sufficient description of patient populations (66%), suf-

ficient description of diagnostic tests (89%), and prospective collection of data (50%). For each

cancer stage and imaging modality, included data sets with corresponding numbers of patients,

years of publication, and references are presented in table 1.

A full list of all included articles with all relevant study characteristics and complete examination

results (for each stage and imaging modality) is available from the authors upon request.

Table 2. Predictors identified by means of backward regression analysis for each stage

Variable Regression coefficient P value

Muscularis propria invasion by EUS (39 data sets) Year of publication* Size of patient population (> 50)‡

0.08 (0.004, 0.15)†

1.16 (0.46, 1.85)0.040.001

Perirectal tissue invasion by EUS (61 data sets) Patient selection (consecutive)‡ -0.43 (-0.84, -0.01) 0.04

Perirectal tissue invasion by CT (18 data sets) Year of publication‡ -0.07 (-0.11, -0.04)† < 0.001

Perirectal tissue invasion by MRI (17 data sets) Method of data collection (prospective)‡ 0.92 (0.006, 1.85) 0.05

Adjacent organs invasion by EUS (37 data sets) Year of publication* Size of patient population (> 50)*

Size of patient population (> 50)‡

0.06 (0.02, 0.09)†

0.59 (0.15, 1.03)0.59 (0.08, 1.09)

0.0050.0080.02

Adjacent organs invasion by MRI (11 data sets) Year of publication‡ -0.31(-0.44, -0.17)† < 0.001

Lymph node involvement by EUS (55 data sets) Year of publication‡

Method of data collection (prospective)‡ 0.04 (0.002, 0.08)†

0.48 (0.06, 0.89)0.060.03

Lymph node involvement by CT (18 data sets) Verification of results (complete)‡ -1.81 (-2.12, -1,51) < 0.001

Lymph node involvement by MRI (19 data sets) Year of publication‡

Interpretation of results (blind)‡0.15 (0.02, 0.28)†

0.80 (0.15, 1.45)0.030.01

Note.—Numbers in parentheses are 95% confidence intervals. A positive regression coefficient indicates better discriminatory power of the imaging modality in studies with that characteristic than in studies without the corresponding characteristics. A negative regression coefficient indicates reduced diagnostic performance in studies with that characteristic. * Effect on sensitivity;† Value is change per year (year 2000 was coded as 0, 1999 as –1, 1998 as –2, 1997 as –3, etc); ‡ Effect on specificity.

��

Chapter 4

Data AnalysisAfter backward stepwise regression analysis was performed, several variables were identified as

significant predictors of the diagnostic performance of EUS, CT, and MRI for evaluation of inva-

sion of the muscularis propria, perirectal tissue, and adjacent organs and lymph node involvement

from rectal cancer (table 2).

Summary ROC CurvesWith the regression models, intercepts and slopes of the summary ROC curves for EUS, CT, and

MRI were calculated. Summary ROC curves could be fitted for data on perirectal tissue invasion

and lymph node involvement. For invasion in the muscularis propria and adjacent organs, no

intercepts and slopes could be defined because of the homogeneity of either the sensitivity or

specificity values.

The summary ROC curves for perirectal tissue invasion have different positions in ROC space

(figure 1); indicating differences in diagnostic performance between the imaging modalities, with

the summary ROC curve for EUS located nearest to the upper left corner. The summary ROC

curves for lymph node involvement have similar positioning in ROC space, indicating no differ-

ences in diagnostic performance of the imaging modalities.

0 0,2 0,4 0,6 0,8 1

False positive rate

EUS results

sROC EUS

CT results

sROC CT

MRI results

sROC MRI0

0 0,2 0,4 0,6 0,8 1

False positive rate

EUS results

sROC EUS

CT results

sROC CT

MRI results

sROC MRI

Lymph node involvementPerirectal tissue invasion

Figure 1. Summary ROC curves based on the final regression model for evaluation of perirectal tissue inva-sion and lymph node involvement with EUS, CT, and MRI and results of individual data sets. Left: Summary ROC curves for perirectal tissue invasion included consecutive patient selection for EUS, publication year for CT, and prospective data collection for MRI. Right: Summary ROC curves for lymph node involvement included publication year and prospective data collection for EUS, complete verification of results for CT, and publica-tion year and blinded interpretation of results for MRI. Year 2000 was chosen as the reference year. A summary ROC curve located near the upper left corner indicates better diagnostic modality. Curves for perirectal tis-sue invasion indicate differences in diagnostic performance among imaging modalities, with summary ROC curve for EUS located nearest to the upper left corner. Curves for lymph node involvement indicate no differences in diagnostic performance of imaging modalities.

��

Summary Estimates of Sensitivity and SpecificityMuscularis propria invasion.

In the final model, publication year and sample size (> 50 patients) were included as covariates

for EUS. Because of the few CT data sets available for evaluation, no data analysis was performed

for this aspect. No significant variables were identified for MRI (table 2). EUS and MRI have similar

sensitivity estimates of 94%. The specificity value for EUS (86%) was significantly higher (P = 0.02)

than that for MRI (69%), indicating overstaging of T1 (or lower) tumors with MRI (table 3).

Perirectal tissue invasion

The final model included consecutive patient selection for EUS, publication year for CT and pro-

spective data collection for MRI as covariates. The sensitivity estimates for EUS, CT, and MRI were

90%, 79%, and 82%, respectively, with a significantly higher sensitivity estimate for EUS than for

CT (P < 0.001) and MRI (P = 0.003), indicating understaging of T3 (or higher) tumors with CT

and MRI. The specificity estimates for EUS, CT, and MRI were comparable: 75%, 78%, and 76%,

respectively (table 3).

Adjacent organ invasion

The model included year of publication and sample size (> 50 patients) for EUS and publication

year for MRI as covariates. No significant predictors were found for the diagnostic performance

of CT. Sensitivity estimates of all imaging modalities were comparable: 70% for EUS, 72% for

CT, and 74% for MRI. Specificity estimates were also comparable: 97% for EUS, 96% for CT, and

96% for MRI (table 3).

Lymph node involvement

Year of publication and prospective data collection for EUS, complete verification for CT, year of

publication, and blind interpretation of results for MRI were included as covariates in the final

model. Sensitivity estimates for EUS, CT, and MRI were comparably low: 67%, 55%, and 66%,

respectively. Specificity values were also comparable: 78% for EUS, 74% for CT, and 76% for MRI

(table 3).

In all models in which publication year was included as the covariate, year 2000 was chosen as the

reference year because in some data sets, only one data set was available for publications after

2000. For invasion of muscularis propria and adjacent organs and for lymph node involvement,

more significant variables were found for EUS than for CT and MRI (table 2). Adjustment of the

models for EUS with more variables and the models for CT and MRI with fewer variables could

lead to overestimation of the EUS estimates and underestimation of the CT and MRI estimates.

To evaluate this, models were also studied in which all variables that significantly affected the

estimates of EUS, CT or MRI were included simultaneously. The results of these models were

comparable to the results of the models in which adjustment per modality was performed. To

avoid repetition, these results are not presented.

��

Chapter 4

Subgroup AnalysisSince there was a sufficient number of data sets for perirectal tissue invasion, subgroup analysis

could be performed for this stage.

The results of the subgroup analysis for MRI techniques (use of a body coil vs. a body coil with

an additional coil, unenhanced MRI vs. gadolinium-enhanced MRI, and low vs. high magnetic field

strength) and EUS techniques (low vs. high frequency) are presented in table 4. Summary ROC

curves are presented in figure 2.

No significant differences were found between the techniques, as shown by the summary

estimates and the summary ROC curves.

Table 3. Summary estimates of sensitivity and specificity for EUS, CT, and MRI in the staging of rectal cancer

Stage Imaging Modality Sensitivity (%) Specificity (%)

Muscularis propria invasion EUS 94 (90-97) 86 (80-90)

CT N.A N.A

MRI 94 (89-97) *69 (52-82)*

Perirectal tissue invasion EUS 90 (88-92) 75 (69-81)

CT *79 (74-84)* 78 (73-83)

MRI *82 (74-87)* 76 (65-84)

Adjacent organs invasion EUS 70 (62-77) 97 (96-98)

CT 72 (64-79) 96 (95-97)

MRI 74 (63-83) 96 (95-97)

Lymph node involvement EUS 67 (60-73) 78 (71-84)

CT 55 (43-67) 74 (67-80)

MRI 66 (54-76) 76 (59-87)

Note.—Numbers in parentheses are 95% confidence intervals. EUS = endoluminal US, NA = not applicable. * Significantly lower than EUS.

Table 4. Subgroup analysis on MRI and EUS techniques for perirectal tissue invasion

Imaging Modality and Technique Sensitivity (%) Specificity (%)

MRI with body coil 83 (70-91) 75 (54-88)

MRI with body and additional coil 79 (68-87) 73 (57-84)

MRI without contrast material 80 (61-91) 76 (52-90)

MRI with contrast material 81 (72-87) 71 (59-81)

MRI at < 1.5 T 86 (70-94) 73 (48-89)

MRI at ≥ 1.5 T 80 (70-87) 74 (60-84)

EUS at < 7.5 MHz 91 (85-94) 79 (76-82)

EUS at ≥ 7.5 MHz 89 (85-92) 79 (71-85)

Note.—Numbers in parentheses are 95% confidence intervals

��

MRI coil0

0 0.2 0.4 0.6 0.8 1

False positive rate

Body coil results

sROC: body coil

Body + additional coil results

sROC: body + additional coil

MRI contrast0

0 0.2 0.4 0.6 0.8 1

False positive rate

Without contrast results

sROC: without contrast

With contrast results

sROC: with contrast

MRI magnetic �eld0

0 0.2 0.4 0.6 0.8 1

False positive rate

< 1.5 T results

sROC: < 1.5 T

≥ 1.5 T results

sROC: ≥ 1.5 T

EUS probe frequency0

0 0.2 0.4 0.6 0.8 1

False positive rate

< 7.5 Mhz results

sROC: < 7.5 Mhz

≥ 7.5 Mhz results

sROC: ≥ 7.5 Mhz

False positive rate

Figure 2. Summary ROC curves for different subgroups in the evaluation of perirectal tissue invasion and for results of individual data sets.Upper left: Summary ROC curves for MRI performed with body coil alone versus body coil with additional coil. Upper right: Summary ROC curves for unenhanced versus gadolinium-enhanced MRI. Lower left: Summary ROC curves for low magnetic field strength (< 1.5 T) versus high magnetic field strength (≥ 1.5 T). Lower right: Sum-mary ROC curves for EUS: low frequency (< 7.5 MHz) versus high frequency (≥ 7.5 MHz). Summary ROC curve located near the upper left corner indicates the better diagnostic modality. Summary ROC curves for subgroups showed similar positioning in ROC space, indicating no differences between subgroups.

Chapter 4

DIsCUssIon

In this meta-analysis, we obtained summary estimates and summary ROC curves for the diagnostic

accuracy of EUS, CT, and MRI in the staging of patients with rectal cancer. EUS was found to be

the most accurate modality when compared with CT and MRI for evaluation of local invasion of

rectal cancer.

For lymph node involvement, the results were comparable, with low sensitivity values. EUS was

used to evaluate only perirectal or mesorectal lymph nodes, whereas CT and MRI were also used

to evaluate iliac and mesenteric or retroperitoneal lymph nodes. However, these data were not

presented separately. We found no significant differences between MRI and EUS techniques in a

subgroup analysis.

In our meta-analysis of literature from a 16-year period, we attempted to minimize some of

the well-known limitations of meta-analysis by applying (a) data extraction by two reviewers inde-

pendently, since differences in interpretation and extraction of data can lead to biased results; (b)

explicit inclusion criteria, such as use of histopathologic findings as the reference standard—dif-

ferential verification has been shown to lead to overestimation of results [111]; (c) exclusion of

duplicate publications—positive results are more likely to be published more than once and could

lead to overestimation of results; (d) extraction of study characteristics to study the effects on the

diagnostic performance. Lijmer et al [111] showed that bias in study design characteristics led to

either over- or underestimation of diagnostic performance; and (e) combination of results in a

bivariate random-effects approach to account for variation in results.

The outcomes of the bivariate approach are both (a) summary estimates of sensitivity and

specificity, which are more familiar to clinicians, and (b) a covariance matrix to fit summary ROC

curves. The advantage of this regression analysis over regular summary ROC analysis ln [DOR] =

α+ β*S, where DOR is the diagnostic odds ratio and S is sum [112] (Appendix), is that this model

accounts not only for the heterogeneity between studies due to different threshold settings but

also for the error of estimation of the sensitivity and specificity values in each study. This random

model also accounts for the residual heterogeneity that may remain even after adjusting for study

characteristics and imaging techniques [113].

To avoid missing important articles, additional databases such as EMBASE, Cochrane, and

CANCERLIT were checked. In addition, the reference lists of original articles and reviews, retrieved

by means of electronic search in the MEDLINE database, were checked manually to identify rel-

evant articles. To avoid exclusion of relevant articles, the literature search was performed for the

years 1985–2002. The question arises whether techniques used in the earlier period represent

outdated technology with inferior results. We therefore performed a subgroup analysis for differ-

ent techniques. No significant differences could be observed. We performed covariate adjustment

for publication year. The year of publication had different effects (increased or decreased) on the

diagnostic accuracy. This may be explained not only by technical developments and different tech-

niques employed over the years, but also by variation in training and expertise of the investigators.

Until now, the latter has rarely been reported in the literature.

An important problem in the performance of meta-analysis is the possibility of publication bias.

However, we attempted to study the aspect of publication bias by evaluating whether the size of

studies affected the diagnostic accuracy. In particular, small studies with optimistic results may be

published more easily than small studies with unfavorable results. Larger studies with optimistic

results may also be published more easily than larger studies with unfavorable results, but this

difference will be smaller. The evaluation of the effect of sample size did not show a better diag-

nostic performance of smaller studies compared with larger studies in all data sets. Funnel plot

analysis was not performed, since the limited number of data points for some data sets could

have decreased the power of detecting publication bias. We also did not quantify the number of

unpublished studies, since authors are reluctant to provide information on unpublished studies.

One other limitation is the consideration of 2 x 2 tables for different readers, for multiple

observations per reader, and for multiple MRI sequences as separate data sets. This has been

performed to avoid selection bias. We are aware of the dependency inherent in data sets from

the same patient population. Studying this dependency is not possible with our software, since

SAS procedure mixed (SAS Institute) is only able to adjust for this potential dependency if the

same amounts of data sets are available in each study. We studied this correlation by using the

empirical standard error calculated by means of the “sandwich estimator,” which is possible in

SAS procedure mixed (SAS Institute) [27]. We also used this approach to adjust for correlations

between imaging modalities applied in the same patient population.

Another possible limitation of this meta-analysis is that a multiple backward stepwise regression

analysis was performed with six covariates, and the final model was adjusted for significant vari-

ables. In our study, more data sets were available for EUS than for CT or MRI. For some stages (mus-

cularis propria invasion, adjacent organ invasion, and lymph node involvement), more significant

predictors were found for EUS; this can be explained by the larger number of data sets for EUS. By

adjusting the CT and MRI models with fewer variables, the accuracy of CT and MRI could be under-

estimated. However, we also obtained models for each stage in which we simultaneously adjusted

for all variables that significantly affected the estimates of three imaging modalities per stage.

These results were comparable with the results of the models in which adjustment per modality was

performed and are therefore not presented in this article. Moreover, because of the few MRI stud-

ies suitable for subgroup analysis, the sensitivity and specificity values had wide 95% CIs that over-

lapped completely or partially, indicating no significant differences between several techniques.

Patient characteristics (disease stage, age, or sex distribution) are also important for diagnostic

accuracy, but variation in data presentation made it impossible to study the effect of these vari-

ables. In general, the time interval between performance of diagnostic tests and the reference

test should be short. A longer period between performance of the diagnostic test and the refer-

ence test will lead to a greater change in the disease status and decrease in the discriminatory

power of the diagnostic test. This implies that the comparison should ideally be performed on the

same day. In most of the studies, this time period was not described or was diverse; therefore,

this variable could not be analyzed. A large interval is not likely, however, given the disease under

consideration.

��

Chapter 4

Finally, positron emission tomography with fluorine 18 fluorodeoxyglucose is another imaging

modality that can add incremental information to the preoperative assessment of patients with

rectal cancer. However, data on this issue were limited [114-116] and were therefore not included

in this meta-analysis.

We are aware of one other systematic review on the diagnostic performance of these imaging

modalities in the staging of rectal cancer. Kwok and colleagues [117] reported summary sensitiv-

ity values of 93%, 78%, and 86% with specificity values of 78%, 63%, and 77%, respectively, for

EUS, CT, and MRI in the determination of wall penetration (stage T3). In the assessment of lymph

node involvement, the sensitivity of EUS, CT, and MRI was found to be 71%, 52%, and 65%

with specificity of 76%, 78%, and 80%, respectively. Kwok et al [117] also found EUS to be the

most accurate modality when compared with MRI and CT in the assessment of wall penetration.

However, when evaluating studies of MRI that included use of an endorectal coil, this technique

was found to be as effective as EUS in the assessment of wall penetration and was the most

effective technique in the assessment of nodal involvement. The conclusion was that MRI with

use of an endorectal coil offers the maximum amount of information by a single modality in the

staging of rectal cancer. The latter is not in concordance with our findings, which is most likely

caused by methodologic differences. Their analysis was based on a descriptive analysis by simply

pooling data without accounting for (a) heterogeneity between studies due to different threshold

settings (between-study variation) and (b) errors of estimation of sensitivity and specificity values

in each study (within-study variation). Moreover, no 95% CIs or P values were reported to reflect

the statistical precision of the observed differences.

Although EUS has proved to be a better imaging modality than CT and MRI in the present

study, it has several limitations: operator dependency; limitation to tumors located 8–10 cm from

the anal verge when a rigid probe is used; and no assessment of stenotic tumors. Selection of pa-

tients may therefore lead to biased results. Because of the lack of detailed information on patient

selection, we could not study whether the selection of patients for EUS was different from that

of CT and MRI; however, the prevalences per cancer stage were comparable (table 1).

Moreover, EUS is not able to depict lymph nodes that are outside the range of the transducer

and cannot discriminate between lymph nodes inside or outside the mesorectal fascia, since the

fascia is not identified at EUS. The latter is also of importance in determining the spread of stage

T3 tumors considered for total mesorectal excision. This may explain the more recent widespread

use of MRI, since these limitations do not apply to MRI with external coils. To improve the sensitiv-

ity values of MRI for lymph node detection, newer techniques, such as use of new lymph node-

specific MRI contrast agents (ultrasmall iron-based particles taken up by the lymphatic system),

may provide a more sensitive MRI method to detect lymph node involvement [118-122].

However, both the use of new lymph node-specific MRI contrast agents and the identification of

the mesorectal fascia are still under investigation.

CT has limitations in differentiating and distinguishing the different layers of the rectal wall,

demonstrating the mesorectal fascia, and depicting tumor invasion in surrounding pelvic struc-

tures. The introduction of multisection CT scanners may improve the diagnostic value of this

��

modality, as the assessment of local disease with improved visualization of the mesorectal fascia

can be combined with assessment of liver involvement and lung metastases [123-124].

On the basis of the results of this meta-analysis, EUS seems to be a better diagnostic imaging

test for local staging than are CT and MRI. Because of the limited information on the identification

of the mesorectal fascia with MRI [125-127] and spiral CT, at present, EUS might be helpful in

selecting patients for available therapeutic strategies. The identification of lymph nodes with EUS,

CT, and MRI remains a major point of concern.

Chapter 4

References

1. Harrison JC, Dean PJ, el Zeky F, Vander ZR. From Dukes through Jass: pathological prognostic indicators in rectal cancer. Hum Pathol 1994; 25:498-505.

2. Jass JR, Love SB. Prognostic value of direct spread in Dukes’ C cases of rectal cancer. Dis Colon Rectum 1989; 32:477-480.

3. Tang R, Wang JY, Chen JS, et al. Survival impact of lymph node metastasis in TNM stage III carcinoma of the colon and rectum. J Am Coll Surg 1995; 180:705-712.

4. Willett CG, Badizadegan K, Ancukiewicz M, Shellito PC. Prognostic factors in stage T3N0 rectal cancer: do all patients require postoperative pelvic irradiation and chemotherapy? Dis Colon Rectum 1999; 42:167-173.

5. Wolmark N, Fisher B, Wieand HS. The prognostic value of the modifications of the Dukes’ C class of colorectal cancer. Ann Surg 1986; 203:115-122.

6. Akasu T, Kondo H, Moriya Y, et al. Endorectal ultrasonography and treatment of early stage rectal cancer. World J Surg 2000; 24:1061-1068.

7. Blair S, Ellenhorn JD. Transanal excision for low rectal cancers is curative in early-stage disease with favor-able histology. Am Surg 2000; 66:817-820.

8. Gao JD, Shao YF, Bi JJ, Shi SS, Liang J, Hu YH. Local excision carcinoma in early stage. World J Gastroen-terol 2003; 9:871-873.

9. Langer C, Liersch T, Markus P, et al. Transanal endoscopic microsurgery (TEM) for minimally invasive resec-tion of rectal adenomas and “low-risk” carcinomas (uT1, G1–2). Z Gastroenterol 2002; 40:67-72.

10. Chen ET, Mohiuddin M, Brodovsky H, Fishbein G, Marks G. Downstaging of advanced rectal cancer fol-lowing combined preoperative chemotherapy and high dose radiation. Int J Radiat Oncol Biol Phys 1994; 30:169-175.

11. Kaminsky-Forrett MC, Conroy T, Luporsi E, et al. Prognostic implications of downstaging following pre-operative radiation therapy for operable T3–T4 rectal cancer. Int J Radiat Oncol Biol Phys 1998; 42:935-941.

12. Rau B, Wust P, Hohenberger P, et al. Preoperative hyperthermia combined with radiochemo therapy in locally advanced rectal cancer: a phase II clinical trial. Ann Surg 1998; 227:380-389.

13. Hermanek P, Hermanek P, Hohenberger W, Klimpfinger M, Kockerling F, Papadopoulos T. The pathologi-cal assessment of mesorectal excision: implications for further treatment and quality management. Int J Colorectal Dis 2003; 18:335-341.

14. Nagtegaal ID, van Krieken JH. The role of pathologists in the quality control of diagnosis and treatment of rectal cancer: an overview. Eur J Cancer 2002; 38:964-972.

15. Wiggers T, van de Velde CJ. The circumferential margin in rectal cancer: recommendations based on the Dutch Total Mesorectal Excision Study. Eur J Cancer 2002; 38:973-976.

16. Gualdi GF, Casciani E, Guadalaxara A, d’Orta C, Polettini E, Pappalardo G. Local staging of rectal cancer with transrectal ultrasound and endorectal magnetic resonance imaging: comparison with histologic find-ings. Dis Colon Rectum 2000; 43:338-345.

17. Hunerbein M, Pegios W, Rau B, Vogl TJ, Felix R, Schlag PM. Prospective comparison of endorectal ultra-sound, three-dimensional endorectal ultrasound, and endorectal MRI in the preoperative evaluation of rectal tumors: preliminary results. Surg Endosc 2000; 14:1005-1009.

18. Chiesura-Corona M, Muzzio PC, Giust G, Zuliani M, Pucciarelli S, Toppan P. Rectal cancer: CT local staging with histopathologic correlation. Abdom Imaging 2001; 26:134-138.

19. Matsuoka H, Nakamura A, Masaki T, et al. Preoperative staging by multidetector-row computed tomogra-phy in patients with rectal carcinoma. Am J Surg 2002; 184:131-135.

20. Blomqvist L, Machado M, Rubio C, et al. Rectal tumour staging: MR imaging using pelvic phased-array and endorectal coils vs endoscopic ultrasonography. Eur Radiol 2000; 10:653-660.

21. Kinkel K, Lu Y, Both M, Warren RS, Thoeni RF. Detection of hepatic metastases from cancers of the gastrointestinal tract by using noninvasive imaging methods (US, CT, MR imaging, PET): a meta-analysis. Radiology 2002; 224:748-756.

22. Oei EH, Nikken JJ, Verstijnen AC, Ginai AZ, Myriam Hunink MG. MR imaging of the menisci and cruciate ligaments: a systematic review. Radiology 2003; 226:837-848.

23. Deville WL, Bezemer PD, Bouter LM. Publications on diagnostic test evaluation in family medicine journals: an optimal search strategy. J Clin Epidemiol 2000; 53:65-69.

24. Van Houwelingen HC, Zwinderman KH, Stijnen T. A bivariate approach to meta-analysis. Stat Med 1993; 12:2273-2284.

25. Van Houwelingen HC, Arends LR, Stijnen T. Advanced methods in meta-analysis: multivariate approach and meta-regression. Stat Med 2002; 21:589-624.

26. Rao CR. Linear statistical inference and its applications 2nd ed. New York, NY: Wiley, 1973; 417-420.

27. Liang KY, Zeger SL. Regression analysis for correlated data. Annu Rev Public Health 1993;14:43-68.

28. Hildebrandt U, Feifel G. Preoperative staging of rectal cancer by intrarectal ultrasound. Dis Colon Rectum 1985; 28:42-46.

29. Konishi F, Muto T, Takahashi H, Itoh K, Kanazawa K, Morioka Y. Transrectal ultrasonography for the as-sessment of invasion of rectal carcinoma. Dis Colon Rectum 1985; 28:889-894.

30. Beynon J, Foy DM, Roe AM, Temple LN, Mortensen NJ. Endoluminal ultrasound in the assessment of local invasion in rectal cancer. Br J Surg 1986; 73:474-477.

31. Kramann B, Hildebrandt U. Computed tomography versus endosonography in the staging of rectal carci-noma: a comparative study. Int J Colorectal Dis 1986; 1:216-218.

32. Feifel G, Hildebrandt U, Dhom G. Assessment of depth of invasion in rectal cancer by endosonography. Endoscopy 1987; 19:64-67.

33. Glaser F, Schlag P, Herfarth C. Endorectal ultrasonography for the assessment of invasion of rectal tumours and lymph node involvement. Br J Surg 1990; 77:883-887.

34. Orrom WJ, Wong WD, Rothenberger DA, Jensen LL, Goldberg SM. Endorectal ultrasound in the preopera-tive staging of rectal tumors: a learning experience. Dis Colon Rectum 1990; 33:654-659.

35. Napoleon B, Pujol B, Berger F, Valette PJ, Gerard JP, Souquet JC. Accuracy of endosonography in the stag-ing of rectal cancer treated by radiotherapy. Br J Surg 1991; 78:785-788.

36. Boyce GA, Sivak MV, Jr, Lavery IC, et al. Endoscopic ultrasound in the pre-operative staging of rectal carcinoma. Gastrointest Endosc 1992; 38:468-471.

37. Derksen EJ, Cuesta MA, Meijer S. Intraluminal ultrasound of rectal tumours: a prerequisite in decision making. Surg Oncol 1992; 1:193-198.

38. Herzog U, von Flue M, Tondelli P, Schuppisser JP. How accurate is endorectal ultrasound in the preopera-tive staging of rectal cancer? Dis Colon Rectum 1993; 36:127-134.

39. Houvenaeghel G, Delpero JR, Giovannini M, et al. Staging of rectal cancer: a prospective study of digi-tal examination and endosonography before and after preoperative radiotherapy. Acta Chir Belg 1993; 93:164-168.

40. Hulsmans FJ, Tio TL, Fockens P, Bosma A, Tytgat GN. Assessment of tumor infiltration depth in rectal cancer with transrectal sonography: caution is necessary. Radiology 1994; 190:715-720.

41. Kim NK, Choi JS, Sohn SK, Min JS. Transrectal ultrasonography in preoperative staging of rectal cancer. Yonsei Med J 1994; 35:396-403.

42. Rafaelsen SR, Kronborg O, Fenger C. Digital rectal examination and transrectal ultrasonography in staging of rectal cancer: a prospective, blind study. Acta Radiol 1994; 35:300-304.

43. Adams WJ, Wong WD. Endorectal ultrasonic detection of malignancy within rectal villous lesions. Dis Colon Rectum 1995; 38:1093-1096.

44. Fedyaev EB, Volkova EA, Kuznetsova EE. Transrectal and transvaginal ultrasonography in the preoperative staging of rectal carcinoma. Eur J Radiol 1995; 20:35-38.

Chapter 4

45. Starck M, Bohe M, Fork FT, Lindstrom C, Sjoberg S. Endoluminal ultrasound and low-field magnetic reso-nance imaging are superior to clinical examination in the preoperative staging of rectal cancer. Eur J Surg 1995; 161:841-845.

46. Kaneko K, Boku N, Hosokawa K, et al. Diagnostic utility of endoscopic ultrasonography for preoperative rectal cancer staging estimation. Jpn J Clin Oncol 1996; 26:30-35.

47. Nielsen MB, Qvitzau S, Pedersen JF, Christiansen J. Endosonography for preoperative staging of rectal tumours. Acta Radiol 1996; 37:799-803.

48. Akasu T, Sugihara K, Moriya Y, Fujita S. Limitations and pitfalls of transrectal ultrasonography for staging of rectal cancer. Dis Colon Rectum 1997; 40(suppl 10):S10-S15.

49. Hunerbein M, Schlag PM. Three-dimensional endosonography for staging of rectal cancer. Ann Surg 1997; 225:432-438.

50. Maier AG, Barton PP, Neuhold NR, Herbst F, Teleky BK, Lechner GL. Peritumoral tissue reaction at tran-srectal US as a possible cause of overstaging in rectal cancer: histopathologic correlation. Radiology 1997; 203:785-789.

51. Sailer M, Leppert R, Kraemer M, Fuchs KH, Thiede A. The value of endorectal ultrasound in the assessment of adenomas, T1- and T2-carcinomas. Int J Colorectal Dis 1997; 12:214-219.

52. Massari M, De Simone M, Cioffi U, Rosso L, Chiarelli M, Gabrielli F. Value and limits of endorectal ultraso-nography for preoperative staging of rectal carcinoma. Surg Laparosc Endosc 1998; 8:438-444.

53. Akahoshi K, Kondoh A, Nagaie T, et al. Preoperative staging of rectal cancer using a 7.5 MHz front-loading US probe. Gastrointest Endosc 2000; 52:529-534.

54. Carmody BJ, Otchy DP. Learning curve of transrectal ultrasound. Dis Colon Rectum 2000; 43:193-197.

55. Kim JC, Cho YK, Kim SY, Park SK, Lee MG. Comparative study of three-dimensional and conventional endorectal ultrasonography used in rectal cancer staging. Surg Endosc 2002; 16:1280- 1285.

56. Steele SR, Martin MJ, Place RJ. Flexible endorectal ultrasound for predicting pathologic stage of rectal cancers. Am J Surg 2002; 184:126-130.

57. Garcia-Aguilar J, Pollack J, Lee SH, et al. Accuracy of endorectal ultrasonography in preoperative staging of rectal tumors. Dis Colon Rectum 2002; 45:10-15.

58. Okizuka H, Sugimura K, Ishida T. Preoperative local staging of rectal carcinoma with MR imaging and a rectal balloon. J Magn Reson Imaging 1993; 3:329-335.

59. Schnall MD, Furth EE, Rosato EF, Kressel HY. Rectal tumor stage: correlation of endorectal MR imaging and pathologic findings. Radiology 1994; 190:709-714.

60. Okizuka H, Sugimura K, Yoshizako T, Kaji Y, Wada A. Rectal carcinoma: prospective comparison of con-ventional and gadopentetate dimeglumine enhanced fat-suppressed MR imaging. J Magn Reson Imaging 1996; 6:465-471.

61. Pegios W, Vogl J, Mack MG, et al. MRI diagnosis and staging of rectal carcinoma. Abdom Imaging 1996; 21:211-218.

62. Hadfield MB, Nicholson AA, MacDonald AW, et al. Preoperative staging of rectal carcinoma by magnetic resonance imaging with a pelvic phased-array coil. Br J Surg 1997; 84:529-531.

63. Vogl TJ, Pegios W, Mack MG, et al. Accuracy of staging rectal tumors with contrast-enhanced transrectal MR imaging. AJR Am J Roentgenol 1997; 168:1427-1434.

64. Drew PJ, Farouk R, Turnbull LW, Ward SC, Hartley JE, Monson JR. Preoperative magnetic resonance staging of rectal cancer with an endorectal coil and dynamic gadolinium enhancement. Br J Surg 1999; 86:250-254.

65. Kim NK, Kim MJ, Park JK, Park SI, Min JS. Preoperative staging of rectal cancer with MRI: accuracy and clinical usefulness. Ann Surg Oncol 2000; 7:732-737.

66. Beynon J, Mortensen NJ, Foy DM, Channer JL, Virjee J, Goddard P. Pre-operative assessment of local inva-sion in rectal cancer: digital examination, endoluminal sonography or computed tomography? Br J Surg 1986; 73:1015-1017.

67. Rifkin MD, Wechsler RJ. A comparison of computed tomography and endorectal ultrasound in staging rectal cancer. Int J Colorectal Dis 1986; 1:219-223.

68. Saitoh N, Okui K, Sarashina H, Suzuki M, Arai T, Nunomura M. Evaluation of echographic diagnosis of rectal cancer using intrarectal ultrasonic examination. Dis Colon Rectum 1986; 29:234-242.

69. Alessi G, Giuliani A, Risi D, Meli C, Schillaci A, Stipa S. Intrarectal ultrasonography in the preoperative stag-ing of rectal cancer. Ital J Surg Sci 1987; 17:239-244.

70. Beynon J, Roe AM, Foy DM, Channer JL, Virjee J, Mortensen NJ. Preoperative staging of local invasion in rectal cancer using endoluminal ultrasound. J R Soc Med 1987; 80:23-24.

71. Di Candio G, Mosca F, Campatelli A, Cei A, Ferrari M, Basolo F. Endosonographic staging of rectal carci-noma. Gastrointest Radiol 1987; 12:289-295.

72. Holdsworth PJ, Johnston D, Chalmers AG, et al. Endoluminal ultrasound and computed tomography in the staging of rectal cancer. Br J Surg 1988; 75:1019-1022.

73. Salo JO, Mecklin JP, Jarvinen H, Lehtonen T. Endo-ultrasonography for preoperative staging of rectal cancer. Ann Chir Gynaecol 1988; 77:57-60.

74. Rifkin MD, Ehrlich SM, Marks G. Staging of rectal carcinoma: prospective comparison of endorectal US and CT. Radiology 1989; 170:319-322.

75. Rotte KH, Kluhs L, Kleinau H, Kriedemann E. Computed tomography and endosonography in the preopera-tive staging of rectal carcinoma. Eur J Radiol 1989; 9:187-190.

76. Dershaw DD, Enker WE, Cohen AM, Sigurdson ER. Transrectal ultrasonography of rectal carcinoma. Can-cer 1990; 66:2336-2340.

77. Jochem RJ, Reading CC, Dozois RR, Carpenter HA, Wolff BG, Charboneau JW. Endorectal ultrasonographic staging of rectal carcinoma. Mayo Clin Proc 1990; 65:1571-1577.

78. Goldman S, Arvidsson H, Norming U, Lagerstedt U, Magnusson I, Frisell J. Transrectal ultrasound and computed tomography in preoperative staging of lower rectal adenocarcinoma. Gastrointest Radiol 1991; 16:259-263.

79. Lindmark G, Elvin A, Pahlman L, Glimelius B. The value of endosonography in preoperative staging of rectal cancer. Int J Colorectal Dis 1992; 7:162-166.

80. Scialpi M, Andreatta R, Agugiaro S, Zottele F, Niccolini M, Dalla PF. Rectal carcinoma: preoperative stag-ing and detection of postoperative local recurrence with transrectal and transvaginal ultrasound. Abdom Imaging 1993; 18:381-389.

81. Nielsen MB, Pedersen JF, Christiansen J. Rectal endosonography in the evaluation of stenotic rectal tumors. Dis Colon Rectum 1993; 36:275-279.

82. Thaler W, Watzka S, Martin F, et al. Preoperative staging of rectal cancer by endoluminal ultrasound vs. magnetic resonance imaging: preliminary results of a prospective, comparative study. Dis Colon Rectum 1994; 37:1189-1193.

83. Barbaro B, Valentini V, Manfredi R. Combined modality staging of high risk rectal cancer. Rays 1995; 20:165-181.

84. Osti MF, Padovan FS, Pirolli C, et al. Comparison between transrectal ultrasonography and computed tomography with rectal inflation of gas in preoperative staging of lower rectal cancer. Eur Radiol 1997; 7:26-30.

85. Heneghan JP, Salem RR, Lange RC, Taylor KJ, Hammers LW. Transrectal sonography in staging rectal carcinoma: the role of gray-scale, color-flow, and Doppler imaging analysis. AJR Am J Roentgenol 1997; 169:1247-1252.

86. Lindmark GE, Kraaz WG, Elvin PA, Glimelius BL. Rectal cancer: evaluation of staging with endosonography. Radiology 1997; 204:533-538.

87. Barbaro B, Schulsinger A, Valentini V, Marano P, Rotman M. The accuracy of transrectal ultrasound in predicting the pathological stage of low-lying rectal cancer after preoperative chemoradiation therapy. Int J Radiat Oncol Biol Phys 1999; 43:1043-1047.

Chapter 4

88. Adams DR, Blatchford GJ, Lin KM, Ternent CA, Thorson AG, Christensen MA. Use of preoperative ultra-sound staging for treatment of rectal cancer. Dis Colon Rectum 1999; 42:159-166.

89. Harewood GC, Wiersema MJ, Nelson H, et al. A prospective, blinded assessment of the impact of preop-erative staging on the management of rectal cancer. Gastroenterology 2002; 123:24-32.

90. Netri G, Coco C, Valentini V, et al. Clinical staging of rectal cancer: results of a prospective continuing study. Ital J Surg Sci 1985; 15:169-174.

91. Thompson WM, Halvorsen RA, Foster WL, Roberts L, Gibbons R. Preoperative and postoperative CT stag-ing of rectosigmoid carcinoma. AJR Am J Roentgenol 1986; 146:703-710.

92. Cellini N, Coco C, Maresca G, et al. Clinical staging of rectal cancer: a study on 126 patients. Rays 1986; 11:69-79.

93. Angelelli G, Macarini L, Lupo L, Caputi-Jambrenghi O, Pannarale O, Memeo V. Rectal carcinoma: CT stag-ing with water as contrast medium. Radiology 1990; 177:511-514.

94. Civelli EM, Gallino G, Mariani L, et al. Double-contrast barium enema and computerised tomography in the pre-operative evaluation of rectal carcinoma: are they still useful diagnostic procedures? Tumori 2000; 86:389-392.

95. Gagliardi G, Bayar S, Smith R, Salem RR. Preoperative staging of rectal cancer using magnetic resonance imaging with external phase-arrayed coils. Arch Surg 2002; 137:447-451.

96. Murano A, Sasaki F, Kido C, et al. Endoscopic MRI using 3D-spoiled GRASS (SPGR) sequence for local staging of rectal carcinoma. J Comput Assist Tomogr 1995; 19:586-591.

97. Scialpi M, Rotondo A, Angelelli G. Water enema transvaginal ultrasound for local staging of stenotic rectal carcinoma. Abdom Imaging 1999; 24:132-136.

98. Lupo L, Angelelli G, Pannarale O, Altomare D, Macarini L, Memeo V. Improved accuracy of computed tomography in local staging of rectal cancer using water enema. Int J Colorectal Dis 1996; 11:60-64.

99. Zainea GG, Lee F, McLeary RD, Siders DB, Thieme ET. Transrectal ultrasonography in the evaluation of rectal and extrarectal disease. Surg Gynecol Obstet 1989; 169:153-156.

100. Beynon J, Mortensen NJ, Foy DM, Channer JL, Rigby H, Virjee J. Preoperative assessment of mesorectal lymph node involvement in rectal cancer. Br J Surg 1989; 76:276-279.

101. Hildebrandt U, Klein T, Feifel G, Schwarz HP, Koch B, Schmitt RM. Endosonography of pararectal lymph nodes: in vitro and in vivo evaluation. Dis Colon Rectum 1990; 33:863-868.

102. Milsom JW, Graffner H. Intrarectal ultrasonography in rectal cancer staging and in the evaluation of pelvic disease: clinical uses of intrarectal ultrasound. Ann Surg 1990; 212:602-606.

103. Konishi F, Ugajin H, Ito K, Kanazawa K. Endorectal ultrasonography with a 7.5 MHz linear array scanner for the assessment of invasion of rectal carcinoma. Int J Colorectal Dis 1990; 5:15-20.

104. Milsom JW, Lavery IC, Stolfi VM, et al. The expanding utility of endoluminal ultrasonography in the man-agement of rectal cancer. Surgery 1992; 112:832- 840.

105. Spinelli P, Schiavo M, Meroni E, et al. Results of EUS in detecting perirectal lymph node metastases of rectal cancer: the pathologist makes the difference. Gastrointest Endosc 1999; 49:754-758.

106. Kim NK, Kim MJ, Yun SH, Sohn SK, Min JS. Comparative study of transrectal ultrasonography, pelvic computerized tomography, and magnetic resonance imaging in preoperative staging of rectal cancer. Dis Colon Rectum 1999; 42:770-775.

107. Tada M, Endo M. Ultrasonographic examination for lateral lymphatic spread and local recurrence of rectal cancer: preoperative detection and evaluation. Dis Colon Rectum 1995; 38:1047-1052.

108. Hodgman CG, MacCarty RL, Wolff BG, et al. Preoperative staging of rectal carcinoma by computed tomog-raphy and 0.15T magnetic resonance imaging. Preliminary report. Dis Colon Rectum 1986; 29:446-450.

109. de Lange EE, Fechner RE, Edge SB, Spaulding CA. Preoperative staging of rectal carcinoma with MR imag-ing: surgical and histopathologic correlation. Radiology 1990; 176:623-628.

110. McNicholas MM, Joyce WP, Dolan J, Gibney RG, MacErlaine DP, Hyland J. Magnetic resonance imaging of rectal carcinoma: a prospective study. Br J Surg 1994; 81:911-914.

111. Lijmer JG, Mol BW, Heisterkamp S, et al. Empirical evidence of design-related bias in studies of diagnostic tests. JAMA 1999; 282:1061-1066.

112. Littenberg B, Moses LE. Estimating diagnostic accuracy from multiple conflicting reports: a new meta-analytic method. Med Decis Making 1993; 13:313-321.

113. Rutter CM, Gatsonis CA. A hierarchical regression approach to meta-analysis of diagnostic test accuracy evaluations. Stat Med 2001; 20:2865-2884.

114. Abdel-Nabi H, Doerr RJ, Lamonica DM, et al. Staging of primary colorectal carcinomas with fluorine-18 fluorodeoxyglucose whole-body PET: correlation with histopathologic and CT findings. Radiology 1998; 206:755-760.

115. Falk PM, Gupta NC, Thorson AG, et al. Positron emission tomography for preoperative staging of colorec-tal carcinoma. Dis Colon Rectum 1994; 37:153-156.

116. Gupta NC, Falk PM, Frank AL, Thorson AM, Frick MP, Bowman B. Pre-operative staging of colorectal carcinoma using positron emission tomography. Nebr Med J 1993; 78:30-35.

117. Kwok H, Bissett IP, Hill GL. Preoperative staging of rectal cancer. Int J Colorectal Dis 2000; 15:9-20.

118. Bellin MF, Lebleu L, Meric JB. Evaluation of retroperitoneal and pelvic lymph node metastases with MRI and MR lymphangiography. Abdom Imaging 2003; 28:155-163.

119. Bellin MF, Roy C, Kinkel K, et al. Lymph node metastases: safety and effectiveness of MR imaging with ul-trasmall superparamagnetic iron oxide particles—initial clinical experience. Radiology 1998; 207:799-808.

120. Harisinghani MG, Saini S, Weissleder R, et al. MR lymphangiography using ultrasmall superparamagnetic iron oxide in patients with primary abdominal and pelvic malignancies: radiographic-pathologic correla-tion. AJR Am J Roentgenol 1999; 172:1347-1351.

121. Hudgins PA, Anzai Y, Morris MR, Lucas MA. Ferumoxtran-10, a superparamagnetic iron oxide as a magnet-ic resonance enhancement agent for imaging lymph nodes: a phase 2 dose study. AJNR Am J Neuroradiol 2002; 23:649-656.

122. McCauley TR, Rifkin MD, Ledet CA. Pelvic lymph node visualization with MR imaging using local administra-tion of ultra-small superparamagnetic iron oxide contrast. J Magn Reson Imaging 2002; 15:492-497.

123. Laghi A, Iannaccone R, Trenna S, et al. Multislice spiral CT colonography in the evaluation of colorectal neoplasms. Radiol Med (Torino) 2002; 104:394-403.

124. Vogl TJ, Luboldt W, Herzog C, Hammerstingl R. Contrast-enhanced multislice CT in detection and evalua-tion of abdominal neoplasms. Radiologe 2002; 42:646-654. [German].

125. Beets-Tan RG, Beets GL, Vliegen RF, et al. Accuracy of magnetic resonance imaging in prediction of tumour-free resection margin in rectal cancer surgery. Lancet 2001; 357:497-504.

126. Bissett IP, Fernando CC, Hough DM, et al. Identification of the fascia propria by magnetic resonance imag-ing and its relevance to preoperative assessment of rectal cancer. Dis Colon Rectum 2001; 44:259-265.

127. Brown G, Richards CJ, Newcombe RG, et al. Rectal carcinoma: thin-section MR imaging for staging in 28 patients. Radiology 1999; 211:215-222.

Chapter 4

aPPenDIX

Littenberg and Moses [112] suggested that D = [logit (TPR) – logit (FPR)] is modulated as a linear

function of S = [logit (TPR) + logit (FPR)], where D is difference, FPR is false-positive rate (1 – speci-

ficity), TPR is true-positive rate (sensitivity), and S is sum.

Thus, they used the linear model D = α + β*S. Suppose, in a 2 x 2 table, n1 is the number of

patients with disease and n2 is the number of patients without disease.

D = [logit (TPR) – logit (FPR)]

= ln [TPR/(1-TPR)] – ln [FPR/(1-FPR)]

= ln (TP/n1)/(FN/ n1) - ln (FP/n2)/(TN/ n2)

= ln (TP/(FN) – ln (FP/TN)

= ln [(TP/(FN)/ (FP/TN)]

= ln [(TP*TN/ (FN*FP)]

= ln (odds ratio)

= ln (DOR)

where DOR is the diagnostic odds ratio, logit x = ln[x/(1 – x)], x is true-positive rate or false-posi-

tive rate, and

S= logit (TPR) + logit (FPR)

5C h a p t e r

Ultrasonography, computed tomography and magnetic resonance imaging for diagnosis and determining resectability of pancreatic adenocarcinoma: a meta-analysis

Shandra BipatSaffire S. K. S. PhoaOtto M. van DeldenPatrick M.M. BossuytDirk J. GoumaJohan S LamérisJaap Stoker

Journal of Computer Assisted Tomography 2005;29:438-445

Chapter 5

absTRaCT

Objective: To compare ultrasonography (US), computed tomography (CT), and magnetic reso-

nance imaging (MRI) in the diagnosis and determination of resectability of pancreatic adeno-

carcinoma.

Methods: Articles reporting US, CT, or MRI data of patients with known or suspected pancreatic

adenocarcinoma and at least 20 patients verified with histopathology, surgical findings, or

follow-up were included. A bivariate random-effects approach was used to calculate sensitivity

and specificity for diagnosis and resectability of pancreatic adenocarcinoma.

Results: Sixty-eight articles fulfilled all inclusion criteria. For diagnosis, sensitivities of helical CT,

conventional CT, MRI, and US were 91%, 86%, 84%, and 76% and specificities were 85%,

79%, 82%, and 75% respectively. Sensitivities for MRI and US were significantly lower com-

pared with helical CT (P = 0.04 and P = 0.0001). For determining resectability, sensitivities of

helical CT, conventional CT, MRI, and US were 81%, 82%, 82%, and 83% and specificities

were 82%, 76%, 78%, and 63% respectively. Specificity of US was significantly lower com-

pared with helical CT (P = 0.011).

Conclusion: Helical CT is preferable as an imaging modality for the diagnosis, and determination

of resectability of pancreatic adenocarcinoma.

Us, CT and MRI for diagnosis and staging pancreatic adenocarcinoma

InTRoDUCTIon

The worldwide incidence of pancreatic adenocarcinoma was estimated at 216,000 in the year

2000 [1]. This disease has a poor prognosis, and resection of the tumor is the only chance of cure.

Preoperative staging should select patients with resectable tumors to undergo surgery, improv-

ing their chances of survival, and identify patients with non-resectable tumors, thus preventing

unnecessary laparotomy.

Imaging modalities used in the diagnosis and determination of resectability include ultrasonog-

raphy (US), computed tomography (CT), and magnetic resonance imaging (MRI). In some institu-

tions US is initially performed to identify patients with non-resectable tumors, mainly based on the

presence of liver metastases [2]. Helical CT is often the imaging modality of choice, whereas MRI

has been introduced more recently in the workup of patients with pancreatic adenocarcinoma.

Considerable attention has been paid to the diagnostic accuracy of these modalities, [2-7] yet

studies comparing US, CT, and MRI in the same study group are scarce; this can be attributed to

the associated costs, logistics, and patient burden. A number of narrative reviews on the radio-

logic diagnosis and staging of pancreatic adenocarcinoma have been presented [8, 9].

We performed a meta-analysis on the literature to compare US, CT, and MRI. Through a critical

appraisal of the validity of published studies and a synthesis of their results, we obtained sum-

mary estimates of helical CT, conventional CT, MRI, and US for the diagnosis and determination

of resectability of pancreatic adenocarcinoma. We also explored sources of heterogeneity in the

published research.

Literature SearchA comprehensive computer search [10] was performed by one reviewer to retrieve all English

and German papers published between January 1990 and December 2003 that reported on the

diagnostic accuracy of US, CT, or MRI in the diagnosis and evaluation of resectability of pancreatic

adenocarcinoma.

The following databases were used: MEDLINE, EMBASE, Cochrane, and CANCERLIT. The

search terms were pancreatic neoplasms AND [magnetic resonance imaging OR nuclear mag-

netic resonance OR tomography, x-ray computed OR ultrasonography] as MESH headings AND

[specificity OR false negative OR accuracy] as text words.

Reviews articles, letters, comments, case reports, unpublished articles, and articles not pre-

senting raw data were not retrieved. Additional papers were identified by manually checking the

reference lists of review articles and of the original articles retrieved by the computer search.

Study SelectionAll retrieved articles were selected by two independent reviewers using the following inclusion

criteria: (1) data reported on US or CT or MRI; (2) patients included with known or suspected

Chapter 5

pancreatic adenocarcinoma (based on specific clinical symptoms, such as laboratory results or ul-

trasound findings; (3) minimal sample size of 20 patients verified with reference standard; (4) one

or more of the following reference standards-for diagnosis, histopathology obtained by surgery/

autopsy, laparotomy/ultrasound-guided biopsy, or follow-up in patients lacking histology, and for

determining resectability, surgical findings (histopathology/intraoperative findings), autopsy, and

follow-up; and (5) sufficient data presented to construct a 2 × 2 cross-classification of the imaging

technique results and results of the reference standard for the detection of pancreatic carcinoma

and for determining resectability.

When data from the same study group were reported more than once, only the most recent

article or the article reporting a higher level of data was included.

Data ExtractionTwo observers independently extracted data from each article using a standardized form, based

on the STARD criteria [11]. Observers were not blinded with regard to information about the au-

thors, their affiliations, or the journal name, since this has been shown to be unnecessary [12]. To

resolve disagreement between the two reviewers, a third reviewer assessed all discrepant items.

Study Design Characteristics

The following study design characteristics were extracted: (a) patient selection (consecutive or

non-consecutive); (b) interpretation of test results (blinded or not blinded); (c) verification (com-

plete, partial, or unknown; papers in which more than 90% of the subgroup was subjected to the

reference standard were scored as complete verification; papers in which more than 10% of the

study group was not subjected to the reference test [histopathology, follow-up, intraoperative

findings] were scored as partial verification; and all other cases were scored as unknown); (d)

methods of data collection (prospective, retrospective, or unknown in cases of doubt); (e) report-

ing of study population (sufficient or insufficient; sufficient if all of the following characteristics

were described: spectrum of disease [e.g. clinical symptoms], age of patients, and sex distribu-

tion); (f) reporting of reference test (complete, partial, or no description; papers in which the

reference test was described in detail were scored as complete description; papers in which the

reference test was mentioned but not described in detail were scored as partial description; and

all other cases were scored as no description); and (g) reporting of diagnostic tests (sufficient or

insufficient; the diagnostic test should be described with sufficient detail to allow for replication,

validation, and generalization of the study; for US, type of probe and frequency of transducer;

for CT, type of scanning [helical or multiphase helical CT or conventional], slice thickness or col-

limation, and use of contrast; for MRI, magnetic field, type of coil, sequences, slice thickness, and

use of contrast).

Other Study Characteristics

The following characteristics were also recorded: (a) year of publication; (b) number of patients;

(c) patient characteristics such as clinical symptoms, mean age, sex distribution; and (d) depart-

ment of origin (e.g. radiology, gastroenterology, surgery, internal medicine).

Examination Characteristics

We extracted data on the frequency of transducers and whether color/power Doppler was per-

formed in US imaging, the type of scanning (helical or multiphase helical CT or conventional), slice

thickness or collimation and amount of contrast for CT imaging, and the magnetic strength, type

of coil, sequences, slice thickness, and type of contrast for MRI.

Vascular invasion of the portal system (portal vein and superior mesenteric vein) is an impor-

tant determinant for non-resectability. As the CT criteria for invasion of the portal system may

differ between centers, we also extracted those from the papers.

Examination Results

The absolute numbers of true positives, false negatives, false positives, and true negatives for the

diagnosis and assessment of resectability of pancreatic carcinoma per modality were extracted

or calculated from the reported values of sensitivity and specificity. To avoid selection bias, all

tabulated results for different readers (interobserver), for multiple observations per readers (intra-

observer), for multiple MRI systems, and for multiple MRI sequences were counted as separate

data sets.

Statistical AnalysisA bivariate random-effects approach [13, 14] was used to analyze the data. In this model, we

used logit-transformed sensitivity (logit-sens) and logit-transformed specificity (logit-spec). Logit-

sens and logit-spec were calculated as follows: logit-sens = ln (sens/[1-sens]) and logit-spec = ln

(spec/[1-spec]), where ln is the natural logarithm.

Due to the transformation, these values were approximately normally distributed, and mean

logit-sens and logit-spec and their standard errors were obtained by means of a bivariate ran-

dom-effects linear regression model using the SAS proc mixed effects approach. Sensitivity and

specificity estimates with their 95% confidence intervals were back-transformed after anti-logit

transformation of the mean logit-sens and logit-spec values.

Summary Estimates of Sensitivity and SpecificityTo compare the estimates for helical CT, conventional CT, MRI, and US, we first determined

whether the logit-sens and logit-spec values depended on year of publication (continuous vari-

able: 2002 was set to 0, 2001 to -1, 2000 to -2, and so forth), sample size (≤ 50 vs. > 50), de-

partment of origin (radiology/nuclear medicine vs. others), and the study design characteristics:

patient selection (consecutive vs. non-consecutive), interpretation of test results (blinded vs. not

blinded), verification (complete verification vs. partial verification and unknown), methods of data

collection (prospective vs. retrospective and unknown), reporting of study population (sufficient

vs. insufficient), reporting of reference test (sufficient vs. insufficient and no description), and

reporting of diagnostic tests (sufficient vs. insufficient).

The year 2002 was chosen as the reference year due to the low number of publications later

than 2002. In this analysis, we considered variables as explanatory if their regression coefficients

were significant (P < 0.05).

Chapter 5

Subsequently, we developed a multivariable regression model where we used a backward step-

wise algorithm to identify only the most important characteristics. Characteristics were retained

in the regression model when their P value was < 0.10.

Afterward, logit-sens and logit-spec values of the imaging techniques helical CT, conventional CT,

MRI, and US were compared with each other in this random-effects regression model, including

all variables that significantly affected the logit-sens and logit-spec values of the imaging modali-

ties as appropriate; in this final model a factor indicating the type of diagnostic modality was

included and a P value of < 0.05 of the regression coefficient of this factor was considered to

indicate a significant difference.

The fit of the final regression model was inspected graphically by studying the histograms of

the residuals and of the random-effects estimates.

CT Criteria for Invasion of Portal Vein SystemLiver metastasis, lymph nodes, vascular invasion, and extrapancreatic invasion of adjacent organs

are the important factors for excluding patients from respectability [15-18]. Invasion of the portal

system (portal vein and superior mesenteric vein) is the major factor, and different criteria exist

for invasion of the portal system.

As helical CT is the imaging modality most frequently used for the workup of patients with

pancreatic carcinoma, enough data sets on resectability were available to study whether using

encasement of more than 180 degrees, occlusion, and thrombosis as CT criteria for invasion of

the portal system showed improvements.

SubgroupsOther CT techniques such as the number of phases (eg, pancreatic phase vs. other multiphase

helical CT), slice thickness or collimation, and the use of contrast agent could not be studied due

to the heterogeneity of the studies. Also US technique (color Doppler) and MRI technical factors

such as magnetic strength, type of coil, sequence, slice thickness, matrix, and field of view could

not be studied due to the low number of data sets.

As tabulated results for different readers (interobserver), for multiple observations per readers

(intra-observer), for multiple MR systems, and for multiple MRI sequences were considered as

separate data sets, the respective correlations were taken into account. For this approach, the

empirical standard error calculated by “sandwich estimator” was used, which is possible in the

SAS proc mixed [19]. This approach was also used for intra-modality intra-patient correlation (in

some studies, different modalities were compared in the same patient population). All analyses

were performed using SPSS 11.5 for Windows (SPSS Inc, Chicago, IL) and SAS statistical software

version 8.2 (SAS institute Inc, Cary, NC).

ResUlTs

Search Results and Study SelectionWith the computer search and after extensive cross-checking of reference lists, 342 articles were

identified. We found 193 articles to be potentially eligible after reading the titles and abstracts.

One hundred twenty-five articles had to be excluded because they did not report data on US,

CT or MRI (n = 49); did not include patients with known or suspected pancreatic adenocarcinoma

but patients with chronic pancreatitis, the evaluation of the duct system, or other types of pan-

creatic cancer (n = 4); evaluated fewer than 20 patients (n = 13); compared the results of US,

CT, or MRI with angiography, positron emission tomography (PET), or other imaging findings (n

= 6); did not report sufficient data on positives (true/false) and negatives (true/false) on either

diagnosis or resectability (n = 50); or reported results of overlapping patients in other languages

(n = 3). In total, 68 articles could be included in this meta-analysis [15-18, 20-83].

Data ExtractionStudy Design Characteristics

As presented in table 1, most articles suffered from bias in study design characteristics: for in-

stance, patient selection was non-consecutive in 39 articles (57%); interpretation of test results

was not blinded in 38 (56%); the method of data collection was retrospective or unknown in 35

(51%); and there was insufficient or no description of the reference test in 42 (62%).

Table 1. Study design characteristics of included studies

Study characteristics Score No. articles

Patient selection Consecutive

Non-consecutive

Interpretation of test results Blinded

Not Blinded

Verification Complete

Partial

Unknown

Methods of data collection Prospective

Retrospective

Unknown

Reporting of study population Sufficient

Insufficient

Reporting of reference test Complete

Partial

No description

Chapter 5

Other Study Characteristics

In the 68 articles, a total of 7,405 patients were evaluated with a range of median age of 39.0

to 74.0 years. In 53 articles, the male-to-female ratio was presented (3279:2478). Forty studies

were performed by departments of radiology or nuclear medicine and 28 by departments of

gastroenterology, surgery, or internal medicine.

Examination Characteristics

US was evaluated in 17 articles, with sufficient description in 10; frequencies of the transducer

ranged from 2 to 5 MHz and color/power Doppler was used in 4 studies. CT was evaluated in

55 articles, with sufficient description in 37; helical CT was used in 27 articles and slice thickness

or collimation ranged from 3 to 10 mm (depending on phases), and contrast was used in all 37

studies. MRI was evaluated in 15 articles, with sufficient description in 14; 1.5 T was used in 11

studies, 1.0 T in 4 studies, and a phased array coil in 5 studies.

Table 2. Study and patient characteristics of included data sets

Stage Data sets* Patients† Age‡ Imaging§ Department# References

Diagnosis

Helical CT 23 959 55.0-64.0 23 21 15-18, 20-29

Conventional CT 20 1473 39.0-71.0 7 9 30-46

MRI 11 583 55.1-74.0 11 7 23, 25, 31, 44, 45, 47-51

US 14 2909 39.0-64.0 7 4 30, 34-37, 39, 42 46, 52-55

Resectability

Helical CT 32 1823 54.0-66.4 29 29 15, 17, 18, 22, 25-27, 29, 56-69

Conventional CT 12 1467 57.7-67.0 5 5 35, 40, 70-79

MRI 7 516 58.0-74.0 7 4 25, 50, 51, 62, 76, 80, 81

US 6 1233 57.7-64.7 4 2 35, 58, 69, 71, 82, 83

* Number of data sets per imaging modality.† Total number of patients included in the corresponding data sets.‡ Range of the median age in data sets.§ Sufficient description of the imaging modality.# Data sets available from studies performed by radiology or nuclear medicine department.

Examination Results

From the 68 articles, summary data for diagnosis (23 by helical CT, 20 by conventional CT, 11 by

MRI, and 14 by US) and for resectability (32 by helical CT, 12 by conventional CT, 7 by MRI, and

6 by US) were retrieved. Table 2 presents an overview of the data sets included for diagnosis and

resectability for each imaging modality, with respective numbers of patients, their median age,

and the corresponding references and departments of origin. Figure 1 presents an overview of

sensitivity and 1—specificity values of the data sets included for diagnosis and resectability for

each imaging modality.

A complete list of all included articles with study characteristics and patient characteristics,

examination characteristics (US, CT, and MRI techniques), and examination results (absolute num-

bers of true positives, true negatives, false positives, and false negatives) is available on request

from the authors.

Data AnalysisFollowing the backward stepwise regression analysis, several variables were identified as signifi-

cant predictors of the diagnostic performance of helical CT, conventional CT, MRI, and US (table

3). Due to the low number of data sets for MRI and US in determining resectability, no covariates

were added to the models.

Summary Estimates of Sensitivity and SpecificityDiagnosis

In the final model, sufficient description of patients was included as a covariate for helical CT,

blind interpretation of results for conventional CT, and sufficient description of patient population

0,0 0,2 0,4 0,6 0,8 1,0

1-Speci�city

itivity

Helical CT

Conventional CT

0,0 0,2 0,4 0,6 0,8 1,0

1-Speci�city

itivity

Helical CTConventional CT

Diagnosis Resectability

Figure 1. Receiver operating characteristics plots presenting sensitivity and 1-specificity values for diagnosis (left) and resectability (right) of helical CT, conventional CT, MRI, and US.

Chapter 5

for MRI and US. The summary sensitivities for helical CT, conventional CT, MRI, and US were 91%,

86%, 84%, and 76% and the specificities were 85%, 79%, 82%, and 75% respectively. The sensi-

tivities for MRI and US were significantly lower than that of helical CT (P = 0.04 and 0.0001).

Resectability

The final model included year of publication (set to 2002), department of origin (radiology/

nuclear medicine), and sufficient description of diagnostic tests for helical CT and size of patient

population (> 50 patients) for conventional CT as covariates.

Sensitivities for helical CT, conventional CT, MRI, and US were 81%, 82%, 82%, and 83% and

specificities were 82%, 76%, 78%, and 63% respectively. Sensitivities were comparable. Specific-

ity for US was significantly lower compared with that for helical CT (P = 0.0011). The results of

the final models adjusted for significant variables are summarized in table 4.

Table 3. Predictors identified for diagnosis and determining resectabilty

Covariates Coefficient (95% CI) P value

Diagnosis helical CT (23 data sets)

Description of patient population* (sufficient)

Description of patient population† (sufficient)

1.06 (0.02, 2.10)

1.45 (0,64, 2.26)

0.0004

Diagnosis conventional CT (20 data sets)

Interpretation of results* (blind) 0.90 (0.43,1.36) 0.0002

Diagnosis MRI (11 data sets)

Description of patient population* (sufficient) -1.77(-2.83, -0.72) 0.001

Diagnosis US (14 data sets)

Description of patient population* (sufficient)

Description of patient population† (sufficient)

0.78 (0.39, 1.15)

-2.06 (-3.17, -0.95)

< 0.0001

0.0003

Resectability helical CT (32 data sets)

Year of publication (change per year)*

Department of origin* (Radiology/Nuclear Medicine)

Department of origin† (Radiology/Nuclear Medicine)

Description of diagnostic test* (sufficient)

Description of diagnostic test† (sufficient)

0.14 (0.03, 0.25 ‡

-1.16 (-1.61, -0.72)

1.36 (0.95, 1.77)

-1.76 (-2.11, -1.40)

1.18 (0.81, 1.54)

< 0.0001

Resectability conventional CT (12 data sets)

Size of patient population* (> 50 patient) -2.35 (-4.12, 0.59) 0.009

Numbers in parentheses are 95% confidence intervals. A positive regression coefficient indicates better discriminatory power of the imaging modality in studies with that characteristic compared with that in studies without the corresponding characteristics and a negative regression coefficient indicates reduced diagnostic performance in studies with that characteristic. * effect on sensitivity; † effects on specificity; ‡ Year 2002 was coded as 0, 2001 as –1, 2000 as –2, and so fort.

CT Criteria for Invasion of Portal SystemNineteen data sets evaluating 687 patients showed no improvements in sensitivity and specificity

values for predefined CT criteria compared with the overall results of helical CT. The sensitivity es-

timate was 83% (95% CI: 77%-88%) and the specificity estimate was 83% (95% CI: 77%-87%).

DIsCUssIon

In this meta-analysis we obtained summary estimates of helical CT, conventional CT, MRI, and

US for diagnosis and staging (distinguishing resectable for non-resectable tumors) of pancreatic

cancer. For diagnosis, the sensitivities of US and MRI were lower compared with helical CT. For

resectability, US had a significantly lower specificity compared with helical CT, indicating that US

falsely stages non-resectable tumors as resectable tumors, increasing the number of unnecessary

laparotomies.

The results on CT criteria for invasion of the portal system showed no improvement of the diag-

nostic accuracy. However, other factors such as liver metastases and lymph nodes also determine

resectability. Therefore, the impact of the predefined criteria could not be separately studied and

the diagnostic value could be therefore underestimated.

We tried to minimize some of the well-known limitations of meta-analysis by having two

reviewers independently extract data, using explicit inclusion criteria, and extracting study design

characteristics to evaluate their effects on diagnostic performance. Lijmer et al [84] showed that

bias in study design characteristics led either to overestimation or underestimation of the diag-

nostic performance. In our meta-analysis most studies suffered from bias in design characteristics

Table 4. Summary Estimates of Sensitivity and Specificity for US, CT, and MRI

Imaging modality Data sets/Number of patients

Sensitivity(95% CI)

Specificity(95% CI)

Diagnosis

Helical CT

Conventional CT

23/959

20/1473

11/583

14/2909

91% (86%-94%)

86% (81%-89%)

84% (78%-89%)*

76% (69%-82%)*

85% (76%-91%)

79% (60%-90%)

82% (67%-92%)

75% (51%-89%)

Resectability

Helical CT

Conventional CT

32/1823

12/1467

6/1233

81% (76%-85%)

82% (74%-88%)

82% (69%-91%)

83% (68%-91%)

82% (77%-87%)

76% (61%-86%)

78% (63%-87%)

63% (45%-79%)*

* Significant difference compared to helical CT.

Chapter 5

following the STARD criteria. We therefore studied whether these characteristics had an impact

on the outcome and adjusted where indicated by means of a bivariate random-effects approach.

This bivariate random-effects approach accounts not only for the heterogeneity between studies

due to different threshold settings, as is the case in the regular sROC analysis, but also for both

the error of estimation of the sensitivity and specificity values in each study [85, 86].

We analyzed 2 × 2 tables for different readers, for multiple observations per readers, and for

multiple CT and MRI techniques, as separate data sets. To adjust for the dependency in data sets

from the same patient population, we studied this correlation by using the empirical standard

error calculated by “sandwich estimator.”

An issue in any form of meta-analysis is the possibility of publication bias. We evaluated

whether the size of studies was associated with diagnostic accuracy. In particular, small studies

with optimistic results may be published more easily than small studies with unfavorable results.

Larger studies with optimistic results may also be published more easily than larger studies with

unfavorable studies, but this difference will be smaller. There was no association between sample

size and diagnostic performance.

Patient characteristics (stage of disease, age, or sex distribution) are known to affect diagnos-

tic accuracy. Unfortunately, the variation in data presentation prevented us from studying the

effect of these variables.

The reference standard used in this meta-analysis ranged from histopathology to follow-up. In

general the time interval between diagnostic test and reference test in patients with neoplasms

should be short for comparison; therefore, in patients undergoing follow-up, the results could

be underestimated. This is a major limitation of most diagnostic studies, in which histopathology

could not be performed in all patients due to ethical considerations. However, it was not possible

to study the effect of different reference standards (histopathology, intraoperative findings, or

follow-up) on diagnostic accuracy due to insufficient and not separately reported results from

different reference standards.

Another important factor in the resection of pancreatic cancer is the variation between surgical

resection procedures (eg, whether to perform vein resection). The influence of this factor could

also not be studied due to limited description of surgical procedures; this is another limitation of

most diagnostic studies.

Therefore, the STARD initiative has been developed to improve the quality of the reporting

of diagnostic studies. The items in the checklist and the flowchart can help authors in describing

essential elements of the design and conduct of the study, the execution of tests, and the results

CT technical factors such as the number of phases (pancreatic phase vs. multiphase helical CT)

and the type of scanner (single-slice vs. multi-slice) are important features; however, either due to

the low number of studies evaluating multislice CT or the heterogeneity of studies, these features

could not be studied. Although helical CT has replaced conventional CT in the staging of pancre-

atic carcinoma, this has not led to improved results. MRI has been studied less extensively in the

workup of pancreatic cancer, and therefore the number of studies was too limited to evaluate

technical factors such as magnetic field strength, type of coil, and slice thickness. US is known as

��

an operator-dependent modality, but the experience level of the sonographer is not described in

the included papers, and moreover the definition of operator experience is a subjective param-

eter. Techniques such as color/power Doppler and the frequency of the transducer could also not

be studied due to the low number of data sets.

Two other modalities playing a role in the management of pancreatic carcinoma are FDG-PET

and endoscopic ultrasonography (EUS). FDG-PET can add incremental information to the diag-

nosis of pancreas cancer; but data on this issue were limited [15, 18, 21, 26, 27, 87, 88]. EUS is

considered the most sensitive modality in the diagnosis of pancreatic cancer and in determining

the local extent of pancreatic cancer, but it cannot reliably assess metastatic involvement in the

liver and because of its invasive character is used in patients with negative or inconclusive CT or

MRI results.

In conclusion, based on the high sensitivity estimate for diagnosis of helical CT compared

with MRI and US and the high specificity value for resectability compared with US, helical CT is

preferable as an imaging modality for the diagnosis and assessment of resectability of pancreatic

adenocarcinoma.

��

Chapter 5

References

1. Parkin DM, Bray F, Ferlay J, et al. Estimating the world cancer burden: Globocan 2000. Int J Cancer. 2001;94:153-156

2. Clarke DL, Thomson SR, Madiba TE, et al. Preoperative imaging of pancreatic cancer: a management-oriented approach. J Am Coll Surg 2003;196:119-129.

3. Del Frate C, Zanardi R, Mortele K, et al. Advances in imaging for pancreatic disease. Curr Gastroenterol Rep 2002;4:140-148.

4. Fletcher JG, Wiersema MJ, Farrell MA, et al. Pancreatic malignancy: value of arterial, pancreatic, and hepatic phase imaging with multi-detector row CT. Radiology 2003;229:81-90.

5. Johnson PT, Heath DG, Hofmann LV, et al. Multidetector-row computed tomography with three-dimen-sional volume rendering of pancreatic cancer: a complete preoperative staging tool using computed tomography angiography and volume-rendered cholangiopancreatography. J Comput Assist Tomogr 2003;27:347-353.

6. Pauls S, Sokiranski R, Schwarz M, et al. [Value of spiral CT and MRI (1.5 T) in preoperative diagnosis of tumors of the head of the pancreas]. Rontgenpraxis 2003;55:3-15.

7. Prokesch RW, Schima W, Chow LC, et al. Multidetector CT of pancreatic adenocarcinoma: diagnostic advances and therapeutic relevance. Eur Radiol 2003;13:2147-2154.

8. Ishiguchi T, Maruyama K, Fukatsu H, et al. Radiologic diagnosis and staging of pancreatic ductal adeno-carcinoma. Semin Surg Oncol 1998;15:23-32.

9. Balci NC, Semelka RC. Radiologic diagnosis of pancreatic carcinoma. Eur J Radiol 2001;38:105-112.

10. Deville WL, Bezemer PD, Bouter LM. Publications on diagnostic test evaluation in family medicine journals: an optimal search strategy. J Clin Epidemiol 2000;53:65-69.

11. Bossuyt PM, Reitsma JB, Bruns DE, et al. Towards complete and accurate reporting of studies of diagnostic accuracy: the STARD initiative. Radiology 2003;226:24-28.

12. Berlin JA. Does blinding of readers affect the results of meta-analyses? University of Pennsylvania Meta-Analysis Blinding Study Group. Lancet 1997;350:185-186.

13. Van Houwelingen HC, Zwinderman KH, Stijnen T. A bivariate approach to meta-analysis. Stat Med 1993;12:2273-2284.

14. Van Houwelingen HC, Arends LR, Stijnen T. Advanced methods in meta-analysis: multivariate approach and meta-regression. Stat Med 2002;21:589-624.

15. Diehl SJ, Lehmann KJ, Sadick M, et al. Pancreatic cancer: value of dual-phase helical CT in assessing resect-ability. Radiology 1998;206:373-378.

16. Keogan MT, McDermott VG, Paulson EK, et al. Pancreatic malignancy: effect of dual-phase helical CT in tumor detection and vascular opacification. Radiology 1997;205:513-518.

17. Legmann P, Vignaux O, Dousset B, et al. Pancreatic tumors: comparison of dual-phase helical CT and endoscopic sonography. AJR Am J Roentgenol 1998;170:1315-1322.

18. Lehmann KJ, Diehl SJ, Lachmann R, et al. [Value of dual-phase-helical CT in the preoperative diagno-sis of pancreatic cancer: a prospective study]. Rofo Fortschr Geb Rontgenstr Neuen Bildgeb Verfahr 1998;168:211-216.

19. Liang KY, Zeger SL. Regression analysis for correlated data. Annu Rev Public Health 1993;14:43-68.

20. McNulty NJ, Francis IR, Platt JF, et al. Multi-detector row helical CT of the pancreas: effect of contrast-en-hanced multiphasic imaging on enhancement of the pancreas, peripancreatic vasculature, and pancreatic adenocarcinoma. Radiology 2001;220:97-102.

21. Mertz HR, Sechopoulos P, Delbeke D, et al. EUS, PET, and CT scanning for evaluation of pancreatic adeno-carcinoma. Gastrointest Endosc 2001;52:367-371.

22. Richter GM, Wunsch C, Schneider B, et al. [Hydro-CT in detection and staging of pancreatic carcinoma]. Radiologe 1998;38:279-286.

23. Rieber A, Tomczak R, Nussle K, et al. MRI with mangafodipir trisodium in the detection of pancreatic tumours: comparison with helical CT. Br J Radiol 2000;73:1165-1169.

��

24. Shoup M, Hodul P, Aranha GV, et al. Defining a role for endoscopic ultrasound in staging periampullary tumors. Am J Surg 2000;179:453-456.

25. Schima W, Fugger R, Schober E, et al. Diagnosis and staging of pancreatic cancer: comparison of man-gafodipir trisodium-enhanced MR imaging and contrast-enhanced helical hydro-CT. AJR Am J Roentgenol 2002;179:717-724.

26. Imbriaco M, Megibow AJ, Camera L, et al. Dual-phase versus single-phase helical CT to detect and assess resectability of pancreatic carcinoma. AJR Am J Roentgenol 2002;178:1473-1479.

27. Catalano C, Laghi A, Fraioli F, et al. Pancreatic carcinoma: the role of high-resolution multislice spiral CT in the diagnosis and assessment of resectability. Eur Radiol 2003;13:149-156.

28. Kalady MF, Clary BM, Clark LA, et al. Clinical utility of positron emission tomography in the diagnosis and management of periampullary neoplasms. Ann Surg Oncol 2002;9:799-806.

29. Laghi A, Iannaccone R, Catalano C, et al. Multislice spiral computed tomography in diagnosis and staging of pancreatic carcinoma: preliminary experience. Dig Liver Dis 2002;34:732-738.

30. Bares R, Klever P, Hauptmann S, et al. F-18 fluorodeoxyglucose PET in vivo evaluation of pancreatic glucose metabolism for detection of pancreatic cancer. Radiology 1994;192:79-86.

31. Fink C, Grenacher L, Hansmann HJ, et al. [Prospective study to compare high-resolution computed tomog-raphy and magnetic resonance imaging in the detection of pancreatic neoplasms: use of intravenous and oral MR contrast media]. Rofo Fortschr Geb Rontgenstr Neuen Bildgeb Verfahr 2001;173:724-730.

32. Friess H, Langhans J, Ebert M, et al. Diagnosis of pancreatic cancer by 2[18F]-fluoro-2-deoxy-D-glucose positron emission tomography. Gut 1995;36:771-777.

33. Furukawa T, Tsukamoto Y, Naitoh Y, et al. Differential diagnosis between benign and malignant local-ized stenosis of the main pancreatic duct by intraductal ultrasound of the pancreas. Am J Gastroenterol 1994;89:2038-2041.

34. Inokuma T, Tamaki N, Torizuka T, et al. Evaluation of pancreatic tumors with positron emission tomogra-phy and F- 18 fluorodeoxyglucose: comparison with CT and US. Radiology 1995;195:345-352.

35. Nakaizumi A, Uehara H, Iishi H, et al. Endoscopic ultrasonography in diagnosis and staging of pancreatic cancer. Dig Dis Sci 1995;40:696-700.

36. Palazzo L, Roseau G, Gayet B, et al. Endoscopic ultrasonography in the diagnosis and staging of pancreatic adenocarcinoma. Results of a prospective study with comparison to ultrasonography and CT scan. Endos-copy 1993;25:143-150.

37. Pasanen PA, Eskelinen M, Partanen K, et al. A prospective study of the value of imaging, serum markers and their combination in the diagnosis of pancreatic carcinoma in symptomatic patients. Anticancer Res 1992;12:2309-2314.

38. Rodriguez J, Kasberg C, Nipper M, et al. CT-guided needle biopsy of the pancreas: a retrospective analysis of diagnostic accuracy. Am J Gastroenterol 1992;87:1610-1613.

39. Rosch T, Lorenz R, Braig C, et al. Endoscopic ultrasound in pancreatic tumor diagnosis. Gastrointest Endosc 1991;37:347-352.

40. Snady H, Cooperman A, Siegel J. Endoscopic ultrasonography compared with computed tomography with ERCP in patients with obstructive jaundice or small peri-pancreatic mass. Gastrointest Endosc 1992;38:27-34.

41. Stollfuss JC, Glatting G, Friess H, et al. 2-(fluorine-18)-fluoro-2-deoxy-D-glucose PET in detection of pancre-atic cancer: value of quantitative image interpretation. Radiology 1995;195:339-344.

42. DelMaschio A, Vanzulli A, Sironi S, et al. Pancreatic cancer versus chronic pancreatitis: diagnosis with CA 19-9 assessment, US, CT, and CT-guided fine-needle biopsy. Radiology 1991;178:95-99.

43. Furukawa T, Oohashi K, Yamao K, et al. Intraductal ultrasonography of the pancreas: development and clinical potential. Endoscopy 1997;29:561-569.

44. Muller MF, Meyenberger C, Bertschinger P, et al. Pancreatic tumors: evaluation with endoscopic US, CT, and MR imaging. Radiology 1994;190:745-751.

45. Vellet AD, Romano W, Bach DB, et al. Adenocarcinoma of the pancreatic ducts: comparative evaluation with CT and MR imaging at 1.5 T. Radiology 1992;183:87-95.

��

Chapter 5

46. Papas M, Takacs T, Tron L, et al. The possible role of F-18 FDG positron emission tomography in the dif-ferential diagnosis of focal pancreatic lesions. Clin Nucl Med 2002;27:197-201.

47. Adamek HE, Albert J, Breer H, et al. Pancreatic cancer detection with magnetic resonance cholangio-pancreatography and endoscopic retrograde cholangiopancreatography: a prospective controlled study. Lancet 2000;356:190-193.

48. Diehl SJ, Lehmann KJ, Gaa J, et al. [The value of magnetic resonance tomography (MRT), magnetic reso-nance cholangiopancreatography (MRCP) and endoscopic retrograde cholangiopancreatography (ERCP) in the diagnosis of pancreatic tumors]. Rofo Fortschr Geb Rontgenstr Neuen Bildgeb Verfahr 1999;170:463-469.

49. Dobritz M, Fellner FA, Baum U, et al. [MR imaging of pancreatic lesions with Mn-DPDP. A histopathologic correlation]. Rofo Fortschr Geb Rontgenstr Neuen Bildgeb Verfahr 2002;174:893-897.

50. Hochwald SN, Rofsky NM, Dobryansky M, et al. Magnetic resonance imaging with magnetic resonance cholangiopancreatography accurately predicts resectability of pancreatic carcinoma. J Gastrointest Surg 1999;3:506-511.

51. Richter A, Gaa J, Niedergethmann M, et al. [Ultrafast magnetic resonance tomography changes the standard in pancreas diagnosis]. Chirurg 2001;72:697-703.

52. Becker D, Strobel D, Bernatik T, et al. Echo-enhanced color- and power-Doppler EUS for the discrimination between focal pancreatitis and pancreatic carcinoma. Gastrointest Endosc 2001;53:784-789.

53. Karlson BM, Ekbom A, Lindgren PG, et al. Abdominal US for diagnosis of pancreatic tumor: prospective cohort analysis. Radiology 1999;213:107-111.

54. Maringhini A, Ciambra M, Raimondo M, et al. Clinical presentation and ultrasonography in the diagnosis of pancreatic cancer. Pancreas 1993;8:146-150.

55. Rickes S, Unkrodt K, Neye H, et al. Differentiation of pancreatic tumours by conventional ultrasound, un-enhanced and echo-enhanced power Doppler sonography. Scand J Gastroenterol 2002;37:1313-1320.

56. Andren-Sandberg A, Lindberg CG, Lundstedt C, et al. Computed tomography and laparoscopy in the as-sessment of the patient with pancreatic cancer. J Am Coll Surg 1998;186:35-40.

57. Bluemke DA, Cameron JL, Hruban RH, et al. Potentially resectable pancreatic adenocarcinoma: spiral CT assessment with surgical and pathologic correlation. Radiology 1995;197:381-385.

58. Bottger T, Engelman R, Seifert JK, et al. Preoperative diagnostics in pancreatic carcinoma: would less be better? Langenbecks Arch Surg 1998;383:243-248.

59. Hommeyer SC, Freeny PC, Crabo LG. Carcinoma of the head of the pancreas: evaluation of the pancreati-coduodenal veins with dynamic CT-potential for improved accuracy in staging. Radiology 1995;196:233-238.

60. Hough TJ, Raptopoulos V, Siewert B, et al. Teardrop superior mesenteric vein: CT sign for unresectable carcinoma of the pancreas. AJR Am J Roentgenol 1999;173:1509-1512.

61. Howard TJ, Chin AC, Streib EW, et al. Value of helical computed tomography, angiography, and endo-scopic ultrasound in determining resectability of periampullary carcinoma. Am J Surg 1997;174:237-241.

62. Lentschig MG, Reimer P, Rummeny E, et al. [The value of 3-phase spiral CT and magnetic resonance tomography in preoperative diagnosis of pancreatic carcinoma]. Radiologe 1996;36:406-412.

63. Lu DS, Reber HA, Krasny RM, et al. Local staging of pancreatic cancer: criteria for unresectability of major vessels as revealed by pancreatic-phase, thin-section helical CT. AJR Am J Roentgenol 1997;168:1439-1443.

64. Phoa SS, Reeders JW, Rauws EA, et al. Spiral computed tomography for preoperative staging of potentially resectable carcinoma of the pancreatic head. Br J Surg 1999;86:789-794.

65. Raptopoulos V, Steer ML, Sheiman RG, et al. The use of helical CT and CT angiography to predict vas-cular involvement from pancreatic cancer: correlation with findings at surgery. AJR Am J Roentgenol 1997;168:971-977.

66. Valls C, Andia E, Sanchez A, et al. Dual-phase helical CT of pancreatic adenocarcinoma: assessment of resectability before surgery. AJR Am J Roentgenol 2001;178:821-826.

67. Zeman RK, Cooper C, Zeiberg AS, et al. TNM staging of pancreatic carcinoma using helical CT. AJR Am J Roentgenol 1997;169:459-464.

��

68. Procacci C, Biasiutti C, Carbognin G, et al. Spiral computed tomography assessment of resectability of pancreatic ductal adenocarcinoma: analysis of results. Dig Liver Dis 2002;34:739-747.

69. Minniti S, Bruno C, Biasiutti C, et al. Sonography versus helical CT in identification and staging of pancreatic ductal adenocarcinoma. J Clin Ultrasound 2003;31:175-182.

70. Awad SS, Colletti L, Mulholland M, et al. Multimodality staging optimizes resectability in patients with pancreatic and ampullary cancer. Am Surg 1997;63:634-638.

71. Bakkevold KE, Arnesjo B, Kambestad B. Carcinoma of the pancreas and papilla of Vater-assessment of resectability and factors influencing resectability in stage I carcinomas. A prospective multicentre trial in 472 patients. Eur J Surg Oncol 1992;18:494-507.

72. Freeny PC, Traverso LW, Ryan JA. Diagnosis and staging of pancreatic adenocarcinoma with dynamic computed tomography. Am J Surg 1993;165:600-606.

73. Gress FG, Hawes RH, Savides TJ, et al. Role of EUS in the preoperative staging of pancreatic cancer: a large single-center experience. Gastrointest Endosc 1999;50:786-791.

74. Loyer EM, David CL, Dubrow RA, et al. Vascular involvement in pancreatic adenocarcinoma: reassessment by thin-section CT. Abdom Imaging 1996;21:202-206.

75. McCarthy MJ, Evans J, Sagar G, et al. Prediction of resectability of pancreatic malignancy by computed tomography. Br J Surg 1998;85:320-325.

76. Megibow AJ, Zhou XH, Rotterdam H, et al. Pancreatic adenocarcinoma: CT versus MR imaging in the evaluation of resectability-report of the Radiology Diagnostic Oncology Group. Radiology 1995;195:327-332.

77. O’Malley ME, Boland GW, Wood BJ, et al. Adenocarcinoma of the head of the pancreas: determination of surgical unresectability with thin-section pancreatic-phase helical CT. AJR Am J Roentgenol 1999;173:1513-1518.

78. Warshaw AL, Gu ZY, Wittenberg J, et al. Preoperative staging and assessment of resectability of pancreatic cancer. Arch Surg 1990;125:230-233.

79. Zeiss J, Coombs RJ, Bielke D. CT presentation and staging accuracy of pancreatic adenocarcinoma. Int J Pancreatol 1990;7:49-53.

80. Ahmad NA, Lewis JD, Siegelman ES, et al. Role of endoscopic ultrasound and magnetic resonance imaging in the preoperative staging of pancreatic adenocarcinoma. Am J Gastroenterol 2000;95:1926-1931.

81. McFarland EG, Kaufman JA, Saini S, et al. Preoperative staging of cancer of the pancreas: value of MR angiography versus conventional angiography in detecting portal venous invasion. AJR Am J Roentgenol 1996;166:37-43.

82. Angeli E, Venturini M, Vanzulli A, et al. Color Doppler imaging in the assessment of vascular involvement by pancreatic carcinoma. AJR Am J Roentgenol 1997;168:193-197.

83. Ishida H, Konno K, Hamashima Y, et al. Assessment of resectability of pancreatic carcinoma by color Dop-pler sonography. Abdom Imaging 1999;24:295-298.

84. Lijmer JG, Mol BW, Heisterkamp S, et al. Empirical evidence of design-related bias in studies of diagnostic tests. JAMA 1999;282:1061-1066.

85. Rutter CM, Gatsonis CA. Regression methods for meta-analysis of diagnostic test data. Acad Radiol 1995;(Suppl 1):S48-S56.

86. Rutter CM, Gatsonis CA. A hierarchical regression approach to meta-analysis of diagnostic test accuracy evaluations. Stat Med 2001;20:2865-2884.

87. Jadvar H, Fischman AJ. Evaluation of pancreatic carcinoma with FDG PET. Abdom Imaging 2001;26:254-259.

88. Keogan MT, Tyler D, Clark L, et al. Diagnosis of pancreatic carcinoma: role of FDG PET. AJR Am J Roent-genol 1998;171:1565-1570.

6C h a p t e r

Imaging and treatment of patients with colorectal liver metastases in the Netherlands: a survey

Shandra BipatMaarten S. van LeeuwenJan N.M. IJzermansPatrick M.M. BossuytJan-Willem GreveJaap Stoker

The Netherlands Journal of Medicine 2006;64:147-151

��

Chapter 6

absTRaCT

Background: Clinical experience has highlighted the absence of a uniform approach to the management of patients with colorectal liver metastases in the Netherlands.

Methods: A written survey on the diagnosis and treatment of patients with colorectal liver metastases was sent to all 107 chairmen of oncology committees in each hospital. Questions were asked concerning: specialists involved in decision-making, availability and existence of guidelines, and meetings, factors that needed to be improved, information regarding the di-agnostic work-up of liver metastases, detailed techniques of ultrasonography (US), computed tomography (CT), magnetic resonance imaging (MRI), and positron emission tomography (PET), factors influencing resectability, types of surgery performed, the use of (neo)-adjuvant chemotherapy, portal vein embolization performance, considering isolated hepatic perfusion (IHP), or local ablation as treatment options, actual performance of local ablation, and the use of systemic as well as regional chemotherapy.

Results: Response rate was 68% (73/107). Specialists involved in the management were mostly surgeons (70), medical oncologists (66), and radiologists (42). Factors that needed to be im-proved, as indicated by responders, were the absence of 1) guidelines; 2) registration of pa-tients; and 3) guidelines for radiofrequency ablation (RFA). Diagnostic work-up of synchronous liver metastases occurred in 71 hospitals, (by US in 69 and by CT in 2). For the work-up of metachronous liver metastases, US was used as initial modality in 14, CT in 2 hospitals, and 57 hospitals used one or the other (mainly US). As additional modality, CT was performed (71) and to a lesser extent MRI (38) or PET (22). Diagnostic laparoscopy and biopsy were performed incidentally. The choice for an imaging modality was mostly influenced by the literature and to a lesser extent by the availability and by costs, personnel, and waiting lists. Substantial varia-tion exists in the US, CT, MRI, and PET techniques. The absence of extrahepatic disease and the clinical condition were considered as the most important factors influencing resectability. Surgery was performed in 30 hospitals; hemihepatectomy in 25, segment resection in 27, multisegment resection in 23, wedge excision in 27, and combination of resection and RFA in 18 institutions. In 52 hospitals (neo)-adjuvant chemotherapy was administrated to improve surgical results, partly (35%) in trials. In nine hospitals portal vein embolization was performed, with the volume of the remnant liver as the most important factor. Local ablative techniques were considered as a treatment option in 48 hospitals and actually performed in 16 hospitals, without clearly defined indications. Experimental IHP was considered a treatment option by 45 (62%) responders, irrespective whether this treatment was available at their center. Patients with extensive metastases received systemic chemotherapy in all 73 hospitals and regional chemotherapy in ten hospitals.

Conclusion: This survey shows substantial variation in the diagnostic and therapeutic work-up of patients with colorectal liver metastases. This variation reflects either under- or over-utilization of diagnosis and treatment options. Evidence-based guidelines taking into account the avail-able evidence, experience and availability can solve this variation.

Management of patients with colorectal liver metastases in the netherlands

��

InTRoDUCTIon

Colorectal carcinoma is one of the commonest solid tumors and is responsible for approximately

10% of cancer-related deaths in the Western world. Liver metastasis is a common consequence

of colorectal carcinoma; 50 to 60% patients develop liver metastases.

Early and accurate diagnosis of liver metastasis is crucial for clinical decision-making. Surgery

is the only therapy that offers any possibility of cure with five-year survival rates after resection

of all detectable disease up to 40% [1-4]. Unfortunately, only 20 to 25% of patients are deemed

suitable for hepatic resection. To improve the results of surgery, a subgroup of these patients

either receive neoadjuvant or adjuvant chemotherapy. Patients not suitable for surgery, due to

extensive liver metastases or extrahepatic diseases, in general undergo systemic chemotherapy.

Several newer therapies such as cryosurgery, radiofrequency ablation (RFA), portal vein emboliza-

tion, isolated hepatic perfusion (IHP), and regional chemotherapy are being evaluated in patients

not suitable for surgery due to the number or distribution of liver metastases [5-11].

Imaging modalities such as ultrasound (US), computed tomography (CT), magnetic resonance

imaging (MRI), positron emission tomography (PET), and laparoscopy (combined with US), rep-

resent important tools in the selection of patients for the appropriate treatment [12-17]. Most

of these diagnostic and therapeutic modalities are available in the Netherlands and there are

concerns about variability in diagnosis and treatment policies.

Clinical experience has highlighted several problems: variation in diagnostic strategies, fac-

tors determining the respectability (presence of extrahepatic diseases), use of neoadjuvant or

adjuvant chemotherapy, extent of use of experimental treatment modalities (RFA, portal vein

embolization, IHP, and regional chemotherapy), and the use of different systemic chemotherapy

regimens. In addition, evidence-based guidelines concerning the diagnosis and treatment are not

available in the Netherlands at the moment.

Current policies are usually based on consensus meetings, expert opinions, results from stud-

ies, and personal and/or institutional experience and preferences, resulting in variable and incon-

sistent choices and regimens among specialists and institutions. By means of a written survey,

we evaluated the policies on the management of patients with colorectal liver metastases in the

Netherlands. The primary aim of this survey was to summarize the extent of variation in the diag-

nosis and treatment strategies. The second aim was to obtain relevant information for developing

and implementing evidence-based guidelines.

A written survey on the management of colorectal liver metastases was sent to all Dutch hospitals

dealing with this group of patients in November 2002. A total of 107 questionnaires were sent to

chairmen of the oncology committees in each hospital. All eight academic hospitals participated

in this survey. The replies were returned in prepaid stamped envelopes and collected until June

Chapter 6

2004. Due to the diversity of specialists involved in the work-up, the questionnaire was divided

into three parts:

1) In the general part, questions were asked about the presence of registration systems, the

number of patients diagnosed and/or treated, specialists involved in the treatment policy, avail-

ability of guidelines, existence of meetings, factors that needed to be improved and research on

both diagnostic and treatment field.

2) In the diagnostic part, information on the availability of modalities and the complete diag-

nostic work-up of synchronous and metachronous liver metastases was requested. This included

information on technical details of US, CT, MRI, and PET and the factors influencing the choice

between these approaches.

3) In the treatment part, questions were asked about factors influencing the choice for surgi-

cal treatment, the types of surgery performed, whether (neo)-adjuvant chemotherapy was ad-

ministrated, whether liver perfusion and local ablation were considered as treatment options ir-

respective of availability, types of local ablation performed, portal vein embolization performance

and whether systemic or regional chemotherapy was administrated. In addition, information on

schedules of the chemotherapy approaches was requested.

ResUlTs

Response rateSeventy-four (69%), 73 (68%) and 73 (68%) replies were returned for the general, diagnostic

and treatment parts of the questionnaire, respectively, (including from all eight academic institu-

tions).

GeneralSpecialists involved in the management were surgeons in 70, medical oncologists in 66, radiolo-

gists in 42, internists in 21, gastroenterologist in 17, and nuclear medicine specialists in three

hospitals. In all hospitals meetings were held frequently (once every two weeks) between special-

ists of one hospital (25), specialists of more hospitals (11), or between specialists and consulting

specialists of the Comprehensive Cancer Center in most centers (58).

Registration and guidelines

Registration of patients with colorectal liver metastases was only carried out in 26 hospitals. The

number of patients for diagnosis ranged from 10 to 150, for surgical treatment from 1 to 40

and for palliative chemotherapy from 6 to 45 patients. Practical guidelines were available in only

16 hospitals; however these guidelines were not evidence-based. In addition, most hospitals (66)

indicated they preferred national or regional evidence-based guidelines.

Factors needing improvement

The most important points of concern in the daily practice, according to the responders, were the

��

absence of general guidelines for diagnosis and treatment of patients with colorectal liver metas-

tases, absence of registration systems and to a lesser extent absence of guidelines for indications

and performance of radiofrequency ablation (RFA).

DiagnosisAvailability of imaging modalities

US and CT were available in all 73 hospitals, MRI in 71, and PET in 11 hospitals, respectively.

Diagnostic work-up of synchronous liver metastases occurred in 71 (97%) hospitals; in 69 mainly

by US and in two by means of CT.

Diagnostic work-up of metachronous liver metastases was performed step by step, starting

with an initial screening modality followed by an additional modality for further detection and

characterization of liver metastases. As initial modality US was used in 14 hospitals, CT in two

hospitals, while 57 hospitals used one or the other (mainly US). As additional modality for charac-

terization and determining resectability, CT was generally performed (71) and to a lesser extent

MRI (38) or PET (22).

In 33 hospitals a one-stop-shop imaging (for detection, characterization and determining re-

sectability) was performed by means of CT. Diagnostic laparoscopy and biopsy (US-guided or

CT-guided) were performed incidentally in 14 and 67 hospitals, respectively. Factors affecting the

choice for a diagnostic modality were mostly influenced by the literature, to a lesser extent by

availability and occasionally by costs, personnel and waiting lists.

The technical details on US, CT, MRI, and PET were provided by 62, 62, 60, and 7 hospitals,

respectively.

Ultrasonography (n = 62). In all hospitals a convex transducer was used for imaging; the use of

an additional linear transducer for detailed visualization of the liver surface was limited to seven

hospitals. US with ‘harmonic frequency’ in combination with conventional US was performed in

43 hospitals. The use of contrast agents for the assessment of vascularisation of focal lesions was

limited to two hospitals.

Computed tomography (n = 62). In 57 hospitals spiral CT scanners were used, including

37 multislice scanners. The number of detectors in the multislice scanners varied from 2 to 16

(modus: 4) and the slice thickness ranged from 1 tot 11 mm (modus: 5 mm). The introduction of

multislice scanners made it possible to perform scanning with lower slice thickness and therefore

to improve the detection of smaller lesions. In most institutions (36), four-phase scanning was

performed (unenhanced, arterial, portal, and late phase). In general the unenhanced and the

portal phases are used for detection of liver metastases; however, arterial and late phases are

helpful in distinguishing other lesions. The amount of iodine ranged from 24 to 72 g (modus 30

g). Detection of liver metastases is expected to improve by using large amount of iodine.

Magnetic emission tomography (n = 60). The magnetic strength of the MRI equipment was

mainly 1.0 T or 1.5 T (n = 47). The most frequently used contrast agent was non-specific gado-

linium (n = 42); to a lesser extent (n = 14) liver specific contrast agents such as Endorem®

(dextran-coated ferumoxide), Resovist® (ferucarbotran), Teslascan® (mangafodipir trisodium),

��

Chapter 6

and Multihance® (gadobenate dimeglumine) were used to increase detection of small liver me-

tastases, due to selective accumulation of contrast agent in liver parenchyma.

Positron emission tomography (n = 7). Six centers had a dedicated full-ring scanner. The

amount of fluoro-2-deoxyglucose varied from 150 to 600 MBq and the analysis was mostly

qualitatively and incidentally semi-quantitatively.

TreatmentFactors influencing resectability are summarized in table 1, with absence of extrahepatic disease

and the clinical condition considered being the most important factors.

Surgery was performed in 30 hospitals: hemihepatectomy in 25, segment resection in 27,

multisegment resection in 23, wedge excision in 27, and a combination of resection and RFA

in 18 institutions. In 52 (71%) hospitals either neoadjuvant or adjuvant chemotherapy was ad-

ministrated to improve surgical results with a substantial variation in the treatment regimens,

mostly 5-fluorouracil + leucovorin or 5-fluorouracil + leucovorin + oxalipatin, while irinotecan was

administrated less often. Approximately 35% (18) of the responders explicitly mentioned that

(neo)-adjuvant chemotherapy was administrated in trials.

Table 1. Factors influencing resectability of liver metastases

Factor Number of hospitals

Number of lesions 57 (78%)

Size of lesions 40 (55%)

Location of lesions 58 (79%)

R0 resection (clear surgical margins) 26 (36%)

Extrahepatic metastases 63 (86%)

Anatomic structure of the liver 26 (36%)

Stage and grade of the primary tumor 14 (19%)

Age of the patient 27 (37%)

Clinical condition of the patient 69 (95%)

Wish of the patient 52 (71%)

Time between primary tumor and metastases detection 24 (33%)

Response: 73 (68%) hospitals.

Portal vein embolization was only performed in nine hospitals to achieve a hypertrophy of the

remnant liver. The most important factor determining the choice for portal vein embolization was

the volume of the remnant liver.

Ablation techniques were considered treatment options in 48 hospitals (47 RFA, 19 cryoabla-

tion, 10 laser-induced interstitial thermotherapy). The actual use of these techniques was limited

to 16 hospitals (RFA in 15, cryoablation in two, and laser-induced interstitial thermotherapy in one

hospital), however without evident indications or guidelines.

��

Of the responders, 62% (45) indicated that they considered experimental IHP to be a treatment

option irrespective of whether this treatment was available at their center. IHP involves complete

vascular isolation of the liver to allow regional delivery of high-dose chemotherapy to the liver with

little systemic toxicity. This experimental technique is being evaluated at Leiden University Medical

Center and Erasmus Medical Center in Rotterdam.

Patients with extensive metastases are only suitable for chemotherapy, either systemic or

regional. In all 73 hospitals patients received systemic chemotherapy and in ten regional che-

motherapy were given. For systemic chemotherapy, several protocols were used: 5-fluorouracil

and leucovorin with either oxalipatin or irinotecan and the use of capecitabine (instead of 5-

fluorouracil).

DIsCUssIon

In most institutions, the strategy for diagnosis was comparable: US was used as an initial screen-

ing imaging modality to detect patients with liver metastases. Easy availability and non-invasive-

ness are some of the reasons for the widespread use of US. As additional modalities mostly

CT and to a lesser extent MRI were used; however, with a substantial variation in CT and MRI

techniques, such as the use of different phases and amount of contrast for CT and different

contrast agents for MRI.

The variation is mostly a consequence of technical developments (e.g. introduction of multislice

CT and liver specific MRI agents) [12-17] and uncertainties in the literature (different outcomes),

indicating the need for evidence-based guidelines.

In general, diagnostic laparoscopy is performed in selected cases to detect extrahepatic dis-

ease, thereby preventing unnecessary laparotomies. However, in patients selected for surgery

based on extensive imaging, the prevalence of extrahepatic disease will be low and therefore the

additional value of diagnostic laparoscopy will be limited [18-20].

There were concerns about surgery in patients with extrahepatic disease. However, most of

the responders indicated that extrahepatic disease is a major contraindicative factor for surgery.

In 52 hospitals (neo)-adjuvant chemotherapy was given to improve surgical results. Due to the

structure of the written survey, no data on the frequencies of neoadjuvant or adjuvant therapy

are available. (Neo)-adjuvant chemotherapy was also administrated in trials, explaining part of the

variation. We were aware of this variation and tried to summarize the extent of use of (neo)-ad-

juvant, without describing regimens and/or indications. In addition, the effect of (neo)-adjuvant

chemotherapy has not been significantly proven [21, 22].

Portal vein embolization was performed in nine hospitals, with the volume of the remnant liver

as the most important selection criterion.

RFA was considered as a treatment option in most hospitals; however, this technique was per-

formed in a limited number of hospitals, with no uniform indications or selection criteria. A paper

by Mutseart et al. reporting on the initial experience with RFA of malignant hepatic tumors in the

Netherlands showed recurrence in 52% of the patients [23]. In addition, there are no random-

��

Chapter 6

ized trials; RFA is being evaluated in an ongoing randomized trial comparing chemotherapy plus

local ablation with chemotherapy (CLOCC) alone. The advice of the British National Institute for

Clinical Excellence (NICE) is as follows: Current evidence of the safety and efficacy of local tumor

ablation by RFA for colorectal cancer metastases does not appear adequate to support the use of

this procedure without special arrangements for consent and for audit or research [24].

Most of the responders (62%) indicated that they considered IHP a treatment option for

selected patients with extensive liver metastases. IHP has good efficacy in terms of response rate

and duration; however, due to the high toxicity rate, the use of this technique is appropriately

limited to research protocols at dedicated centers [25-29].

The value of regional chemotherapy in patients with non-resectable tumors is unclear. A higher

response percentage is obtained compared with intravenous 5-FU; however, no improvement of

survival is shown [30]. This technique is therefore performed in limited cases in the Netherlands.

An important limitation of this survey is the suboptimal response (69%), not representing the

overall situation in the Netherlands. However, all the academic hospitals and institutions using

the experimental treatment options were included in this survey, thus indicating that the hospitals

that did not respond represent hospitals with a limited number or no patients with this disease.

Two major points of concern in the management of patients with colorectal cancer which

need to be addressed are the absence of guidelines and registration systems. Registration sys-

tems are important tools in evaluating management. The collaboration between specialists and

consulting specialists of the Comprehensive Cancer Centers will make it possible to establish a

national registry.

A national evidence-based guideline is being developed to overcome the problem concerning

the absence of guidelines.

Substantial variation exists in the diagnostic and therapeutic work-up of patients with colorec-

tal liver metastases. This can be explained by recent developments, the availability of techniques,

expertise, uncertainties in the literature (e.g. diagnostic value, effect, survival) and mostly by the

absence of guidelines. Research and evidence-based guidelines taking into account the available

evidence, experience and availability can solve this problem.

��

References

1. Fong Y, Cohen AM, Fortner JG, et al. Liver resection for colorectal metastases. J Clin Oncol 1997;15:938-946.

2. Hughes KS, Rosenstein RB, Songhorabodi S, et al. Resection of the liver for colorectal carcinoma metasta-ses. A multi-institutional study of longterm survivors. Dis Colon Rectum 1988;31:1-4.

3. Nordlinger B, Guiguet M, Vaillant JC, et al. Surgical resection of colorectal carcinoma metastases to the liver. A prognostic scoring system to improve case selection, based on 1568 patients. Association Francaise de Chirurgie. Cancer 1996;77:1254-1262.

4. Scheele J, Stang R, Altendorf-Hofmann A, Paul M. Resection of colorectal liver metastases. World J Surg 1995;19:59-71.

5. Adam R, Hagopian EJ, Linhares M, et al. A comparison of percutaneous cryosurgery and percutaneous radiofrequency for unresectable hepatic malignancies. Arch Surg 2002;137:1332-1339.

6. Ruers TJ, Joosten J, Jager GJ, Wobbes T. Long-term results of treating hepatic colorectal metastases with cryosurgery. Br J Surg 2001;88:844-849.

7. Shankar A, Lees WR, Gillams AR, Lederman JA, Taylor I. Treatment of recurrent colorectal liver metastases by interstitial laser photocoagulation. Br J Surg 2000;87:298-300.

8. Solbiati L, Livraghi T, Goldberg SN, et al. Percutaneous radio-frequency ablation of hepatic metastases from colorectal cancer: long-term results in 117 patients. Radiology 2001;221:159-166.

9. Vogl T, Mack M, Straub R, et al. [Thermal ablation of liver metastases. Current status and prospects]. Radiologe 2001;41:49-55.

10. Douillard JY, Cunningham D, Roth AD, et al. Irinotecan combined with fluorouracil compared with fluoro-uracil alone as first-line treatment for metastatic colorectal cancer: a multicentre randomised trial. Lancet 2000;355:1041-1047.

11. Saltz LB, Cox JV, Blanke C, et al. Irinotecan plus fluorouracil and leucovorin for metastatic colorectal cancer. Irinotecan Study Group. N Engl J Med 2000;343:905-914.

12. Hagspiel KD, Neidl KF, Eichenberger AC, Weder W, Marincek B. Detection of liver metastases: comparison of superparamagnetic iron oxide-enhanced and unenhanced MR imaging at 1.5 T with dynamic CT, intra-operative US, and percutaneous US. Radiology 1995:196:471-478.

13. Jang HJ, Lim HK, Lee WJ, Kim SH, Kim KA, Kim EY. Ultrasonographic evaluation of focal hepatic lesions: comparison of pulse inversion harmonic, tissue harmonic and conventional imaging techniques. J Ultra-sound Med 2000;19:293-299.

14. Lai DT, Fulham M, Stephen MS, et al. The role of whole-body positron emission tomography with [18F] fluo-rodeoxyglucose in identifying operable colorectal cancer metastases to the liver. Arch Surg 1996;131:703-707.

15. Ruers TJ, Langenhoff BS, Neeleman N, et al. Value of positron emission tomography with [F-18] fluoro-deoxyglucose in patients with colorectal liver metastases: a prospective study. J Clin Oncol 2002;20:388-395.

16. Van Etten B, van der Sijp J, Kruyt R, Oudkerk M, van der Holt B, Wiggers T. Ferumoxide-enhanced magnetic resonance imaging techniques in pre-operative assessment for colorectal liver metastases. Eur J Surg Oncol 2002;28:645-651.

17. Kopka L, Grabbe E. [Biphasic liver diagnosis with multiplanar-detector spiral CT]. Radiology 1999:39:971-978.

18. D’Angelica M, Fong Y, Weber S, et al. The role of staging laparoscopy in hepatobiliary malignancy: pro-spective analysis of 401 cases. Ann Surg Oncol 2003;10;183-189.

19. Jarnagin WR, Conlon K, Bodniewicz J, et al. A clinical scoring system predicts the yield of diagnostic laparoscopy in patients with potentially resectable hepatic colorectal metastases. Cancer 2001;91:1121-1128.

Chapter 6

�6 �7

20. Grobmyer SR, Fong Y, D’Angelica M, Dematteo RP, Blumgart LH, Jarnagin WR. Diagnostic laparoscopy prior to planned hepatic resection for colorectal metastases. Arch Surg 2004;139:1326-1330.

21. Figueras J, Valls C, Rafecas A, et al. Resection rate and effect of postoperative chemotherapy on survival after surgery for colorectal liver metastases. Br J Surg 2001;88:980-985.

22. Punt CJ. New options and old dilemmas in the treatment of patients with advanced colorectal cancer. Ann Oncol 2004;15:1453-1459.

23. Mutsaerts EL, van Coevorden F, Krause R, et al. Initial experience with radiofrequency ablation for hepatic tumours in the Netherlands. Eur J Surg Oncol 2003;29:731-734.

24. IPG092 Radiofrequency ablation for the treatment of colorectal metastases in the liver. http://www.nice.org.uk.

25. Alexander HR Jr, Bartlett DL, Libutti SK, Fraker DL, Moser T, Rosenberg SA. Isolated hepatic perfusion with tumour necrosis factor and melphalan for unresectable cancers confined to the liver. J Clin Oncol 1998;16:1479-1489.

26. Bartlett DL, Libutti SK, Figg WD, Fraker DL, Alexander HR. Isolated hepatic perfusion for unresectable hepatic metastases from colorectal cancer. Surgery 2001;129:176-187.

27. Rothbarth J, Pijl ME, Vahrmeijer AL, et al. Isolated hepatic perfusion with high-dose melphalan for the treatment of colorectal metastasis confined to the liver. Br J Surg 2003;90:1391-1397.

28. Marinelli A, de Brauw LM, Beerman H, et al. Isolated liver perfusion with mitomycin C in the treatment of colorectal cancer metastases confined to the liver. Jpn J Clin Oncol1996;26:341-350.

29. Vahrmeijer AL, van Dierendonck JH, Keizer HJ, et al. Increased local cytostatic drug exposure by isolated hepatic perfusion: a phase I clinical and pharmacologic evaluation of treatment with high dose melphalan in patients with colorectal cancer confined to the liver. Br J Cancer 2000;82:1539-1546.

30. Meta-Analysis Group in Cancer. Reappraisal of hepatic arterial infusion in the treatment of nonresectable liver metastases from colorectal cancer. J Nat Cancer Inst 1996;88:252-258.

�6 �7

7C h a p t e r

Colorectal liver metastases: CT, MR imaging, and PET for diagnosis. Meta analysis

Shandra BipatMaarten S. van LeeuwenEmile F. I. ComansMilan E. J. PijlPatrick M. M. BossuytAeilko H. ZwindermanJaap Stoker

Radiology 2005;237:123-131

Chapter 7

absTRaCT

Purpose: To perform a meta-analysis to obtain sensitivity estimates of computed tomography

(CT), magnetic resonance imaging (MRI), and fluorine 18 fluorodeoxyglucose positron emis-

sion tomography (FDG-PET) for detection of colorectal liver metastases on per-patient and

per-lesion bases.

Methods: MEDLINE, EMBASE, Web of Science, and CANCERLIT databases and Cochrane Data-

base of Systematic Reviews were searched for relevant original articles published from Janu-

ary 1990 to December 2003. Criteria for inclusion of articles were as follows: Articles were

reported in the English, German, or French language; CT, MRI or FDG-PET was performed to

identify and characterize colorectal liver metastases; histopathologic analysis (surgery, biopsy,

or autopsy), intraoperative observation (manual palpatation, intraoperative ultrasonography

[US]), and/or follow-up US was the reference standard; and data were sufficient for calculation

of true-positive or false-negative values. A random-effects linear regression model was used to

obtain sensitivity estimates in assessment of liver metastases.

Results: Of 165 identified relevant articles, 61 fulfilled all inclusion criteria. Sensitivity estimates

on a per-patient basis for non-helical CT, helical CT, 1.5 T MRI, and FDG-PET were 60.2%,

64.7%, 75.8%, and 94.6%, respectively; FDG-PET was the most accurate modality.

On a per-lesion basis, sensitivity estimates for non-helical CT, helical CT, 1.0 T MRI, 1.5 T MRI

and FDG-PET were 52.3%, 63.8%, 66.1%, 64.4%, and 75.9%, respectively; non-helical CT had

lowest sensitivity. Estimates of gadolinium-enhanced MRI and superparamagnetic iron oxide

(SPIO)–enhanced MRI were significantly better, compared with non-enhanced MRI (P = 0.019

and P < 0.001, respectively) and with helical CT with 45 g of iodine or less (P = 0.02 and P <

0.001, respectively). For lesions of 1 cm or larger, SPIO-enhanced MRI was the most accurate

modality (P < 0.001).

Conclusion: FDG-PET had significantly higher sensitivity on a per-patient basis, compared with

that of the other modalities, but not on a per-lesion basis. Sensitivity estimates for MRI with

contrast agent were significantly superior to those for helical CT with 45 g of iodine or less.

CT, MRI and PeT for diagnosis colorectal liver metastases: meta-analysis

�0�

InTRoDUCTIon

Colorectal cancer is the second leading cause of cancer-related deaths in the United States. Ac-

cording to the National Program of Cancer Registries, 146 940 new patients received a diagnosis

of the disease in 2004, with an estimated 56 730 deaths due to colorectal cancer in that year.

Liver metastasis is a common consequence of colorectal carcinoma; up to 70% of patients

with colorectal cancer eventually develop liver metastases. In 30%–40% of those patients, the

metastases are still confined to the liver at the time of detection, and only a limited number of

patients with colorectal metastases confined to the liver are surgical candidates because of the

larger size of the lesions, the broad distribution of the lesions, or the difficulty in assessing the

tumors or because the volume of the remaining liver is inadequate [1-5].

Preoperative selection of patients with colorectal metastases who are most likely to benefit

from surgery is necessary and challenging. The armamentarium for imaging-based preoperative

selection comprises transabdominal ultrasonography (US), computed tomography (CT), fluorine

18 fluorodeoxyglucose positron emission tomography (FDG-PET), and magnetic resonance imag-

ing (MRI) [6-12].

During the past 10 years, improvements in these imaging modalities were either introduced or

great progress has been made in their application [6, 8, 13-16]. Although extensive research has

been performed in regard to the diagnostic performance of CT, MRI, and FDG-PET for the detec-

tion of colorectal liver metastases, the optimal imaging staging strategy has not been defined.

Kinkel et al [17] performed a meta-analysis to compare current non-invasive imaging methods

such as US, CT, MRI and FDG-PET for the detection of hepatic metastases from colorectal, gastric,

and esophageal cancers. Treatment approaches for liver metastases from various cancerous ori-

gins (pancreatic or colorectal cancer), however, are different, and, therefore, the importance of

certain findings with respect to these origins differs.

Because of its non-invasive character, low cost, and widespread availability, US is a valuable

screening tool for the imaging of liver metastases. US, however, has two relative disadvantages:

US is more operator independent than are CT and MRI and parts of the liver remain non-visible

in certain patients at US. In daily practice, though, US is highly efficient in helping to distinguish

between two groups of patients with liver metastases: the group of patients with diffuse metas-

tases who are no longer eligible for curative treatment and the group with no metastases or a

very limited number of them. The patients in the latter group require CT, MRI or FDG-PET for the

selection of appropriate therapeutic approaches.

Thus, the aim of our study was to perform a meta-analysis to obtain the estimates of sensitivity

of CT, MRI, and FDG-PET for the detection of colorectal liver metastases on a per-patient and a

per-lesion basis.

�0�

Chapter 7

Literature SearchA comprehensive computer literature search [18] of abstracts about studies in human subjects

was performed to identify articles about the diagnostic performance of CT, MRI, and FDG-PET for

the detection of liver metastases in patients with colorectal cancer compared with the diagnostic

performance of intraoperative US, surgery, follow-up US, and histopathologic analysis as the

reference standard.

The MEDLINE and EMBASE databases, from January 1990 to December 2003, were used with

the following keywords: (“Colorectal Neoplasms” [MeSH]) AND (“Liver neoplasms” [MeSH]) AND

(“Laparoscopy” [MeSH] OR “Tomography, Emission-Computed” [MeSH] OR “magnetic resonance

imaging” [MeSH] OR “Tomography, X-Ray Computed” [MeSH] AND (sensitivity and specificity

[MeSH] OR sensitivity [WORD] OR specificity [WORD] OR false negative [WORD] OR false posi-

tive [WORD] OR diagnosis [MeSH] OR diagnostic use [MeSH] OR detection [WORD] OR accuracy

[WORD]).

Other databases, such as CINAHL and SUMSEARCH, were also checked for relevant articles

with the following keywords: Colorectal Neoplasm [MeSH] AND (Liver Neoplasms [MeSH] OR

Neoplasm Metastasis [MeSH]). The databases of Web of Science and CANCERLIT and the Co-

chrane Database of Systematic Review were checked with the following words: Colorectal cancer

AND (liver metastases OR hepatic metastasis).

Review articles, letters, comments, case reports, unpublished articles, and articles that did not

include raw data were not selected. The list of articles was supplemented with extensive cross-

checking of the reference lists of all retrieved articles.

Selection of StudiesFour observers independently checked all retrieved articles for inclusion criteria. One observer

(S.B.) checked all articles. Three observers checked a subset of articles: One observer (M.S.v.L.)

checked studies that predominantly focused on evaluation of CT, another (M.E.J.P.) checked stud-

ies that predominantly focused on evaluation of MRI and another (E.F.I.C.) checked studies that

predominantly focused on evaluation of FDG-PET. Disagreements were resolved in consensus.

The inclusion criteria were as follows: (1) articles were reported in the English, German, or

French language; (2) CT, MRI or FDG-PET was used to identify and characterize colorectal liver

metastases; (3) histopathologic analysis (performed at surgery, biopsy, and autopsy), intraopera-

tive observation (e.g. manual palpation, intraoperative US) and/or follow-up US were used as the

reference standard; (4) for per-patient or per-lesion statistics, sufficient data were presented to

calculate the true-positive and false-negative values for imaging techniques; and (5) when data or

subsets of data were presented in more than one article, the article with the most details or the

most recent article was chosen.

Studies were excluded if results of different imaging modalities were presented in combination

and could not be differentiated for performance assessment of tests on an individual modality.

�0�

Data ExtractionThe same observers independently extracted relevant data about study (design) characteristics

and examination results, which will be discussed later, from each article by using a standardized

One observer (S.B.) extracted data from all articles. Three observers extracted data from a

subset of articles: One observer extracted only data from studies that predominantly focused on

evaluation of CT (M.S.v.L.), another extracted only data from studies that predominantly focused

on evaluation of MRI (M.E.J.P.), and still another extracted only data from studies that predomi-

nantly focused on evaluation of FDG-PET (E.F.I.C.). Observers were not blinded with regard to

information about the authors, the authors’ affiliation, or the journal name, since this has been

shown to be unnecessary [19]. To resolve disagreement between reviewers, a fifth reviewer (J.S.)

assessed all discrepant items, and the majority opinion was used for analysis.

Study design characteristics

The QUADAS quality assessment tool was used to extract relevant study design characteristics of

each study. This tool and the definitions of the characteristics are fully described elsewhere [20].

Other study characteristics

In addition, the following characteristics were recorded: (a) year of publication; (b) sample size

(number of patients with colorectal liver metastases); (c) description of study population, which

included disease severity (tumor stage), age, and male-female distribution; (d) description of

interpretation of diagnostic tests, which included the reporting of the characterization of lesions

as benign versus malignant or the detailed subcharacterization of lesions as cysts, hemangiomas,

or metastases and the confidence rating used for identification of lesions; and (e) description of

reference tests, which included intraoperative findings (at inspection and/or palpation), intraop-

erative US features (probe frequency), pathologic features (staining, lamination, and thickness of

slices), or follow-up characteristics (interval between examinations, frequencies [how many times

follow-up examinations were performed] and modality).

The following imaging features were extracted: (a) for CT, these features included type of scan-

ner (non-helical, single-section helical, or multisection helical), section thickness, amount of con-

trast agent, and number of phases; (b) for MRI, these features included magnetic field strength,

type of contrast agent (non-specific or liver-specific agents), sequences, type of coil (body coil or

phased-array coil), and section thickness; and (c) for FDG-PET, these features included system type

(dedicated full ring or other), amount of tracer, type of analysis (qualitative or quantitative), and

data acquisition characteristics (timing of scanning and time of scanning per table position).

Examination results

The numbers of true-positive, false-negative, and false-positive results in the detection of liver

metastases were extracted on a per-lesion basis. The numbers of true-positive, false-negative,

false-positive, and true-negative results were also extracted on a per-patient basis. All tabulated

�0�

Chapter 7

results for different readers (interobserver), for multiple observations per reader (intraobserver),

and for multiple CT and MRI systems and/or techniques were counted as separate data sets.

Data and Statistical AnalysisData were separately analyzed for non-helical CT, helical CT, MRI at 1.0 T, MRI at 1.5 T, and FDG-

PET. For each data set, we calculated sensitivity of the imaging techniques as the proportion p of

patients with liver metastases (per patient) or as the proportion of liver metastases (per lesion)

correctly recognized by the imaging modality.

In our statistical analysis, we used logit-transformed sensitivity ln {p /(1 – p)}, where ln is

the natural logarithm. Because of the transformation, these values were approximately normal

distributed, with a variance of 1/[n*p* (1 – p)], where n is the total number of patients with liver

metastases (per patient) or the number of liver metastases (per lesion).

Mean logit sensitivity and the standard error were obtained by means of a random-effects

linear regression model with a mixed-effects approach that was a procedure that was within the

software (SAS; SAS Institute, Cary, NC) [21, 22]. After antilogit transformation of the mean logit

sensitivity, sensitivity estimates with the 95% confidence intervals (CIs) were obtained.

All analyses were performed with statistical software (SPSS 11.5 for Windows, SPSS, Chicago,

Ill; SAS, version 8.02, SAS Institute).

Estimates of sensitivity

For comparison of the sensitivity estimates of the different imaging techniques, we first deter-

mined whether the logit sensitivity values depended on year of publication (1995 or earlier vs.

later than 1995), sample size (≤ 50 vs > 50 patients), and the study design characteristics (“yes”

vs “no” and “unclear” responses). In this analysis, we considered variables as explanatory if the

regression coefficient of the variables was significant (P < 0.05).

Subsequently, we developed a multivariable regression model with which we used a backward

stepwise algorithm to identify only the most important characteristics. Characteristics were re-

tained in the regression model when the P value for them was less than 0.10.

Afterward, logit sensitivity values of the imaging techniques of non-helical CT, helical CT, MRI

at 1.0 T, MRI at 1.5 T, and FDG-PET were compared with each other in this random-effects

regression model, which included all variables that significantly affected the logit sensitivity of

the imaging modalities (set to one, indicating the ideal design, vs. zero) as appropriate; in this

final model, a factor that indicated the type of diagnostic modality was included, and a P value of

less than 0.05 of the regression coefficient of this factor was considered to indicate a significant

difference.

Fit of the final regression model was inspected graphically with evaluation of the histograms

of the residuals and of the random-effects estimates.

When studies contributed two or more sensitivity values, for instance, when results of mul-

tiple readers (interobserver) or multiple observations per readers (intraobserver) were available

or when multiple CT or MRI systems or multiple MRI sequences were evaluated, each sensitivity

�0�

value was counted as a separate data set. We accounted for the likely correlation between such

sensitivity values by calculating so-called robust standard errors, which are provided with the

random-effects linear regression model with the mixed-effects approach [23].

This approach was also used for intramodality intrapatient correlation (in some studies, differ-

ent modalities were compared in the same patient population).

Subgroup analysis 1

Enough data sets were available to perform subgroup analyses for helical CT and MRI at 1.5 T.

For helical CT, subgroup analyses were used to compare section thicknesses (5 mm or smaller

vs. larger than 5 mm), the amounts of administered iodine in the contrast agent (≤ 45 g vs. >

45 g), and the number of phases (one phase [portal phase] vs. two phases [arterial and portal

phases]). For MRI at 1.5 T, non-enhanced MRI, MRI enhanced with gadolinium-based contrast

agents, and MRI enhanced with superparamagnetic iron oxide (SPIO) were compared.

The subgroup analyses were performed only for data on a per-lesion basis, as data on a per-

patient basis were limited.

Subgroup analysis 2

In addition, data sets also were analyzed for different lesion sizes (lesions of < 1 cm vs. lesions

≥ 1 cm). With subgroup analyses, lesion size was compared for helical CT, non-enhanced MRI,

gadolinium-enhanced MRI and SPIO-enhanced MRI for 1.5 T imagers.

This analysis was performed only on a per-lesion basis, as data on a per-patient basis were not

available.

ResUlTs

Literature Search and Selection of StudiesWith the computer search and after extensive cross-checking of reference lists, 315 abstracts

were retrieved.

After reading of the abstracts was performed, 165 articles were found to be eligible. One hun-

dred four of the 165 relevant articles were excluded because (a) researchers in the articles did not

report data about the use of CT, MRI or FDG-PET for identification and characterization of colorec-

tal liver metastases (n = 18); (b) researchers in the articles did not use histopathologic analysis,

intraoperative observation, including manual palpation and intraoperative US, and/or follow-up

US as the reference standard (n = 25); (c) researchers in the articles did not report data that could

be used to construct or calculate true-positive, false-positive, true-negative, and/or false-negative

results (n = 54); or (d) researchers in the articles presented results from a combination of differ-

ent imaging modalities that could not be differentiated for assessment of single tests (n = 7).

Sixty-one articles fulfilled all inclusion criteria and were selected for data extraction and data

analysis.

Chapter 7

Study Design CharacteristicsMost studies (table 1) had a suboptimal design in regard to the period between the time when the

reference standard was performed and the time when the index test was performed (67.2% for

“no” responses to question 4), the description of the execution of the reference standard (63.9%

for “no” responses to question 8b), the interpretation of the reference standard results without

knowledge of the index test results (91.2% for “no” responses to question 9b), the availability

of clinical data when test results were interpreted (70.5% for “no” responses to question 10),

reporting of uninterpretable and/or intermediate test results (98.4% for “no” responses to ques-

tion 11), and explanation of withdrawals from the study (75.4% for “no” responses to question

Table 1. Results of distribution of study design characteristics in 61 studies

Question about Study Design Characteristic Response* Yes No

1. Was the spectrum of patients representative of the patients who receive the test in practice?

2. Were selection criteria clearly described? 36 25

3. Is the reference standard likely to help correctly classify the target condi-tion?

4. Is the time period between performance of reference standard and index test short enough?

5. Did the whole sample or a random selection of the sample, receive verifi-cation using a reference standard?

6. Did patients undergo examination with the same reference standard re-gardless of the index test result?

7. Was the reference standard performed independently of the index test? 52 9

8a. Was the execution of the index test described in sufficient detail to permit replication of the test?

8b. Was the execution of the reference standard described in sufficient detail to permit replication of the test?

9a. Were the index test results interpreted without knowledge of the results of the reference standard?

9b. Were the reference standard results interpreted without knowledge of the results of the index test?

10. Were the same clinical data available when test results were interpreted as would be available in practice?

11. Were uninterpretable and/or intermediate test results reported? 1 60

12. Were withdrawals from the study explained? 15 46

13. Was the data collected after the research question defined? 36 25

* Data are the numbers of responses from the QUADAS tool. The numbers indicate how many articles were assigned as score of “yes” (for the QUADAS tool) and how many articles were assigned a score of “No”. The responses of “No” and “Unclear” were summarized together.

12). It is impossible to perform an ideal study; however, as the choice for a treatment strategy

strongly depends on the outcome of the diagnosis. The description of the execution of the refer-

ence standard remains a problem in studies of diagnostic performance of modalities.

Other Study Design CharacteristicsThe age of the patients included in the selected studies ranged from 12 to 93 years, with a

mean age of 61.0 years and a total of 3187 patients. In 57 studies, the sex distribution was

described: 1733 patients were male and 1128 patients were female. In all studies, imaging data

were presented about the identification of lesions; in only nine studies was a confidence rating

scale presented. In 31 of 61 studies, lesions were characterized (benign lesions were distinguished

from malignant lesions in 28 studies and detailed subcharacterization of lesions was included in

three studies). The reference standard was intraoperative observation (palpation) in 43 studies,

intraoperative US in 37, pathologic analysis in 54, and follow-up US in 31. The frequency of the

transducer used for intraoperative US varied from 5.0 to 7.5 MHz. In 11 of 54 studies in which

pathologic analysis was used as the reference standard, the method of analysis (e.g., staining or

lamination) and the thickness of slices were described.

Examination ResultsTable 2 presents the included data sets (per-patient basis and per-lesion basis), with correspond-

ing numbers of patients and reference numbers. A full list of all included articles with all relevant

study characteristics and complete examination results is available on request from the authors

of this article.

The range in section thickness at CT was 5–12 mm (median, 10 mm), and that for MRI, 5–10

mm (median, 10 mm). The range in the amount of iodine in the administered contrast agent dur-

ing CT (reported in 23 studies) was 30–60 g. In 15 studies, either non-specific gadolinium chelates

or liver-specific MRI contrast agents, such as SPIO and gadobenate dimeglumine, were used. In

most of the studies (15 of 21) about FDG-PET, the images were qualitatively analyzed (uptake of

Table 2. Study characteristics of included data sets for each imaging modality

Imaging modality No. of Data setsand Articles

No. of Patientsin Study

References

Non-helical CT 58, 28 1915 10, 24-50

Helical CT 53, 15 621 51-65

1.0 T MRI 34, 5 173 57, 66-69

1.5 T MRI 102, 12 391 27, 51, 53, 54, 70-77

FDG-PET 26, 21 1058 41-50, 52, 59, 64, 77-83

�0�

Chapter 7

Sensitivity Estimates on Per-Patient BasisFollowing the backward stepwise regression analysis, several variables were identified as signifi-

cant predictors of the diagnostic performance of non-helical CT and FDG-PET for assessment of

colorectal liver metastases on a per-patient basis (table 3). No predictors were identified for helical

CT and MRI at 1.5 T. No data sets were available for MRI at 1.0 T. In the final models, all significant

variables were included as covariates.

The sensitivity estimates for non-helical CT, helical CT, MRI at 1.5 T, and FDG-PET were 60.2%

(95% CI: 55.7%-64.6%), 64.7% (95% CI: 30.4%-88.5%), 75.8% (95% CI: 55.9%-88.6%), and

94.6% (95% CI: 92.5%-96.1%), respectively (figure). FDG-PET had a significantly higher sensitivity

estimate compared with that of non-helical CT (P < 0.001), helical CT (P = 0.003), and MRI at 1.5

T (P < 0 .001).

Sensitivity Estimates on Per-Lesion BasisSeveral variables were identified as significant predictors of the diagnostic performance of non-

helical CT, helical CT, MRI at 1.0 T, and MRI at 1.5 T for assessment of colorectal liver metastases

(table 3) on a per-lesion basis. No predictors were found for FDG-PET.

Overall sensitivity estimates for non-helical CT, helical CT, MRI at 1.0 T, MRI at 1.5 T, and

FDG-PET were 52.3%, 63.8%, 66.1%, 64.4%, and 75.9% (table 4). Non-helical CT had the lowest

sensitivity estimate compared with helical CT (P < 0.017), MRI at 1.0 T (P < 0.001), MRI at 1.5 T

(P < 0.001), and FDG-PET (P < 0.003).

Sensitivity Estimates for each Imaging Modality on per-patientFigure shows sensitivity estimates of 60.2%, 64.7%, 75.8%, and 94.6% with 95% confidence intervals for non-helical CT, helical CT, MRI at 1.5 T, and FDG-PET, respectively on a per-patient basis. FDG-PET was the most accurate modality. A comparison of FDG-PET with non-helical CT, helical CT, and MRI at 1.5 T yielded three p values (P < 0.001, P = 0.003, P < 0 .001).

Non-helical CT Helical CT MRI 1.5T FDG-PET

Sensitivity

Per-patient basisPer-patient basis

�0�

Subgroup Analysis 1For helical CT, subgroup analyses included a comparison of a section thickness of 5 mm (no data

about section thickness of < 5 mm were available) with a section thickness of larger than 5 mm, a

comparison of the amount of iodine in the contrast agent of 45 g or less with an amount of more

than 45 g, and a comparison of the number of phases (one phase [portal phase] vs. two phases

[arterial and portal phases]).

Sensitivity estimates for a section thickness of 5 mm and a section thickness of larger than 5

mm were comparable: 68.2% and 69.1%, respectively. For the amount of iodine of 45 g or less

Table 3. Predictors identified with backward regression analysis for each imaging modality

Modality Covariates Regressioncoefficient *

P value

Per Patient

Non-helical CT (19 data sets)

Reference standard helped to correctly clas-sify the target condition

–0.85(–1.37, –0.32)

< 0.002

Reference standard results were interpreted without knowledge of index test results

–0.55(–0.88, –0.21)

< 0.002

Helical CT (2 data sets)

No predictors found

1.5 T MRI (6 data sets)

No predictors found

FDG-PET (15 data sets)

Index test results were interpreted without knowledge of reference standard results

1.07(0.40, 1.75)

< 0.002

Per-lesion

Non-helical CT (22 data sets)

Execution of the index test was described in sufficient detail

0.87(0.38, 1.36)

< 0.001

Reference standard results were interpreted without knowledge of index test results

–0.61(–0.94, –0.28)

< 0.001

Helical CT (39 data sets)

–0.78(–1.40, –0.17)

0.0124

Spectrum of patients was representative of patients in practice

0.08(0.07, 0.09)

< 0.001

Was the reference standard performed independently of the index test

–0.77(–0.89, –0.65)

< 0.001

–1.85(–2.24, –1.46)

< 0.001

FDG-PET (9 data sets)

No predictors found

* Numbers in parentheses are 95% confidence intervals. A positive regression coefficient indicates better discrimi-natory power of the imaging modality in studies with that characteristic compared with that in studies without the corresponding characteristics and a negative regression coefficient indicates reduced diagnostic performance in studies with that characteristic.

��0

Chapter 7

and that of more than 45 g, the estimates were 61.4% and 64.0%, respectively. Although the

sensitivity estimate for the portal phase was higher (71.4%) compared with that of the portal and

arterial phases (65.7%), this difference was not significant.

For MRI at 1.5 T, non-enhanced MRI, gadolinium-enhanced MRI, and SPIO-enhanced MRI were

compared. Estimates of sensitivity for non-enhanced MRI, gadolinium-enhanced MRI, and SPIO-

enhanced MRI were 59.8%, 78.2%, and 73.2%, respectively. Sensitivity estimates for gadolinium-

enhanced MRI (P = 0.019) and SPIO-enhanced MR (P < 0.001) were significantly higher compared

with the estimate for non-enhanced MRI.

In addition, sensitivity estimates for gadolinium-enhanced MRI (P = 0.02) and SPIO-enhanced

MRI (P < 0.001) were significantly higher compared with the estimate for helical CT with 45 g or

less of iodine.

Subgroup Analysis 2Sensitivity estimates for non-helical CT, helical CT, non-enhanced MRI, gadolinium-enhanced MRI,

and SPIO-enhanced MRI for lesions smaller than 1 cm were 25.3% (95% CI: 15.9%-37.6%),

23.1% (95% CI: 7.0%-54.7%), 12.6% (95% CI: 8.0%-17.5%), 11.6% (95% CI: 9.5%-14.2%), and

29.3% (95% CI: 18.2%-43.6%), respectively. No differences were found between the imaging

modalities.

Table 4. Sensitivity Estimates for Non-helical CT, Helical CT, 1.0 T MRI, 1.5 T MRI, and FDG-PET on a Per-Lesion Basis

Modality Subgroups Sensitivity Estimates*

Non-helical CT Overall 52.3% (52.1-52.5)†

Helical CT Overall 63.8% (54.4-72.2)†

Slice thickness of 5 mm 68.2% (50.5-81.9)†

Slice thickness of > 5 mm 69.1% (59.8-77.1)†

Amount of Iodine of ≤ 45 gr 61.4% (43.5-76.6)†

Amount Iodine of > 45 gr 64.0% (55.1-72.0)†

Two phases (arterial and portal phases) 65.7% (56.8-73.7)†

One phase (portal phase only) 71.4% (57.7-82.1)†

1.0 T MRI Overall 66.1% (65.9-66.3)†

1.5 T MRI Overall 64.4% (57.8-70.5)†

Non-enhanced MRI 59.8% (49.0-69.7)†

Gadolinium-enhanced MRI 78.2% (63.0-88.3)‡

SPIO-enhanced MRI 73.2% (62.3-81.9)‡

FDG-PET Overall 75.9% (61.1-86.3)†

* Sensitivity estimates were obtained by means of a logit-transformed data analysis, and percentages were not cal-culated with raw data numbers. Numbers in parentheses are 95% confidence intervals expressed as percentages. † Significantly higher compared with non-helical CT‡ Significantly higher compared with non-enhanced MRI and amount of Iodine of 45 gr or less.

��

Sensitivity estimates for non-helical CT, helical CT, non-enhanced MRI, gadolinium-enhanced MRI,

and SPIO-enhanced MRI for lesions of 1 cm or larger were 74.3% (95% CI: 66.5%-80.9%), 73.5%

(95% CI: 62.2%- 82.4%), 65.7% (95% CI: 56.4%-73.9%), 68.8% (95% CI: 61.9%-75.0%), and

90.2% (95% CI: 87.5%-92.4%), respectively. The sensitivity estimate for SPIO-enhanced MRI was

significantly higher (P < 0.001).

DIsCUssIon

In this meta-analysis, we found that on a per-patient basis, FDG-PET was most accurate for de-

tection of colorectal liver metastases. On a per-lesion basis, helical CT, MRI at 1.0 T, MRI at

1.5 T, and FDG-PET were comparable and significantly more accurate than was non-helical CT.

Data about subgroup analyses indicated no differences between section thicknesses, amounts of

administered iodine, and numbers of phases for helical CT. Gadolinium-enhanced MRI and SPIO-

enhanced MRI, however, were significantly better compared with non-enhanced MRI and helical

CT with an amount of iodine of 45 g or less.

As treatment policies differ for liver metastases of various cancerous origins (pancreatic cancer

or colorectal cancer), only data about colorectal cancer were extracted and analyzed.

To avoid selection bias, not only the MEDLINE database but also the CINAHL, SUMSEARCH,

Web of Science, and CANCERLIT databases and the Cochrane Database of Systematic Review

were searched for relevant articles. In addition, all reference lists were checked manually.

To minimize bias in the selection of studies and in data extraction, reviewers independently

selected articles on the basis of inclusion criteria, and scores were assigned to study design char-

acteristics and examination results by using a standardized form that was based on the QUADAS

tool. The QUADAS tool is an evidence-based quality assessment tool, which was developed for

use in systematic reviews of studies of diagnostic accuracy [20].

Data were analyzed by means of a random-effects approach, which accounts for the hetero-

geneity between studies caused by different threshold settings (as in regular summary receiver

operating characteristic curves) [84, 85], for the error of estimation of the sensitivity values in

each study that represents the size of the population, and finally for the residual heterogeneity

that may remain even after adjustment for study design characteristics [86, 87]. Lijmer et al [88]

showed that studies of diagnostic performance of modalities with methodological shortcomings

may cause overestimation of the accuracy of a diagnostic test; we, therefore, evaluated the effect

of these characteristics on diagnostic performance and made adjustments when appropriate.

During the years, substantial improvements in CT (e.g., introduction of spiral CT, multisec-

tion CT) and MRI (e.g., liver-specific agents and more widespread use of higher magnetic field

strength) have been introduced [13-16]. To account for these improvements, data for techniques

were analyzed separately, and, if possible, subgroup analyses were performed. In addition, data

were extracted and analyzed on a per-patient, as well as on a per-lesion, basis. This is important,

since the treatment policy depends not only on distinguishing patients with or without liver me-

tastases but also on the number, size, location, and surgical margin of the liver metastases in the

��

Chapter 7

first group. Thereby, the prevalence of liver metastases in patients with primary colorectal cancer

is high.

A potential limitation of any meta-analysis is the possibility of publication bias. We did not per-

form any analysis for detection of and adjustment for publication bias. A recent systematic review

conducted by the Cochrane Collaboration showed that a large number of methods have been

developed, and when the methods are compared with one another, results can provide different

estimates in terms of direction and magnitude of publication bias [89]. In addition, studies about

publication bias focus mostly on randomized trials, and these studies are registered; the registra-

tion of studies about diagnostic studies is either limited or difficult to achieve. We attempted to

examine publication bias by using an evaluation of whether the size of studies was associated

with the results for diagnostic accuracy. In particular, small studies with optimistic results may be

published more easily than small studies with unfavorable results. Larger studies with optimistic

results may also be published more easily than larger studies with unfavorable results, but this

difference usually is smaller. There was no association between sample size and diagnostic per-

formance.

Characteristics of the patients, such as the stage of disease, differentiation between synchro-

nous and metachronous liver metastases, presence of extrahepatic disease, and age or sex distri-

bution, are also important for diagnostic accuracy, but because of variation in data presentation

or incomplete reporting of data, the effect of these variables could not be examined.

The reference standard used in this systematic review ranged from histopathologic analysis

to follow-up US. It was impossible to examine the effect of each reference test on diagnostic

accuracy. As stated in the Standards for Reporting of Diagnostic Accuracy initiative, a reference

standard can be either a single method or a combination of methods to establish the presence of

the target condition [90]. The major problem, however, was the absence of critical information,

such as data about the execution of the reference test, the confidence rating, or the characteriza-

tion of lesions, and these data were insufficiently described or not described in a large subset of

articles. This has also been described by authors of other meta-analyses [91, 92]. Therefore, the

Standards for Reporting of Diagnostic Accuracy initiative was developed to improve the quality of

the reporting of diagnostic studies. The items in the checklist and the flowchart can help authors

in describing essential elements of the design and conduct of the study, the execution of tests,

and the results.

Another limitation is the consideration of 2 x 2 tables for different readers, for multiple obser-

vations per reader, and for multiple CT and MRI techniques as separate data sets. This has been

performed to avoid selection bias. We are aware of the dependency in data sets from the same

patient population. Analysis of this dependency is not possible with our software, as the random-

effects linear regression model with mixed-effects approach is able to adjust for this potential

dependency only if the same numbers of data sets in each study are available. We examined this

correlation by using the empirical standard error calculated by using the “sandwich estimator,”

which is possible with the software for the regression model with mixed-effects approach [25].

We also used this approach to adjust for correlation between imaging modalities performed in

the same patient population.

��

A final limitation of this study was the absence of information on specificity values. On a per-

patient basis, specificity can be calculated. To minimize selection bias, we included all studies in

which data were presented about colorectal liver metastases (also other conditions) and not stud-

ies in which only colorectal liver metastases were presented. We summarized and analyzed only

data on colorectal liver metastases. The specificity can be underestimated in studies in which only

colorectal liver metastases are evaluated and overestimated in studies in which other conditions

are also evaluated.

Kinkel et al [17] also performed a meta-analysis to compare US, CT, MRI, and FDG-PET in the

detection of liver metastases. In studies with specificity higher than 85%, the sensitivity for US

was 55%, and that for CT was 72%, that for MRI was 76%, and that for FDG-PET was 90%. They,

however, analyzed hepatic metastases that originated from colorectal, gastric, and esophageal

cancers. Treatment policies and reference standards differ for liver metastases of different origins,

and, therefore, the importance of certain findings differs. In addition, the literature search was

performed only in the MEDLINE database, and some study design characteristics were used to

include studies, thereby introducing selection bias. Moreover, they combined results on a per-

patient basis and on a per-lesion basis, thereby causing overestimation of the diagnostic accuracy

of FDG-PET. In our per-patient analysis, FDG-PET had a significantly higher sensitivity estimate

(94.6%) compared with that of helical CT, non-helical CT, and MRI, although the sensitivity esti-

mate was comparable on a per-lesion basis. Analysis of combined data would, therefore, lead to

overestimation of the diagnostic accuracy of FDG-PET.

Although on a per-patient basis, FDG-PET was found to be most accurate, the treatment

policy depends not only on distinguishing patients with liver metastases from patients without

liver metastases but also mainly on the number, size, location, and surgical margin of the liver

metastases. In addition, FDG-PET mostly was performed in selected patients [10, 49] or with a

long time between CT and FDG-PET (> 4 weeks) [42, 46, 49], thereby increasing the detection of

liver metastases by using FDG-PET compared with the detection by using CT. In general, the time

between diagnostic tests should be short to avoid differences in disease status. In several studies,

scans were not corrected for attenuation with CT [46, 64, 78].

Because of its non-invasive character, low cost, and widespread availability, US can be used

to help distinguish patients with diffuse disease who are not eligible for curative treatment from

the group of patients with no liver metastases or the group with a limited number of them. The

patients in the latter group should undergo CT, MRI, or FDG-PET. On a per-lesion basis, helical

CT, MRI at 1.0 T, MRI at 1.5 T, and FDG-PET were comparable and significantly more accurate

compared with non-helical CT. In the subgroup analyses, however, SPIO-enhanced MRI and gado-

linium-enhanced MRI were significantly more accurate compared with non-enhanced MRI and

helical CT performed with a contrast agent that has 45 g or less of iodine.

The choice between portal phase helical CT performed with more than 45 g of iodine and MRI

with a gadolinium-based contrast agent or SPIO should, therefore, also depend on availability and

expertise and not on diagnostic accuracy only. The role of FDG-PET at this moment is limited and,

therefore, it will be used mainly as an additional imaging modality for detection of extrahepatic

disease.

��

Chapter 7

References

1. Jatzko GR, Lisborg PH, Stettner HM, Klimpfinger MH. Hepatic resection for metastases from colorectal carcinoma: a survival analysis. Eur J Cancer 1995;31A:41–46.

2. Castells A, Bessa X, Daniels M, et al. Value of postoperative surveillance after radical surgery for colorectal cancer: results of a cohort study. Dis Colon Rectum 1998;41:714–723.

3. Komborozos VA, Skrekas GJ, Pissiotis CA. The contribution of follow-up programs in the reduction of mortality of rectal cancer recurrences. Dig Surg 2001;18:403–408.

4. Safi F, Link KH, Beger HG. Is follow-up of colorectal cancer patients worthwhile? Dis Colon Rectum 1993;36:636–643.

5. Ketteniss M, Schutz G, Ulrich B. Costs and efficiency of a tumor follow-up program for the detection of colorectal liver metastases. Int J Colorectal Dis 2001;16:28–31.

6. Albrecht T, Blomley MJ, Burns PN, et al. Improved detection of hepatic metastases with pulse-inversion US during the liver-specific phase of SHU 508A: multicenter study. Radiology 2003;227:361–370.

7. Balfe DM. Hepatic metastases from colorectal cancer: radiologic strategies for improved selection. Radiol-ogy 1992;185:18–19.

8. Bellin MF, Zaim S, Auberton E, et al. Liver metastases: safety and efficacy of detection with superparamag-netic iron oxide in MRI. Radiology 1994;193:657–663.

9. Bernatik T, Strobel D, Hahn EG, Becker D. Detection of liver metastases: comparison of contrast-enhanced wide-band harmonic imaging with conventional ultrasonography. J Ultrasound Med 2001;20:509–515.

10. Boykin KN, Zibari GB, Lilien DL, McMillan RW, Aultman DF, McDonald JC. The use of FDG-positron emission tomography for the evaluation of colorectal metastases of the liver. Am Surg 1999;65:1183–1185.

11. Caudana R, Morana G, Pirovano GP, et al. Focal malignant hepatic lesions: MR imaging enhanced with gadolinium benzyloxypropionictetra-acetate (BOPTA)—preliminary results of phase II clinical application. Radiology 1996;199:513–520.

12. Ch’en IY, Katz DS, Jeffrey RB Jr, et al. Do arterial phase helical CT images improve detection or character-ization of colorectal liver metastases? J Comput Assist Tomogr 1997;21:391–397.

13. Kamel IR, Choti MA, Horton KM, et al. Surgically staged focal liver lesions: accuracy and reproducibility of dual-phase helical CT for detection and characterization. Radiology 2003;227:752–757.

14. Kim MJ, Kim JH, Chung JJ, Park MS, Lim JS, Oh YT. Focal hepatic lesions: detection and characteriza-tion with combination gadolinium- and superparamagnetic iron oxide–enhanced MR imaging. Radiology 2003;228:719–726.

15. Dill-Macky MJ, Burns PN, Khalili K, Wilson SR. Focal hepatic masses: enhancement patterns with SH U 508A and pulse-inversion US. Radiology 2002;222:95–102

16. Even-Sapir E, Lerman H, Figer A, et al. Role of (18)F-FDG dual-head gamma-camera coincidence imaging in recurrent or metastatic colorectal carcinoma. J Nucl Med 2002;43:603–609.

17. Kinkel K, Lu Y, Both M, Warren RS, Thoeni RF. Detection of hepatic metastases from cancers of the gastrointestinal tract by using noninvasive imaging methods (US, CT, MR imaging, PET): a meta-analysis. Radiology 2002;224:748–756.

18. Deville WL, Bezemer PD, Bouter LM. Publications on diagnostic test evaluation in family medicine journals: an optimal search strategy. J Clin Epidemiol 2000;53:65–69.

19. Berlin JA. Does blinding of readers affect the results of meta-analyses? University of Pennsylvania Meta-analysis Blinding Study Group. Lancet 1997;350:185-186.

20. Whiting P, Rutjes AW, Reitsma JB, Bossuyt PM, Kleijnen J. The development of QUADAS: a tool for the quality assessment of studies of diagnostic accuracy included in systematic reviews. BMC Med Res Meth-odol 2003;3:25.

21. Van Houwelingen HC, Zwinderman KH, Stijnen T. A bivariate approach to meta-analysis. Stat Med 1993;12:2273–2284.

��

22. Van Houwelingen HC, Arends LR, Stijnen T. Advanced methods in meta-analysis: multivariate approach and meta-regression. Stat Med 2002;21:589–624.

23. Liang KY, Zeger SL. Regression analysis for correlated data. Annu Rev Public Health 1993;14:43–68.

24. Acunas B, Rozanes I, Acunas G, et al. Preoperative CT staging of colon carcinoma (excluding the recto-sigmoid region). Eur J Radiol 1990;11:150–153.

25. Lundstedt C, Ekberg H, Hederstrom E, Tranberg KG. The accuracy of computed tomography of the liver in colo-rectal carcinoma. Clin Radiol 1990;42:335–339.

26. Machi J, Isomoto H, Kurohiji T, et al. Accuracy of intraoperative ultrasonography in diagnosing liver metas-tasis from colorectal cancer: evaluation with postoperative follow-up results. World J Surg 1991;15:551–556.

27. Wernecke K, Rummeny E, Bongartz G, et al. Detection of hepatic masses in patients with carcinoma: comparative sensitivities of sonography, CT, and MR imaging. AJR Am J Roentgenol 1991;157:731–739.

28. Yamaguchi A, Ishida T, Nishimura G, et al. Detection by CT during arterial portography of colorectal cancer metastases to liver. Dis Colon Rectum 1991;34:37–40.

29. Oudkerk M, van Ooijen B, Mali SP, et al. Liver metastases from colorectal carcinoma: detection with continuous CT angiography. Radiology 1992;185:157–161.

30. Soyer P, Levesque M, Elias D, Zeitoun G, Roche A. Preoperative assessment of resectability of hepatic metastases from colonic carcinoma: CT portography vs sonography and dynamic CT. AJR Am J Roentgenol 1992;159:741–744.

31. Soyer P, Levesque M, Elias D, Zeitoun G, Roche A. Detection of liver metastases from colorectal cancer: comparison of intraoperative US and CT during arterial portography. Radiology 1992;183:541–544.

32. Kerner BA, Oliver GC, Eisenstat TE, Rubin RJ, Salvati EP. Is preoperative computerized tomography useful in assessing patients with colorectal carcinoma? Dis Colon Rectum 1993;36:1050–1053.

33. Knol JA, Marn CS, Francis IR, et al. Comparisons of dynamic infusion and delayed computed tomography, intraoperative ultrasound, and palpation in the diagnosis of liver metastases. Am J Surg 1993;165:81–87.

34. Ohlsson B, Tranberg KG, Lundstedt C, Ekberg H, Hederstrom E. Detection of hepatic metastases in colorec-tal cancer: a prospective study of laboratory and imaging methods. Eur J Surg 1993;159:275–281.

35. Soyer P, Elias D, Zeitoun G, Roche A, Levesque M. Surgical treatment of hepatic metastases: impact of intraoperative sonography. AJR Am J Roentgenol 1993;160:511–514.

36. Stewart PJ, Chu JM, Kos SC, Chapuis PH, Bokey EL. Intra-operative ultrasound for the detection of hepatic metastases from colorectal cancer. Aust N Z J Surg 1993;63:530–534.

37. Paul MA, Mulder LS, Cuesta MA, et al. Impact of intraoperative ultrasonography on treatment strategy for colorectal cancer. Br J Surg 1994;81:1660–1663.

38. Leen E, Angerson WJ, Wotherspoon H, et al. Detection of colorectal liver metastases: comparison of lapa-rotomy, CT, US, and Doppler perfusion index and evaluation of postoperative follow-up results. Radiology 1995;195:113–116.

39. Carter R, Hemingway D, Cooke TG, et al. A prospective study of six methods for detection of hepatic colorectal metastases. Ann R Coll Surg Engl 1996;78:27–30.

40. van Ooijen B, Oudkerk M, Schmitz PI, Wiggers T. Detection of liver metastases from colorectal carcinoma: is there a place for routine computed tomography arteriography? Surgery 1996;119:511–516.

41. Lai DT, Fulham M, Stephen MS, et al. The role of whole-body positron emission tomography with [18F]fluorodeoxyglucose in identifying operable colorectal cancer metastases to the liver. Arch Surg 1996;131:703–707.

42. Delbeke D, Vitola JV, Sandler MP, et al. Staging recurrent metastatic colorectal carcinoma with PET. J Nucl Med 1997;38:1196–1201.

43. Ogunbiyi OA, Flanagan FL, Dehdashti F, et al. Detection of recurrent and metastatic colorectal cancer: comparison of positron emission tomography and computed tomography. Ann Surg Oncol 1997;4:613–620.

��6

Chapter 7

44. Abdel-Nabi H, Doerr RJ, Lamonica DM, et al. Staging of primary colorectal carcinomas with fluorine-18 fluorodeoxyglucose whole-body PET: correlation with histopathologic and CT findings. Radiology 1998;206:755–760.

45. Yasuda S, Makuuchi Y, Sadahiro S, et al. Colorectal cancer recurrence in the liver: detection by PET. Tokai J Exp Clin Med 1998;23:167–171.

46. Valk PE, Abella-Columna E, Haseman MK, et al. Whole-body PET imaging with [18F]fluorodeoxyglucose in management of recurrent colorectal cancer. Arch Surg 1999;134:503–511.

47. Zhuang H, Sinha P, Pourdehnad M, et al. The role of positron emission tomography with fluorine-18-de-oxyglucose in identifying colorectal cancer metastases to liver. Nucl Med Commun 2000;21:793–798.

48. Whiteford MH, Whiteford HM, Yee LF, et al. Usefulness of FDG-PET scan in the assessment of suspected metastatic or recurrent adenocarcinoma of the colon and rectum. Dis Colon Rectum 2000;43:759–767.

49. Hung GU, Shiau YC, Tsai SC, et al. Value of 18F-fluoro-2-deoxyglucose positron emission tomography in the evaluation of recurrent colorectal cancer. Anticancer Res 2001;21:1375–1378.

50. Johnson K, Bakhsh A, Young D, Martin TE Jr, Arnold M. Correlating computed tomography and positron emission tomography scan with operative findings in metastatic colorectal cancer. Dis Colon Rectum 2001;44:354–357.

51. Hagspiel KD, Neidl KF, Eichenberger AC, Weder W, Marincek B. Detection of liver metastases: comparison of superparamagnetic iron oxide–enhanced and unenhanced MR imaging at 1.5 T with dynamic CT, intraoperative US, and percutaneous US. Radiology 1995;196:471–478.

52. Vitola JV, Delbeke D, Sandler MP, et al. Positron emission tomography to stage suspected metastatic colorectal carcinoma to the liver. Am J Surg 1996;171:21–26.

53. Strotzer M, Gmeinwieser J, Schmidt J, et al. Diagnosis of liver metastases from colorectal adenocarcinoma: comparison of spiral-CTAP combined with intravenous contrast-enhanced spiral-CT and SPIO-enhanced MR combined with plain MR imaging. Acta Radiol 1997;38:986–992.

54. Lencioni R, Donati F, Cioni D, et al. Detection of colorectal liver metastases: prospective comparison of unenhanced and ferumoxides-enhanced magnetic resonance imaging at 1.5 T, dual-phase spiral CT, and spiral CT during arterial portography. MAGMA 1998;7:76–87.

55. Valls C, Lopez E, Guma A, et al. Helical CT versus CT arterial portography in the detection of hepatic metastasis of colorectal carcinoma. AJR Am J Roentgenol 1998;170:1341–1347.

56. Rahusen FD, Cuesta MA, Borgstein PJ, et al. Selection of patients for resection of colorectal metastases to the liver using diagnostic laparoscopy and laparoscopic ultrasonography. Ann Surg 1999;230:31–37.

57. Ward J, Naik KS, Guthrie JA, Wilson D, Robinson PJ. Hepatic lesion detection: comparison of MR imaging after the administration of superparamagnetic iron oxide with dual-phase CT by using alternative–free response receiver operating characteristic analysis. Radiology 1999;210:459–466.

58. Schmidt J, Strotzer M, Fraunhofer S, Boedeker H, Zirngibl H. Intraoperative ultrasonography versus helical computed tomography and computed tomography with arterioportography in diagnosing colorectal liver metastases: lesion-by-lesion analysis. World J Surg 2000;24:43–47.

59. Arulampalam T, Costa D, Visvikis D, et al. The impact of FDG-PET on the management algorithm for recur-rent colorectal cancer. Eur J Nucl Med 2001;28:1758–1765.

60. Baulieu F, Bourlier P, Scotto B, et al. The value of immunoscintigraphy in the detection of recurrent colorec-tal cancer. Nucl Med Commun 2001;22:1295–1304.

61. Scott DJ, Guthrie JA, Arnold P, et al. Dual phase helical CT versus portal venous phase CT for the detection of colorectal liver metastases: correlation with intra-operative sonography, surgical and pathological find-ings. Clin Radiol 2001;56:235–242.

62. Valls C, Andia E, Sanchez A, et al. Hepatic metastases from colorectal cancer: preoperative detection and assessment of resectability with helical CT. Radiology 2001;218:55–60.

63. Haider MA, Amitai MM, Rappaport DC, et al. Multi–detector row helical CT in preoperative assessment of

small (≤ 1.5 cm) liver metastases: is thinner collimation better? Radiology 2002;225:137–142.

��7

64. Ruers TJ, Langenhoff BS, Neeleman N, et al. Value of positron emission tomography with [F-18]fluorodeox-yglucose in patients with colorectal liver metastases: a prospective study. J Clin Oncol 2002;20:388–395.

65. Pijl ME, Wasser MN, Joekes EC, van De Velde CJ, Bloem JL. Metastases of colorectal carcinoma: compari-son of soft and hard copy helical CT interpretation. Radiology 2003;227:747–751.

66. Soyer P, Laissy JP, Sibert A, et al. Hepatic metastases: detection with multisection FLASH MR imaging during gadolinium chelate–enhanced arterial portography. Radiology 1993;189:401–405.

67. Soyer P, Laissy JP, Sibert A, et al. Focal hepatic masses: comparison of detection during arterial portogra-phy with MR imaging and CT. Radiology 1994;190:737–740.

68. Ward J, Chen F, Guthrie JA, et al. Hepatic lesion detection after superparamagnetic iron oxide enhance-ment: comparison of five T2-weighted sequences at 1.0 T by using alternative–free response receiver operating characteristic analysis. Radiology 2000;214:159–166.

69. del Frate C, Bazzocchi M, Mortele KJ, et al. Detection of liver metastases: comparison of gadobenate dimeglumine–enhanced and ferumoxides-enhanced MR imaging examinations. Radiology 2002;225:766–772.

70. Soyer P, Levesque M, Caudron C, et al. MRI of liver metastases from colorectal cancer vs. CT during arterial portography. J Comput Assist Tomogr 1993;17:67–74.

71. Peterson MS, Baron RL, Murakami T. Hepatic malignancies: usefulness of acquisition of multiple arterial and portal venous phase images at dynamic gadolinium-enhanced MR imaging. Radiology 1996;201:337–345.

72. Oudkerk M, van den Heuvel AG, Wielopolski PA, et al. Hepatic lesions: detection with ferumoxide-en-hanced T1-weighted MR imaging. Radiology 1997;203:449–456.

73. Pijl MEJ, Wasser MNJM, van Persijn van Meerten EL, et al. Comparison of inversion-recovery gradient- and spin-echo and fast spin-echo techniques in the detection and characterization of liver lesions. Radiology 1998;209:427–434.

74. Semelka RC, Cance WG, Marcos HB, Mauro MA. Liver metastases: comparison of current MR tech-niques and spiral CT during arterial portography for detection in 20 surgically staged cases. Radiology 1999;213:86–91.

75. Matsuo M, Kanematsu M, Itoh K, et al. Detection of malignant hepatic tumors: comparison of gadolinium- and ferumoxide-enhanced MR imaging. AJR Am J Roentgenol 2001;177:637–643.

76. Ward J, Guthrie JA, Wilson D, et al. Colorectal hepatic metastases: detection with SPIO-enhanced breath-hold MR imaging—comparison of optimized sequences. Radiology 2003;228(3):709–718.

77. Yang M, Martin DR, Karabulut N, Frick MP. Comparison of MR and PET imaging for the evaluation of liver metastases. J Magn Reson Imaging 2003;17:343–349.

78. Schiepers C, Penninckx F, De Vadder N, et al. Contribution of PET in the diagnosis of recurrent colorectal cancer: comparison with conventional imaging. Eur J Surg Oncol 1995;21:517–522.

79. Flanagan FL, Dehdashti F, Ogunbiyi OA, Kodner IJ, Siegel BA. Utility of FDG-PET for investigating unex-plained plasma CEA elevation in patients with colorectal cancer. Ann Surg 1998;227:319–323.

80. Staib L, Schirrmeister H, Reske SN, Beger HG. Is (18)F-fluorodeoxyglucose positron emission tomography in recurrent colorectal cancer a contribution to surgical decision making? Am J Surg 2000;180:1–5.

81. Willkomm P, Bender H, Bangard M, et al. FDG PET and immunoscintigraphy with 99mTc-labeled antibody fragments for detection of the recurrence of colorectal carcinoma. J Nucl Med 2000;41:1657–1663.

82. Rohren EM, Paulson EK, Hagge R, et al. The role of F-18 FDG positron emission tomography in preopera-tive assessment of the liver in patients being considered for curative resection of hepatic metastases from colorectal cancer. Clin Nucl Med 2002;27:550–555.

83. Rydzewski B, Dehdashti F, Gordon BA, et al. Usefulness of intraoperative sonography for revealing hepatic metastases from colorectal cancer in patients selected for surgery after undergoing FDG PET. AJR Am J Roentgenol 2002;178:353–358.

84. Littenberg B, Moses LE. Estimating diagnostic accuracy from multiple conflicting reports: a new meta-analytic method. Med Decis Making 1993;13:313–321.

Chapter 7

��

85. Moses LE, Shapiro D, Littenberg B. Combining independent studies of a diagnostic test into a summary ROC curve: data-analytic approaches and some additional considerations. Stat Med 1993;12:1293–1316.

86. Rutter CM, Gatsonis CA. Regression methods for meta-analysis of diagnostic test data. Acad Radiol 1995;2(suppl 1):S48–S56.

87. Rutter CM, Gatsonis CA. A hierarchical regression approach to meta-analysis of diagnostic test accuracy evaluations. Stat Med 2001;20:2865–2884.

88. Lijmer JG, Mol BW, Heisterkamp S, et al. Empirical evidence of design-related bias in studies of diagnostic tests. JAMA 1999;282:1061–1066.

89. Pham B, Platt R, McAuley L, Klassen TP, Moher D. Is there a “best” way to detect and minimize publication bias? an empirical evaluation. Eval Health Prof 2001;24:109–125.

90. Bossuyt PM, Reitsma JB, Bruns DE, et al. Towards complete and accurate reporting of studies of diagnostic accuracy: the STARD initiative. Radiology 2003;226:24–28.

91. de Vries SO, Hunink MG, Polak JF. Summary receiver operating characteristic curves as a technique for meta-analysis of the diagnostic performance of duplex ultrasonography in peripheral arterial disease. Acad Radiol 1996;3:361–369.

92. Nelemans PJ, Leiner T, de Vet HC, van Engelshoven JM. Peripheral arterial disease: meta-analysis of the diagnostic performance of MR angiography. Radiology 2000;217:105–114.

��

8C h a p t e r

Evidence-based guideline on management of colorectal liver metastases in the Netherlands

Shandra BipatMaarten S. van LeeuwenJan N.M. IJzermansEmile F.I. ComansAndre S.Th. PlantingPatrick M.M. BossuytJan-Willem GreveJaap Stoker

The Netherlands Journal of Medicine 2007;65:5-14

��

Chapter 8

absTRaCT

A Dutch national evidence-based guideline on the diagnosis and treatment of patients with

colorectal liver metastases has been developed. The most important recommendations are as

follows.

For synchronous liver metastases, spiral computed tomography (CT) or magnetic resonance

imaging (MRI) should be used as imaging. For evaluation of lung metastases, imaging can be

limited to chest radiography.

For detection of metachronous liver metastases, ultrasonography could be performed as

initial modality, if the entire liver is adequately visualized. In doubtful cases or potential candi-

dates for surgery, CT or MRI should be performed as additional imaging. For evaluation of ex-

trahepatic disease, abdominal and chest CT could be performed. Fluorodeoxyglucose Positron

Emission Tomography (FDG-PET) could be valuable in patients selected for surgery based on

CT (liver/abdomen/chest), for identifying additional extrahepatic disease.

Surgical resection is the treatment of choice with a 5-years survival of 30 to 40%. Variation

in selection criteria for surgery is caused by inconclusive data in the literature concerning surgi-

cal margins <10 mm, presence of extrahepatic disease and the role of (neo)-adjuvant therapy.

To minimize variation in selection criteria, selection should be performed according to this

guideline and preferable in qualified centers.

If resection is not possible, due to extensive disease, palliative chemotherapy is recommend-

ed. Systemic chemotherapy with fluoropyrimidine first-line chemotherapy (5-FU/Leucovorin)

combined with irinotecan or oxaliplatin should be considered as standard regimens.

Radiofrequency ablation, isolated hepatic perfusion, portal vein embolization, and intra-

arterial chemotherapy are considered experimental and should only be performed as part of

a clinical research protocol.

evidence-based guideline ‘Management of colorectal liver metastases’

��

InTRoDUCTIon

Colorectal cancer is the second leading cause of cancer-related deaths in the Netherlands with an

incidence of 9900 and 4400 deaths in 2003 according to the Association of Comprehensive Can-

cer Centers [1]. Approximately 50 to 60% of patients with colorectal cancer eventually develop

liver metastases.

As there are variations in the therapeutic strategies for these patients, the optimal therapy

should be determined on an individual basis. A Dutch survey on the diagnostic and therapeutic

work-up of patients with colorectal liver metastases performed in 2004 showed substantial varia-

tion between different centers in both diagnostic work up and treatment. The most important

points of concern according to the responders of this survey were the absence of a national

guideline for diagnosis and treatment of patients with colorectal liver metastases and the absence

of a registration system [2].

MeTHoDs

To develop a national evidence-based guideline, a working group was established representing

the disciplines involved in this field, including surgeons, medical oncologists, radiologists, gastro-

enterologists and nuclear medicine specialists. All specialists were mandated by their respective

health professional organizations. A list of the members of the working group is presented in

appendix 1.

We performed literature searches in the Cochrane, MEDLINE, CANCERLIT, EMBASE, CINAHL

and Web of Science databases from 1992 to 2005 for different questions. The search strategies

are described in table1. Literature searches were performed for:

1) The diagnostic accuracy of computed tomography (CT), magnetic resonance imaging (MRI),

and 18-fluorodeoxyglucose positron emission tomography (FDG-PET) in the detection of

liver metastases and for detection of extrahepatic lesions; no search was performed for the

diagnostic accuracy of ultrasonography (US), as this modality has a low accuracy;

2) The diagnostic accuracy of diagnostic laparoscopy in the detection of liver metastases and

for detection of extrahepatic lesions;

3) The selection criteria on which surgery is based;

4) The effectiveness of neoadjuvant or adjuvant chemotherapy;

5) The role and effectiveness of the experimental therapeutic options such as portal vein

embolization, ablation techniques and isolated hepatic perfusion;

6) The effectiveness of different chemotherapeutic regimens used;

7) The role of follow-up after treatment of colorectal liver metastases.

All evidence was collected, discussed and categorized by the working group according to general

systems used in evidence-based medicine (table 2). Based on the relevant evidence and taking

into account factors such as experience and availability, recommendations were formulated for

��

Chapter 8

daily practice. These recommendations with corresponding evidence were sent to all the disci-

plines involved for comments, remarks and approval; all disciplines responded with minor com-

ments, remarks and suggestions and approved the final draft of the guideline. All comments and

remarks were incorporated in the final version of the guideline.

In this paper, on behalf of the working group, we report the recommendations with the cor-

responding evidence (including the level of evidence) for the diagnosis, treatment and follow-up

of patients with colorectal liver metastases in the Netherlands.

Table 1. Search strategies

DIAGNOSISMEDLINE

(Colorectal Neoplasms [MESH]) AND (Liver neoplasms [MESH]) AND ((Laparoscopy [MESH]) OR (Tomography, Emission-Computed [MESH]) OR (magnetic resonance imaging [MESH]) OR (Tomography, X-Ray Computed [MESH]) OR (ULTRASONOGRAPHY [MESH])) AND ((sensitivity and specificity [MESH]) OR (specificity [WORD]) OR (false negative [WORD]) OR (diagnosis [SH]) OR (diagnostic use [SH]) OR (detection [WORD]) OR (accuracy [WORD]))

EMBASE

(Colorectal Cancer [MESH]) AND (Liver metastasis [MESH])

CINAHL/SUMSEARCH

(Colorectal Neoplasm [MESH]) AND ((Liver Neoplasms [MESH]) OR (Neoplasm Metastasis [MESH]))

Web of Science/CANCERLIT/ COCHRANE

(Colorectal cancer) AND ((liver metastases) OR (hepatic metastasis))

TREATMENTMEDLINE

(Colorectal Neoplasms [MESH]) AND (Liver Neoplasms [MESH]) AND ((surgery [MESH]) OR (Hepatectomy [MESH]) OR (PERIOPERATIVE CARE [MESH]) OR (Catheter Ablation [MESH]) OR (Cryosurgery [MESH]) OR (Hyperthermia, Induced [MESH]) OR (Palliative Care [MESH]) OR (Drug therapy [MESH]) OR (Antineoplastic Agents [MESH]) OR (Infusions, Intra-Arterial [MESH]) OR (Perfusion, Regional [MESH]) OR (Radiotherapy [MESH])) AND ((Treatment outcome [MESH]) OR (Survival analysis [MESH] OR (Survival [MESH]) OR (Mortality [MESH]) OR (Morbidity [MESH]))

EMBASE

(Colorectal Cancer [MESH]) AND (Liver metastasis [MESH])

CINAHL/SUMSEARCH

(Colorectal Neoplasm [MESH]) AND ((Liver Neoplasms [MESH]) OR (Neoplasm Metastasis [MESH]))

Web of Science/CANCERLIT/ COCHRANE

(Colorectal cancer) AND ((liver metastases) OR (hepatic metastasis))

��

DIaGnosIs

Beside medical history, physical examination and laboratory test (e.g. CEA measurements), imag-

ing modalities such as transabdominal ultrasonography (US), CT, MRI, and FDG-PET imaging play

a major role in the selection of patients with liver metastases [3-11]. During the past ten years,

improvements in the imaging modalities and changes in applications have been made [6, 7,

10]. Extensive research has been carried out on the diagnostic performance of US, CT, MRI, and

FDG-PET for the detection of liver metastases. Another diagnostic technique playing a role in the

evaluation of liver metastases is diagnostic laparoscopy. However, the optimal imaging staging

strategy has not yet been defined.

Imaging plays a major role at: (1) the time of the diagnosis and treatment of the primary

tumor (for detection of synchronous liver and lung metastases); (2) during the follow-up after

the treatment of the primary tumor (for detection of metachronous liver metastases); and (3) for

Table 2. Levels of Evidence Based on the Categories of Literature*

Level of evidence

1 systematic review (A1) or at least two independent performed studies of category A2

2 systematic review (B1) or at least two independent performed studies of category B2

3 1 study of category A2, B2 or C

4 Expert opinion (category D)

Categories of literature

A1 Systematic reviews of category A2 studies with consistent findings

A2 D: accuracy study (index test compared with reference test) with high quality (prospective performed with blinded interpretation of index test and reference test and large number of consecutive patients undergoing complete verification) T: Randomized controlled trails of high quality (randomized, blinded, complete follow-up, simi-lar baseline characteristics, intension to treat analysis)

B1 Systematic reviews of category B2 studies with consistent findings

B2 D: accuracy study (index test compared with reference test) with poor quality (missing the above mentioned characteristics).T: Randomized controlled trail of low quality or other comparative studies such as non-random-ized, cohort and case-control studies.

C D: Non-comparative study (index test not compared with reference test)T: Non-randomized, cohort and case-control studies with poor quality or descriptive studies (non-comparative studies)

D Opinion form expert committee or clinical experience

D: diagnosis; T: treatment* Sackett DL, Strauss SE, Richardson WS, Rosenberg W, Haynes RB. Evidence-based medicine: How to practice and teach EBM. 2nd ed. Edinburgh: Churchill Livingstone 2000.

��6

Chapter 8

determining the resectability (detection of liver metastases and extrahepatic disease). The recom-

mendations are described in the following paragraphs.

At the time of initial diagnosis and treatment 1) To study baseline characteristics, a spiral CT or MRI of the liver should be performed in stead of

US, due to the low accuracy of US. Baseline CT or MRI are important not only for the detection or

characterization of liver lesions, but also for determining whether patients need adjuvant therapy.

In case of doubt about the presence and characterization of lesions, the CT or MRI examination

should be repeated after three months. Level of evidence: 4

2) For the evaluation of the lungs, imaging can be limited to plain chest radiography, due to the

low prevalence of lung metastases. CT provides a high sensitivity, but it should be noted that

chest CT also gives more false positives. In addition, in patients with negative chest radiography,

the additional value of CT is limited [12, 13]. Level of evidence: 3

During the follow-up and to determine resectability1) For the detection of metachronous liver metastases, we recommend using CEA as marker, if an

elevated CEA level was measured at the time of detection of the primary colorectal tumor. For the

evaluation of the liver, imaging may be limited to US, if the entire liver is assessable. For follow-up

no additional value of spiral CT or MRI to US has been demonstrated [14]. Level of evidence: 2

Because of its non-invasive character, low cost, and widespread availability, US is a valuable

screening tool for the imaging of liver metastases. Moreover US is highly efficient in helping to

distinguish between two groups of patients with liver metastases: patients with diffuse metasta-

ses who are no longer eligible for curative treatment and patients with no or a limited number

of metastases. In daily practice, therefore US is often used as initial imaging modality for the

detection of metachronous liver metastases [2].

2) If the liver cannot be evaluated properly by US, or the CEA elevation cannot be explained or

the irresectability cannot be determined based on US, an additional spiral CT or MRI should be

performed. MRI with Gadolinium (Gd) or superparamagnetic iron oxide (SPIO) contrast medium

and spiral CT with > 45 gram Iodine have a comparable sensitivity for the detection of liver me-

tastases [15]. Level of evidence: 1

The choice between spiral CT with > 45 gr Iodine or MRI with contrast agent (Gd or SPIO),

should therefore be mainly based on local infrastructure (costs, availability and expertise).

3) The role of FDG-PET for the detection of liver metastases and determining the resectability is

limited and should therefore not be performed routinely. In case of doubt concerning lesion char-

acterization on CT and MRI examination, an additional FDG-PET could be helpful, as in patients

with a long interval between CT and FDG-PET or patients selected for additional FDG-PET; this

modality seems to be sensitive for the detection of liver metastases [2] and is therefore also used

as additional modality in daily practice. Level of evidence: 1

4) The prevalence of extrahepatic disease (lung metastases and lymph nodes metastases) in

patient selected for surgery based on extensive imaging is low. From a practical point of view,

during the CT of the liver, additional CT of the abdomen could be performed for evaluation of

��7

the abdomen. There are no studies evaluating the additional role of abdominal CT for detection

of extrahepatic disease.

For the evaluation of the lungs, a chest CT could also be performed, however chest CT provides

a high number of false positives and the additional value in patients with negative chest radiog-

raphy seems to be low [12, 13]. Level of evidence: 3

Taken into account the low prevalence of lung metastases and the limited additional value of

chest CT for evaluation of the lungs, imaging can be limited to plain chest radiography.

5) In patients selected for surgery after chest, liver and abdominal CT, an additional FDG-PET can

be considered. FDG-PET seems to be sensitive for the detection of extrahepatic disease [16]. Level

of evidence: 1

Moreover the preliminary results of the POLEM study (randomized study: half of the patients

selected for surgery based on abdominal, chest and liver CT, underwent FDG-PET), showed that

unnecessary laparotomy can be prevented in significantly more patients in the FDG-PET group.

In the non-FDG-PET group 29% (14/49) underwent unnecessary laparotomy, while in FDG-PET

group only 11% (5/48) underwent unnecessary laparotomy (P = 0.02); in the FDG-PET group,

surgery was cancelled in 4 patients after FDG-PET. However these data are based on preliminary

nine-month follow–up of 97 patients, while 150 patients are included in this study. (Report

POLEM study, the Netherlands Organization for Health Research and Development (ZonMw)

grant 945-11-017).

PET-CT Hybrid PET-CT can be used for detection of liver metastases and extrahepatic disease when equip-

ment and sufficient expertise is available. Studies have shown that accuracy rates of up to 98%

can be achieved for the detection of liver metastases, extrahepatic disease and local recurrence in

patients who have been treated for colorectal tumor [17-19]. Level of evidence: 3

Diagnostic laparoscopy There is no role for diagnostic laparoscopy in routine daily practice, due to the invasiveness of

diagnostic laparoscopy, low prevalence of small subcapsular lesions and extrahepatic disease and

absence of clinical consequences of small liver metastases, as these generally can be resected.

The additional value of diagnostic laparoscopy in patients after extensive imaging also seems to

be limited [20, 21]. Level of evidence: 3

Additional examination 1) If liver metastases seem to be resectable based on imaging examination, additional examina-

tion of the cardiopulmonary system should be performed to study the clinical condition of the

patient. In general no cytological/ histological biopsies are performed.

2) If liver metastases based on imaging examination and the clinical condition of the patient seem

to be irresectable, no cytological/ histological biopsies should be performed to verify the diagnosis

because of the increased risk for developing needle tract metastases [22]. Biopsies should only be

performed, if histopathology will have clinical consequences.

��

Chapter 8

sURGeRY

Approximately 20% of the patients with liver metastases are considered candidates for surgery,

with a five-year survival of 30 to 40% [23-26].

Selection criteria for surgery are: a residual liver volume of ≥ 30% after resection, the feasibility

of an R0 resection (clear resection margin), limited or no presence of extrahepatic disease and

adequate clinical condition of the patient. However, there is some variation in the prognostic fac-

tors such as the presence of extrahepatic disease, surgical margins of < 10 mm and the timing of

the resection of synchronous liver metastases [2]. Neoadjuvant and adjuvant chemotherapy are

usually administrated to increase the effectiveness of surgery [27, 28], however the effectiveness

is also unknown.

Recommendations based on the evidence found in the literature:

1) In patient with normal functioning liver, at least 30% of the liver parenchyma should remain

after surgery. Up to 70% of the liver volume can be removed in these patients with a normal

functioning liver without risks of postoperative failure [29-31].

Level of evidence: 3

2) As there are no uniform results in the literature concerning a margin of < 10 mm [32-

36] (Level of evidence: 3) and due to the fact that the surgical margin cannot be accurately

determined pre-operatively, a surgical margin of ≥ 10 mm is recommended. Depending on the

anatomic location, a margin of < 10 mm is acceptable as long as a radical resection can be ob-

tained.

3) Attention should be paid to the preoperative evaluation of extrahepatic disease, as patients

with extrahepatic disease have a significantly worse prognosis compared to patients without

extrahepatic disease [37, 38]. Level of evidence: 3

However, there is controversial data on the consequences of the involvement of lymph nodes

located near the liver hilum. Several papers report that this should not be considered an absolute

contraindication for resection and an extended lymphadenectomy should be performed [39, 40],

while in a systematic review only few five-year survivors after liver resection with involvement of

hilum lymph nodes were reported [41]. In summary, there is no uniform evidence concerning the

resection of lymph nodes in the hilum of the liver.

4) The presence of a limited number of lung metastases, without mediastinal lymph node

involvement, is not considered an absolute contraindication for resection of liver metastases, as

resection of a limited number of lung metastases can prolong long-term survival [42-46]. Level

of evidence: 3

Therefore after radical surgery of the liver, subsequent lung surgery could be considered when

only a limited number of lung metastases are found.

5) High age in a patient with good cardiopulmonary condition should not be a contraindica-

tion for liver resection for colorectal cancer metastases. In patients > 70 years a median survival

of up to 33 months and a five-year survival of up to 22% can be achieved [47, 48]. Level of

evidence: 3

��

6) Although patients with solitary metachronous liver metastases have a better survival com-

pared to patients with synchronous metastases, the presence of synchronous liver metastases

should not be a contraindication for surgery, as five-year survival of up to 31% can be obtained

by resection of synchronous metastases [49-51]. Level of evidence: 3

7) Even though survival after simultaneous resection of colorectal cancer and liver metastases

and resection of liver metastases after an interval of two to three months are comparable [51,

52], simultaneous resection should be avoided, due to the high complication rate. In addition, in

two-thirds of patients major hepatic surgery is avoided, because of the detection of an increased

number of hepatic or distant metastases after an interval of two to three months [52]. Level of

evidence: 3

8) Repeat hepatectomy is advised in patients with new liver metastases after previous liver

surgery for colorectal metastases, if the patient fulfils all criteria for resectability. Repeat liver re-

section for colorectal liver metastases is safe and in well-selected patients can provide prolonged

survival after recurrence of colorectal liver metastases with limited mortality and morbidity rate

[53-59]. Level of evidence: 3

9) Data on the effectiveness of neoadjuvant chemotherapy are controversial and we therefore

recommend the use of neoadjuvant chemotherapy only in clinical research protocols. In a selected

patient population, neoadjuvant chemotherapy with the more effective regimens (combination

of 5-FU/LV with irinotecan or oxaliplatin) can induce response, making curative resection of previ-

ously irresectable liver metastases possible [27, 60-65]. Level of evidence: 3

10) The role of adjuvant chemotherapy after curative surgery is unclear and not advised rou-

tinely [66-71]. Level of evidence: 2

As there is a substantial variation in prognostic factors (see above), the working group recom-

mends that:

1) Liver resection should be performed in centers with high experience level, where appropri-

ate equipment is available and with enough experience in intensive care, anesthesiology and

interventional radiology. Administration of neoadjuvant or adjuvant chemotherapy should be

limited to trials.

2) Registration of patients should be performed, also outside trials. Registration systems are

important tools in evaluating indications for resection and results of resections.

eXPeRIMenTal THeRaPY

As most of the patients with liver metastases are not considered suitable for surgery, other treat-

ment modalities such as ablative therapy, portal vein embolization and isolated hepatic perfusion

have been developed during the last decades [72-82]. However, there is no information available

on the effectiveness of these modalities and the criteria for their application in the Netherlands

[2]. The recommendations of the working group are given for each experimental therapy.

��0

Chapter 8

Portal vein embolization Some patients not considered candidates for surgery due to insufficient remnant liver volume

with increased risk of postoperative liver failure can undergo portal vein embolization (PVE) of

the liver parts to be resected. Portal vein embolization results in atrophy of the embolized parts

and hypertrophy of the remnant liver, reducing the risk of hepatic failure after extended hepa-

tectomy.

So far, only retrospective studies with long-term results [83, 84] or prospective studies with

short-term results in terms of success rate and complications [74, 85-88] have been reported, with

in general, favorable results/findings. Level of evidence: 3 Moreover, small numbers of patients

have been included in these studies. Due to the lack of data on long-term results, PVE should

only be performed in trials, in centers with high experience and where clear-cut indications are

defined.

Ablative therapy Another treatment modality developed during the last decades for patients with liver malignan-

cies is local ablation therapy. The principle of ablation is based on tumor destruction by applying

heat (RFA or interstitial laser therapy) or cold (cryotherapy) or by chemical tumor destruction

(ethanol injection).

1) No recommendations could be made on the role of laser ablation, due to the small number of

studies evaluating long-term results of laser therapy [89, 90]. Level of evidence: 3

2) The number of studies with long-term results on cryotherapy is limited. In comparison with

RFA, cryotherapy has a higher complication rate (bleeding and infection) and more recurrence

[73, 89, 91]. Level of evidence: 3

3) The use of ethanol injection for colorectal liver metastases is not advised, due to the small

number of studies and the low response rate obtained [92-94]. Level of evidence: 3

4) RFA is the most promising technique for ablation purposes [95-98]. Level of evidence: 3

This technique is highly effective for tumor destruction. However, it is not known whether RFA

will prolong the survival of patients with extensive disease. In an ongoing randomized phase III

study (CLOCC trial), the role of local treatment by RFA in patients with irresectable colorectal liver

metastases is being studied. In this study one arm receives RFA combined with chemotherapy

while the second arm receives only chemotherapy. Current evidence on the safety and efficacy

of RFA for colorectal cancer liver metastases does not appear adequate and this experimental

therapy should therefore only be performed as part of a clinical research protocol.

Isolated hepatic perfusion In patients with extensive non-resectable liver metastases, isolated hepatic perfusion (IHP) can be

considered. IHP involves intraoperative perfusion of the isolated liver with extremely high-dose

chemotherapy. The results of recent studies show that high response rates and considerable

survival benefit can be achieved by IHP with different treatment strategies, including IHP with

melphalan alone and melphalan combined with TNF-α or followed by monthly hepatic intra-arte-

rial infusion of fluorodeoxyuridine (FUDR) and leucovorin. In these studies, IHP for colorectal liver

��

metastases showed response rates of up to 74%, a median time to progression of up to 14.5

months and a median survival of up to 27 months [75, 99]. Level of evidence: 3

IHP was first clinically applied over 40 years ago, but its technical complexity, the potential

morbidity, toxicity rate and the lack of documented efficacy probably have prevented widespread

use. Patient selection is important to ensure good results with minimal morbidity and mortality.

Work to define the appropriate clinical groups is ongoing in the Leiden University Medical Center

and the Erasmus Medical Center Rotterdam and therefore it is necessary to wait for the results of

these studies.

CHeMoTHeRaPY

Most patients with extensive and non-resectable metastases are only eligible for systemic chemo-

therapy. The following recommendations for systematic chemotherapy can be made:

1) For systemic chemotherapy fluoropyrimidine first-line chemotherapy (either oral or systemic

5-FU/Leucovorin) combined with irinotecan or oxaliplatin should be considered as standard regi-

mens; however the optimal regimens with either irinotecan or oxaliplatin are unknown. The effect

of oral 5-FU prodrug monotherapy is comparable with intravenous bolus 5-FU regimens. [100-

103]. Level of evidence: 1

Irinotecan or oxaliplatin combined with 5-FU/leucovorin increases the response and disease-

free-survival compared with 5-FU/leucovorin alone [104-106]. Level of evidence: 2

2) In the absence of contraindications, bevacizumab could be added to the first-line chemother-

apy. This has additional therapeutic value if bevacizumab is added to a fluoropyrimidine first-line

chemotherapy regimen (higher response rate, disease-free and total survival) [107, 108]. Level

of evidence: 2

3) An improvement in the field of chemotherapy is the development of regional (intra-arterial)

chemotherapy [109-111]. With regional chemotherapy higher doses can be administrated and

therefore higher tumor response rates could be achieved; however the effectiveness in terms of

disease-free survival and overall survival are yet unknown [112]. Level of evidence: 1

Therefore, regional chemotherapy at this stage has no role in the routine management.

folloW-UP

When possible, surgical resection is the treatment of choice for hepatic colorectal metastases,

with five-year survival rates of up to 30 to 40%. However, in most of the reported series, disease

recurs in up to 80% of patients after hepatectomy. The recurrence usually involves the liver and is

confined to the liver in approximately half of these cases.

As with initial hepatectomy, the feasibility of repeat resection depends not only on the disease

being confined to the liver but also on the distribution of hepatic disease permitting curative resec-

tion. Overall, only 23% to 33% of hepatic recurrences are resectable [59]. Repeat hepatectomy

��

Chapter 8

is associated with five-year survival rates equivalent to those reported for first hepatectomy [53]

and therefore detecting hepatic recurrence at a resectable stage would significantly improve

prognosis for this selected group of patients.

The aim of follow-up, therefore, is to select patients who are candidates for repeat resection.

This has also been shown in a recently published review [56]. However, there is no evidence avail-

able on the timing, frequency and the program of follow up.

Based on the results of the studies included in the review, a follow-up visit every three months

is recommended for two years, thereafter every six months until five years. Each visit is accompa-

nied with clinical examination, CEA measurements, and CT of the chest and abdomen.

ReGIsTRaTIon sYsTeM

Based on the survey/recommendations from the field, the working group also advocates the

development of a national registration system for the diagnosis and treatment of patients with

colorectal liver metastases. Registration systems are important tools in evaluating patient man-

agement. The collaboration between medical specialists and consulting specialists of the Associa-

tion of Comprehensive Cancer Centers provides the possibility of a national registration.

IMPleMenTaTIon of THe GUIDelIne

For all practitioners involved in the management of patients with colorectal liver metastases in the

Netherlands, the guideline is available on www.oncoline.nl or www.vikc.nl.

Although we are aware that passive dissemination of a guideline may be unlikely to lead to

change, whereas the combination of several active meetings is more likely to lead to success, we

firstly choose to disseminate the guideline by internet. This is because in general, guidelines for

oncologic diseases reported by these sites are easily implemented in daily practice. In addition,

a compact and transparent summary of the guideline has been written which will be sent to all

chairmen of oncology committees in each hospital, in which referral is made to the complete

guideline. Also, the working group has presented this guideline during meetings of the several

disciplines involved in the management of patients with colorectal liver metastases.

There is ongoing research both on diagnosis (POLEM study) and treatment (CLOCC trial and

experimental IHP, PVE). The results of these studies will most likely change the management of

this patient group. Therefore this guideline should be updated, when the results of these and

other relevant studies will be available.

��

Flowchart Diagnostic and therapeutic strategy in the follow-up after primary colorectal tumor management2 and 3 are considered experimental therapeutic strategies and the working group therefore recommends per-forming these strategies in trials or in centers with extensive experience, with proper equipments and where registration is optimal.

��

Chapter 8

References

1. http://www.ikcnet.nl

2. Bipat S, van Leeuwen MS, Ijzermans JN, Bossuyt PM, Greve JW, Stoker J. Imaging and treatment of patients with colorectal liver metastases in the Netherlands: a survey. Neth J Med 2006;64:147-151.

3. Hagspiel KD, Neidl KF, Eichenberger AC, Weder W, Marincek B. Detection of liver metastases: comparison of superparamagnetic iron oxide-enhanced and unenhanced MR imaging at 1.5 T with dynamic CT, intra-operative US, and percutaneous US. Radiology 1995;196:471-478.

4. Hung GU, Shiau YC, Tsai SC, Chao TH, Ho YJ, Kao CH. Value of 18F-fluoro-2-deoxyglucose positron emis-sion tomography in the evaluation of recurrent colorectal cancer. Anticancer Res 2001;21:1375-1378.

5. Ruers TJ, Langenhoff BS, Neeleman N, et al. Value of positron emission tomography with [F-18]fluoro-deoxyglucose in patients with colorectal liver metastases: a prospective study. J Clin Oncol 2002;20:388-395.

6. Schmidt J, Strotzer M, Fraunhofer S, Boedeker H, Zirngibl H. Intraoperative ultrasonography versus helical computed tomography and computed tomography with arterioportography in diagnosing colorectal liver metastases: lesion-by-lesion analysis. World J Surg 2000;24:43-47.

7. Scott DJ, Guthrie JA, Arnold P, et al. Dual phase helical CT versus portal venous phase CT for the detec-tion of colorectal liver metastases: correlation with intra-operative sonography, surgical and pathological findings. Clin Radiol 2001;56:235-242.

8. Valls C, Andia E, Sanchez A, et al. Hepatic metastases from colorectal cancer: preoperative detection and assessment of resectability with helical CT. Radiology 2001;218:55-60.

9. Vitola JV, Delbeke D, Sandler MP, et al. Positron emission tomography to stage suspected metastatic colorectal carcinoma to the liver. Am J Surg 1996;171:21-26.

10. Ward J, Naik KS, Guthrie JA, Wilson D, Robinson PJ. Hepatic lesion detection: comparison of MR imaging after the administration of superparamagnetic iron oxide with dual-phase CT by using alternative-free response receiver operating characteristic analysis. Radiology 1999;210:459-466.

11. Whiteford MH, Whiteford HM, Yee LF, et al. Usefulness of FDG-PET scan in the assessment of suspected metastatic or recurrent adenocarcinoma of the colon and rectum. Dis Colon Rectum 2000;43:759-767.

12. Kronawitter U, Kemeny NE, Heelan R, Fata F, Fong Y. Evaluation of chest computed tomography in the staging of patients with potentially resectable liver metastases from colorectal carcinoma. Cancer 1999;86:229-235.

13. Povoski SP, Fong Y, Sgouros SC, Kemeny NE, Downey RJ, Blumgart LH. Role of chest CT in patients with negative chest x-rays referred for hepatic colorectal metastases. Ann Surg Oncol 1998;5:9-15.

14. Berman JM, Cheung RJ, Weinberg DS. Surveillance after colorectal cancer resection. Lancet 2000;355:395-399

15. Bipat S, van Leeuwen MS, Comans EF et al. Colorectal liver metastases: CT, MR imaging, and PET for diagnosis—meta-analysis. Radiology 2005;237:123-131.

16. Wiering B, Krabbe PF, Jager GJ, Oyen WJ, Ruers TJ. The impact of fluor-18-deoxyglucose-positron emission tomography in the management of colorectal liver metastases. Cancer 2005;104:2658-2670.

17. Kamel IR, Cohade C, Neyman E, Fishman EK, Wahl RL. Incremental value of CT in PET/CT of patients with colorectal carcinoma. Abdom Imaging 2004;29:663-668.

18. Kim JH, Czernin J, len-Auerbach MS, et al. Comparison between 18F-FDG PET, in-line PET/CT, and software fusion for restaging of recurrent colorectal cancer. J Nucl Med 2005;46:587-595.

19. Selzner M, Hany TF, Wildbrett P, McCormack L, Kadry Z, Clavien PA. Does the novel PET/CT imaging mo-dality impact on the treatment of patients with metastatic colorectal cancer of the liver? Ann Surg 2004 ;240 :1027-1034.

20. D’Angelica M, Fong Y, Weber S, et al. The role of staging laparoscopy in hepatobiliary malignancy: pro-spective analysis of 401 cases. Ann Surg Oncol 2003;10:183-189.

��

21. Jarnagin WR, Conlon K, Bodniewicz J, et al. A clinical scoring system predicts the yield of diagnostic laparoscopy in patients with potentially resectable hepatic colorectal metastases. Cancer 2001;91:1121-1128.

22. Metcalfe MS, Bridgewater FH, Mullin EJ, Maddern GJ. Useless and dangerous—fine needle aspiration of hepatic colorectal metastases. BMJ 2004;328:507-508.

23. Fong Y, Cohen AM, Fortner JG, et al. Liver resection for colorectal metastases. J Clin Oncol 1997;15:938-946.

24. Jatzko GR, Lisborg PH, Stettner HM, Klimpfinger MH. Hepatic resection for metastases from colorectal carcinoma—a survival analysis. Eur J Cancer 1995;31A:41-46.

25. Mutsaerts EL, van Ruth S, Zoetmulder FA, Rutgers EJ, Hart AA, van Coevorden F. Prognostic factors and evaluation of surgical management of hepatic metastases from colorectal origin: a 10-year single-institute experience. J Gastrointest Surg 2005;9:178-186.

26. Sasaki A, Iwashita Y, Shibata K, Matsumoto T, Ohta M, Kitano S. Analysis of preoperative prognostic fac-tors for long-term survival after hepatic resection of liver metastasis of colorectal carcinoma. J Gastrointest Surg 2005 ;9 :374-380.

27. Adam R, Avisar E, Ariche A, et al. Five-year survival following hepatic resection after neoadjuvant therapy for nonresectable colorectal. Ann Surg Oncol 2001;8:347-353.

28. Kemeny MM. Chemotherapy after hepatic resection of colorectal metastases. Cancer Treat Res 1994;69:121-128.

29. de Baere T, Roche A, Elias D, Lasser P, Lagrange C, Bousson V. Preoperative portal vein embolization for extension of hepatectomy indications. Hepatology 1996;24:1386-91.

30. Kubota K, Makuuchi M, Kusaka K, et al. Measurement of liver volume and hepatic functional reserve as a guide to decision-making in resectional surgery for hepatic tumors. Hepatology 1997;26:1176-81.

31. Schneider PD. Preoperative assessment of liver function. Surg Clin North Am 2004;84:355-373.

32. Cady B, Jenkins RL, Steele GD Jr, et al. Surgical margin in hepatic resection for colorectal metastasis: a critical and improvable determinant of outcome. Ann Surg 1998;227:566-571.

33. Elias D, Cavalcanti A, Sabourin JC, Pignon JP, Ducreux M, Lasser P. Results of 136 curative hepatectomies with a safety margin of less than 10 mm for colorectal metastases. J Surg Oncol 1998;69:88-93.

34. Elias D, Cavalcanti A, Sabourin JC, et al. Resection of liver metastases from colorectal cancer: the real impact of the surgical margin. Eur J Surg Oncol 1998;24:174-179.

35. Kokudo N, Miki Y, Sugai S, et al. Genetic and histological assessment of surgical margins in resected liver metastases from colorectal carcinoma: minimum surgical margins for successful resection. Arch Surg 2002;137:833-840.

36. Pawlik TM, Scoggins CR, Zorzi D, et al. Effect of surgical margin status on survival and site of recurrence after hepatic resection for colorectal metastases. Ann Surg 2005;241:715-737.

37. Beckurts KT, Holscher AH, Thorban S, Bollschweiler E, Siewert JR. Significance of lymph node involvement at the hepatic hilum in the resection of colorectal liver metastases. Br J Surg 1997;84:1081-1084.

38. Jamison RL, Donohue JH, Nagorney DM, Rosen CB, Harmsen WS, Ilstrup DM. Hepatic resection for meta-static colorectal cancer results in cure for some patients. Arch Surg 1997;132:505-510.

39. Elias D, Ouellet JF, Bellon N, Pignon JP, Pocard M, Lasser P. Extrahepatic disease does not contraindicate hepatectomy for colorectal liver metastases. Br J Surg 2003;90:567-574.

40. Jaeck D. The significance of hepatic pedicle lymph nodes metastases in surgical management of colorectal liver metastases and of other liver malignancies. Ann Surg Oncol 2003;10:1007-1011.

41. Rodgers MS, McCall JL. Surgery for colorectal liver metastases with hepatic lymph node involvement: a systematic review. Br J Surg 2000;87:1142-1155.

42. Ike H, Shimada H, Togo S, Yamaguchi S, Ichikawa Y, Tanaka K. Sequential resection of lung metastasis following partial hepatectomy for colorectal cancer. Br J Surg 2002;89:1164-1168.

43. Kobayashi K, Kawamura M, Ishihara T. Surgical treatment for both pulmonary and hepatic metastases from colorectal cancer. J Thorac Cardiovasc Surg 1999;118:1090-1096.

��6

Chapter 8

44. Lehnert T, Knaebel HP, Duck M, Bulzebruck H, Herfarth C. Sequential hepatic and pulmonary resections for metastatic colorectal cancer. Br J Surg 1999;86:241-243.

45. Mineo TC, Ambrogi V, Tonini G, et al. Longterm results after resection of simultaneous and sequential lung and liver metastases from colorectal carcinoma. J Am Coll Surg 2003;197:386-391.

46. Regnard JF, Grunenwald D, Spaggiari L, et al. Surgical treatment of hepatic and pulmonary metastases from colorectal cancers. Ann Thorac Surg 1998; 66:214-218.

47. Brand MI, Saclarides TJ, Dobson HD, Millikan KW. Liver resection for colorectal cancer: liver metastases in the aged. Am Surg 2000;66:412-415.

48. Termuhlen PM, Kemeny MM. Surgery in the older patient. Oncology (Williston Park) 2002;16:183-189.

49. Fujita S, Akasu T, Moriya Y. Resection of synchronous liver metastases from colorectal cancer. Jpn J Clin Oncol 2000;30:7-11.

50. Lyass S, Zamir G, Matot I, Goitein D, Eid A, Jurim O. Combined colon and hepatic resection for synchronous colorectal liver metastases. J Surg Oncol 2001;78:17-21.

51. Vogt P, Raab R, Ringe B, Pichlmayr R. Resection of synchronous liver metastases from colorectal cancer. World J Surg 1991;15:62-67.

52. Lambert LA, Colacchio TA, Barth RJ, Jr. Interval hepatic resection of colorectal metastases improves patient selection. Arch Surg 2000;135:473-479.

53. Adam R, Bismuth H, Castaing D, et al. Repeat hepatectomy for colorectal liver metastases. Ann Surg 1997;225:51-60.

54. Fong Y, Blumgart LH, Cohen A, Fortner J, Brennan MF. Repeat hepatic resections for metastatic colorectal cancer. Ann Surg 1994;220:657-662.

55. Imamura H, Kawasaki S, Miyagawa S, Ikegami T, Kitamura H, Shimada R. Aggressive surgical approach to recurrent tumors after hepatectomy for metastatic spread of colorectal cancer to the liver. Surgery 2000;127:528-535.

56. Metcalfe MS, Mullin EJ, Maddern GJ. Choice of surveillance after hepatectomy for colorectal metastases. Arch Surg 2004;139:749-754.

57. Muratore A, Polastri R, Bouzari H, Vergara V, Ferrero A, Capussotti L. Repeat hepatectomy for colorectal liver metastases: A worthwhile operation? J Surg Oncol 2001;76:127-132.

58. Nagakura S, Shirai Y, Suda T, Hatakeyama K. Multiple repeat resections of intra- and extrahepatic recur-rences in patients undergoing initial hepatectomy for colorectal carcinoma metastases. World J Surg 2002 ;26 :141-147.

59. Nordlinger B, Vaillant JC, Guiguet M, et al. Survival benefit of repeat liver resections for recurrent colorectal metastases: 143 cases. Association Francaise de Chirurgie. J Clin Oncol 1994 ;12 :1491-1496.

60. Adam R, Delvart V, Pascal G, et al. Rescue surgery for unresectable colorectal liver metastases downstaged by chemotherapy: a model to predict long-term survival. Ann Surg 2004;240:644-657.

61. Giacchetti S, Itzhaki M, Gruia G, et al. Long-term survival of patients with unresectable colorectal cancer liver metastases following infusional chemotherapy with 5-fluorouracil, leucovorin, oxaliplatin and surgery. Ann Oncol 1999;10:663-669.

62. Meric F, Patt YZ, Curley SA, et al. Surgery after downstaging of unresectable hepatic tumors with intra-arterial chemotherapy. Ann Surg Oncol 2000;7:490-495.

63. Punt CJ. New options and old dilemmas in the treatment of patients with advanced colorectal cancer. Ann Oncol 2004;15:1453-1459.

64. Rivoire M, De CF, Meeus P, Negrier S, Sebban H, Kaemmerlen P. Combination of neoadjuvant chemo-therapy with cryotherapy and surgical resection for the treatment of unresectable liver metastases from colorectal carcinoma. Cancer 2002;95:2283-2292.

65. Wein A, Riedel C, Kockerling F, et al. Impact of surgery on survival in palliative patients with metastatic colorectal cancer after first line treatment with weekly 24-hour infusion of high-dose 5-fluorouracil and folinic acid. Ann Oncol 2001;12:1721-1727.

��7

66. Figueras J, Valls C, Rafecas A, Fabregat J, Ramos E, Jaurrieta E. Resection rate and effect of postoperative chemotherapy on survival after surgery for colorectal liver metastases. Br J Surg 2001 ;88 :980-985.

67. Kemeny N, Huang Y, Cohen AM, et al. Hepatic arterial infusion of chemotherapy after resection of hepatic metastases from colorectal cancer. N Engl J Med 1999;341:2039-2048.

68. Kemeny N, Gonen M, Sullivan D, et al. Phase I study of hepatic arterial infusion of floxuridine and dexa-methasone with systemic irinotecan for unresectable hepatic metastases from colorectal cancer. J Clin Oncol 2001;19:2687-2695.

69. Lorenz M, Muller HH, Staib-Sebler E, et al. Relevance of neoadjuvant and adjuvant treatment for patients with resectable liver metastases of colorectal carcinoma. Langenbecks Arch Surg 1999;384:328-338.

70. Ruers T, Bleichrodt RP. Treatment of liver metastases, an update on the possibilities and results. Eur J Cancer 2002;38:1023-1033.

71. Tono T, Hasuike Y, Ohzato H, Takatsuka Y, Kikkawa N. Limited but definite efficacy of prophylactic hepatic arterial infusion chemotherapy after curative resection of colorectal liver metastases: A randomized study. Cancer 2000;88:1549-1556.

72. Abdalla EK, Hicks ME, Vauthey JN. Portal vein embolization: rationale, technique and future prospects. Br J Surg 2001;88:165-175.

73. Adam R, Hagopian EJ, Linhares M, et al. A comparison of percutaneous cryosurgery and percutaneous radiofrequency for unresectable hepatic malignancies. Arch Surg 2002;137:1332-1339.

74. Azoulay D, Castaing D, Smail A, et al. Resection of nonresectable liver metastases from colorectal cancer after percutaneous portal vein embolization. Ann Surg 2000;231:480-486.

75. Bartlett DL, Libutti SK, Figg WD, Fraker DL, Alexander HR. Isolated hepatic perfusion for unresectable hepatic metastases from colorectal cancer. Surgery 2001;129:176-187.

76. Imamura H, Shimada R, Kubota M, et al. Preoperative portal vein embolization: an audit of 84 patients. Hepatology 1999;29:1099-1105.

77. Livraghi T, Solbiati L, Meloni F, Ierace T, Goldberg SN, Gazelle GS. Percutaneous radiofrequency ablation of liver metastases in potential candidates for resection: the “test-of-time approach”. Cancer 2003;97:3027-3035.

78. Makuuchi M, Thai BL, Takayasu K, et al. Preoperative portal embolization to increase safety of major hepatectomy for hilar bile duct carcinoma: a preliminary report. Surgery 1990;107:521-527.

79. Marinelli A, de Brauw LM, Beerman H, et al. Isolated liver perfusion with mitomycin C in the treatment of colorectal cancer metastases confined to the liver. Jpn J Clin Oncol 1996;26:341-350.

80. Pearson AS, Izzo F, Fleming RY, et al. Intraoperative radiofrequency ablation or cryoablation for hepatic malignancies. Am J Surg 1999;178:592-599.

81. Rothbarth J, Pijl ME, Vahrmeijer AL, et al. Isolated hepatic perfusion with high-dose melphalan for the treatment of colorectal metastasis confined to the liver. Br J Surg 2003;90:1391-1397.

82. Vahrmeijer AL, van Dierendonck JH, Keizer HJ, et al. Increased local cytostatic drug exposure by isolated hepatic perfusion: a phase I clinical and pharmacologic evaluation of treatment with high dose melphalan in patients with colorectal cancer confined to the liver. Br J Cancer 2000;82:1539-1546.

83. Elias D, De Baere T, Roche A, Bonvallot S, Lasser P. Preoperative selective portal vein embolizations are an effective means of extending the indications of major hepatectomy in the normal and injured liver. Hepatogastroenterology 1998;45:170-177.

84. Elias D, Ouellet JF, de BT, Lasser P, Roche A. Preoperative selective portal vein embolization before hepa-tectomy for liver metastases: long-term results and impact on survival. Surgery 2002;131:294-299.

85. Imamura H, Shimada R, Kubota M, et al. Preoperative portal vein embolization: an audit of 84 patients. Hepatology 1999;29:1099-1105.

86. Fujii Y, Shimada H, Endo I, et al. Changes in clinicopathological findings after portal vein embolization. Hepatogastroenterology 2000;47:1560-1563.

87. Kodama Y, Shimizu T, Endo H, Miyamoto N, Miyasaka K. Complications of percutaneous transhepatic portal vein embolization. J Vasc Interv Radiol 2002;13:1233-1237.

��

Chapter 8

88. Fujii Y, Shimada H, Endo I, et al. Effects of portal vein embolization before major hepatectomy. Hepato-gastroenterology 2003;50:438-442.

89. Christophi C, Nikfarjam M, Malcontenti-Wilson C, Muralidharan V. Long-term survival of patients with unresectable colorectal liver metastases treated by percutaneous interstitial laser thermotherapy. World J Surg 2004;28:987-994.

90. Mack MG, Straub R, Eichler K, et al. Percutaneous MR imaging-guided laser-induced thermotherapy of hepatic metastases. Abdom Imaging 2001;26:369-374.

91. Seifert JK, Morris DL. Prognostic factors after cryotherapy for hepatic metastases from colorectal cancer. Ann Surg 1998;228:201-208.

92. Giorgio A, Tarantino L, Mariniello N, et al. [Ultrasonography-guided percutaneous ethanol injection in large an/or multiple liver metastasis]. Radiol Med (Torino) 1998;96:238-242.

93. Giovannini M, Seitz JF. Ultrasound-guided percutaneous alcohol injection of small liver metastases. Results in 40 patients. Cancer 1994;73:294-297.

94. Livraghi T, Vettori C, Lazzaroni S. Liver metastases: results of percutaneous ethanol injection in 14 patients. Radiology 1991;179:709-712.

95. Gillams AR, Lees WR. Radio-frequency ablation of colorectal liver metastases in 167 patients. Eur Radiol 2004;14:2261-2267.

96. Solbiati L, Ierace T, Goldberg SN, et al. Percutaneous US-guided radio-frequency tissue ablation of liver metastases: treatment and follow-up in 16 patients. Radiology 1997;202:195-203.

97. Solbiati L, Livraghi T, Goldberg SN, et al. Percutaneous radio-frequency ablation of hepatic metastases from colorectal cancer: long-term results in 117 patients. Radiology 2001;221:159-166.

98. Solbiati L, Ierace T, Tonolini M. Long-term survival of patients treated with radiofrequency ablation for liver colorectal metastases: improved outcome with increasing experience. Radiolog 2003S;229:411.

99. Alexander HR, Jr., Bartlett DL, Libutti SK, Fraker DL, Moser T, Rosenberg SA. Isolated hepatic perfusion with tumor necrosis factor and melphalan for unresectable cancers confined to the liver. J Clin Oncol 1998 ;16 :1479-1489.

100. Carmichael J, Popiela T, Radstone D, et al. Randomized comparative study of tegafur/uracil and oral leucovorin versus parenteral fluorouracil and leucovorin in patients with previously untreated metastatic colorectal cancer. J Clin Oncol 2002;20:3617-3627.

101. Douillard JY, Hoff PM, Skillings JR, et al. Multicenter phase III study of uracil/tegafur and oral leucovorin versus fluorouracil and leucovorin in patients with previously untreated metastatic colorectal cancer. J Clin Oncol 2002;20:3605-3616.

102. Hoff PM, Ansari R, Batist G, et al. Comparison of oral capecitabine versus intravenous fluorouracil plus leucovorin as first-line treatment in 605 patients with metastatic colorectal cancer: results of a randomized phase III study. J Clin Oncol 2001;19:2282-2292.

103. van Cutsem E, Twelves C, Cassidy J, et al. Oral capecitabine compared with intravenous fluorouracil plus leucovorin in patients with metastatic colorectal cancer: results of a large phase III study. J Clin Oncol 2001 ;19 :4097-4106.

104. de Gramont A, Figer A, Seymour M, et al. Leucovorin and fluorouracil with or without oxaliplatin as first-line treatment in advanced colorectal cancer. J Clin Oncol 2000;18:2938-2947.

105. Douillard JY, Cunningham D, Roth AD, et al. Irinotecan combined with fluorouracil compared with fluoro-uracil alone as first-line treatment for metastatic colorectal cancer: a multicentre randomized trial. Lancet 2000;355:1041-1047.

106. Giacchetti S, Perpoint B, Zidani R, et al. Phase III multicenter randomized trial of oxaliplatin added to chro-nomodulated fluorouracil-leucovorin as first-line treatment of metastatic colorectal cancer. J Clin Oncol 2000;18:136-147.

107. Hurwitz H, Fehrenbacher L, Novotny W, et al. Bevacizumab plus irinotecan, fluorouracil, and leucovorin for metastatic colorectal cancer. N Engl J Med 2004;350:2335-2342.

��

108. Kabbinavar FF, Schulz J, McCleod M, et al. Addition of bevacizumab to bolus fluorouracil and leucovorin in first-line metastatic colorectal cancer: results of a randomized phase II trial. J Clin Oncol 2005;23:3697-3705.

109. Harmantas A, Rotstein LE, Langer B. Regional versus systemic chemotherapy in the treatment of colorectal carcinoma metastatic to the liver. Is there a survival difference? Meta-analysis of the published literature. Cancer 1996;78:1639-1645.

110. Kerr DJ, McArdle CS, Ledermann J, et al. Intrahepatic arterial versus intravenous fluorouracil and folinic acid for colorectal cancer liver metastases: a multicentre randomized trial. Lancet 2003;361:368-373.

111. Lorenz M, Muller HH. Randomized, multicenter trial of fluorouracil plus leucovorin administered either via hepatic arterial or intravenous infusion versus fluorodeoxyuridine administered via hepatic arterial infusion in patients with nonresectable liver metastases from colorectal carcinoma. J Clin Oncol 2000;18:243-254.

112. Reappraisal of hepatic arterial infusion in the treatment of nonresectable liver metastases from colorectal cancer. Meta-Analysis Group in Cancer. J Natl Cancer Inst 1996;88:252-258.

Chapter 8

��0 ��

aPPenDIX 1

The working group Chairman: J. Stoker, MD, PhD, radiologist, Academic Medical Center, Amsterdam

Vice-chairman: J.W. Greve MD, PhD, surgeon, Maastricht University Hospital, Maastricht

Other members in alphabetic order:

S. Bipat MSc, researcher, Academic Medical Center, Amsterdam

P.M.M. Bossuyt PhD, clinical epidemiologist, Academic Medical Center, Amsterdam

A. Cats MD, PhD, gastro-enterologist, Netherlands Cancer Institute, Amsterdam

E.F.I. Comans MD, PhD, nuclear medicine specialist, VU University Medical Center, Amsterdam

T.M. van Gulik MD, PhD, surgeon, Academic Medical Center, Amsterdam

R.L.H.Jansen MD, PhD, medical oncologist, Maastricht University Hospital, Maastricht

M.S. van Leeuwen MD, PhD, radiologist, University Medical Center Utrecht, Utrecht

M.E.J. Pijl MD, PhD, radiologist, Martini hospital, Groningen

A.S.Th. Planting MD, PhD, medical oncologist, Erasmus Medical Center Rotterdam, Rotterdam

R.A. Tollenaar MD, PhD, surgeon, Leiden University Medical Center, Leiden

J.N.M. IJzermans MD, PhD, surgeon, Erasmus Medical Center Rotterdam, Rotterdam

��0

��0 ��

9C h a p t e r

Multivariate random-effects approach: for meta-analysis of cancer staging studies

Shandra BipatAeilko H. ZwindermanPatrick M.M. BossuytJaap Stoker

Submitted

��

Chapter 9

absTRaCT

Background: Meta-analyses of diagnostic accuracy studies produce summary estimates of sen-

sitivity and specificity. Cancer staging however relies on staging systems and meta-analysis is

often performed after dichotomization. For each dichotomization, summary estimates of sensi-

tivity and specificity can be calculated by repeated bivariate random-effects analyses. However

staging information is lost and under- and overstaging can not be adequately expressed.

Methods: We propose a new multivariate random-effects approach, which is an extension of

the bivariate random-effects approach. In this approach, staging data were used to calculate

correct staging and under- and overstaging. We also calculated sensitivity and specificity for

each dichotomization, using the results obtained by the multivariate approach and compared

these estimates with the results from the repeated bivariate analyses. We used data from a

meta-analysis comparing Endoluminal Ultrasonograpy (EUS) and Magnetic Resonance Imaging

(MRI) in staging of rectal cancer.

Results: In both approaches EUS was more accurate than MRI in staging. The sensitivity and

specificity values of EUS or MRI for the dichotomizations, calculated from the results of the

multivariate approach, were comparable with the sensitivity and specificity estimates of EUS or

MRI obtained by the bivariate analysis. The multivariate analysis demonstrated more overstag-

ing with MRI than EUS.

Conclusion: The multivariate random-effects approach can be a useful meta-analytic method for

summarizing cancer staging data presented in diagnostic accuracy studies.

Multivariate random-effects approach

��

InTRoDUCTIon

Systematic reviews of diagnostic test accuracy studies are undertaken to collect the available

evidence, to evaluate the quality of published studies, and to account for variation in findings

between studies [1-6].

The diagnostic accuracy of a test is often quantified in terms of its sensitivity and specificity.

When results from several studies of a diagnostic test are available, pooling of the results in a

meta-analysis can be done in several ways. For summarizing sensitivity and specificity, several au-

thors [7-9] have shown the potential of a bivariate random-effects approach, and others (10) have

proposed a hierarchical summary receiver operating characteristic (HSROC) model for obtaining

summary estimates sensitivity and specificity. These two models are very closely related [11].

Sensitivity and specificity are important for the differentiation between disease and non-disease.

In the work up of patients with cancer the correct differentiation between disease and non-

disease is not the only important issue. Correctly determining the extent of tumor invasion (T-stag-

ing) is of primary interest as well [12-16]. So far, summary estimates of the results of T-staging

as compared to the reference standard are usually obtained by first dichotomizing the staging

results in each study, and then performing meta-analysis of the resulting sensitivity and specificity

pairs. In rectal cancer, for example, four stages of the primary tumor are reported: T1, T2, T3

and T4. To analyze the data in terms of sensitivity and specificity one can compare: T1 versus

T2+T3+T4, T1+T2 versus T3+T4 and T1+T2+T3 versus T4. Such repeated analyses necessitate a

number of decisions by the analyst which are hard to objectify.

In this paper we propose a new multivariate random-effects approach in which staging data

(k x k tables) are used to calculate correct staging, understaging and overstaging. We suggest

performing meta-analysis directly on these tables as obtained from the included primary studies.

In this way only one data-analysis is required, while every comparison can be calculated from the

results of this single analysis.

We will compare this multivariate approach with the result of the repeated bivariate approach

after dichotomization, using data from a previously published meta-analysis [17] on staging rectal

cancer: 11 data sets on Magnetic Resonance Imaging (MRI) and 26 data sets on endoluminal

ultrasonography (EUS).

The first section contains a short description of the bivariate approach and is applied to the MRI

and EUS data. The second section described the multivariate random- effects approach and its

application to the same data. Finally, we will compare both methods, and discuss our findings.

bIVaRIaTe RanDoM-effeCTs aPPRoaCH

The vast majority of diagnostic accuracy studies report their findings as pairs of sensitivity and

specificity. Sensitivity and specificity have become the most familiar measures to clinicians. Con-

sequently, these statistics are also considered as the main outcome measures in meta-analysis of

diagnostic accuracy studies.

��6

Chapter 9

1. PrincipleDichotomizing data for bivariate analysis

T-staging of a rectal cancer is defined as follows; T1-stage: invasion of the submucosa; T2-

stage: invasion of muscularis propria; T3-stage: invasion of perirectal tissue and T4-stage: invasion

of adjacent organs. If the results of EUS or MRI of n patients are compared to the reference

standard, the results in a particular study can be summarized in a 4-by-4 table (table 1), where n11

is the number of on these tables as obtained from the included primary studies. In this way only

patients with T1 stage by either EUS or MRI and verified by the reference standard. Similarly, n12 is

the number of patients with T1 stage by EUS/MRI and T2 stage according to reference standard;

and so on.

If the table is dichotomized as T1 vs. T2+T3+T4, the specificity in this study is computed

as p1=n11/n1 and sensitivity as p2= (n22+n23+n24+n32+n33+n34+n42+n43+n44)/ (n2+n3+n4). If

the table is dichotomized as T1+T2 vs. T3+T4 the specificity in this study is computed as p1=

(n11+n12+n21+n22)/ (n1+n2) and the sensitivity as p2=n33+n34+n43+n44)/ (n3+n4). Finally, if the table

is dichotomized as T1+T2+T3 vs. T4 the specificity in this study is computed as p1= (n11+n12+n13+

n21+n22+n23+n31+n32+n33/ (n1+n2+n3), and the sensitivity as p2=n44/n4.

Table 1. EUS or MRI data compared with reference standard

Reference standard

T1 T2 T3 T4

EUS or MRI

T1 n11 n12 n13 n14

T2 n21 n22 n23 n24

T3 n31 n32 n33 n34

T4 n41 n42 n43 n44

Sum n1 n2 n3 n4

The bivariate analysis

The bivariate model uses pairs of sensitivity and specificity estimates from a series of studies (9).

In this model, the assumption is made that the true values of sensitivity and specificity of the dif-

ferent studies follow a bivariate distribution. As sensitivity and specificity only vary between 0 and

1, the logit-transformed sensitivity and logit-transformed specificity values are modeled.

Logit-transformed sensitivity and specificity are calculated as follows:

ln and ln respectively.

��7

The logit-transformed specificities and logit-transformed sensitivities are assumed to follow a

bivariate normally distribution across studies

with means (µ1, µ2) and covariance matrix , where represents the variance

of the logit-specificity, the variance of the logit-sensitivity and , the covariance between

logit-sensitivity and logit-specificity.

In this approach, unexplained variability between studies can be incorporated. This variation

in (underlying) sensitivities and specificities between studies can thus be related to remaining

differences in study population, differences in implicit thresholds, or unnoticed variations in index

test protocol. The potential presence of a (negative) correlation between sensitivity and specificity

within studies is addressed by explicitly incorporating this correlation into the analysis, through

modeling the covariance parameter .

In addition to the variability between studies in sensitivity and specificity, there is also varia-

tion due to sampling. As studies differ in size, variability due to chance is more likely to occur in

smaller studies. For this reason, the bivariate model is extended by incorporating the precision

by which sensitivity and specificity have been measured in each study. When comparing T1 vs.

T2+T3+T4, the model evaluates the binomial likelihood of observing n11 correct T1-stages by EUS

or MRI given n1 true T1-stages according to reference standard, and the binomial likelihood of

correctly observing n22+n23+n24+n32+n33+n34+n42+n43+n44 T2, T3 or T4 stages by EUS or MRI

given n2+n3+n4, T2, T3, or T4 stages observed by the reference standard. This means that studies

with a more precise estimate of sensitivity and specificity are given a higher weight in the analysis

of sensitivities and specificities.

This bivariate model can be analyzed using linear and nonlinear mixed model techniques,

which are now widely available in statistical packages (SAS proc Nlmixed).

2. Outcomes of bivariate random-effects approachThe bivariate model produces the following relevant results:

1. Summary estimates of the mean logit sensitivity (µ2) and logit specificity (µ1) with corre-

sponding standard errors. Summary estimates of sensitivity and specificity and their 95% confi-

dence interval can be calculated after anti-logit transformation. These intervals take into account

the heterogeneity beyond chance between studies (random-effects model).

2. The parameters of the bivariate distribution can also be used to obtain a summary Receiver

Operating Characteristics (sROC) curve. The relation between logit-transformed sensitivity and

logit-transformed specificity is given by: logit-sensitivity= α+β*logit-specificity. The slope (β) of this

line equals / , and the intercept (α) equals α = µ2- β*µ1. After anti-logit transformation of

the regression line, a sROC curve is obtained.

��

Chapter 9

3. Other measures derived from sensitivity and specificity can be calculated, such as the diag-

nostic odds ratios (exp[µ1+µ2]), and likelihood ratios:

positive likelihood ratio = (exp[µ2]+exp[µ1+µ2])/(exp[µ2]+1), and

negative likelihood ratio = (exp[µ1]+1)/ (exp[µ1]+exp[µ1+µ2]).

4. Covariates can be added to this bivariate model. These can lead to separate effects on sen-

sitivity and specificity. Net effects on the diagnostic odds ratio are also available. This means that

we can explicitly test whether sensitivity or specificity, or both, differ between two diagnostic tech-

nologies. A detailed technical description of this bivariate model can be found in Reitsma et al [9].

3. Example bivariate random-effects approach We have applied this method to data from a previously published meta-analysis on staging of

rectal cancer: only data sets from 1993-2002 were included: 11 data sets on MRI and 26 data

sets on EUS (17). The EUS and MRI data sets of the studies included in this study are given in ap-

pendix 1. We performed bivariate meta-analyses of three dichotomizations: T1 versus T2+T3+T4,

of T1+T2 versus T3+T4, and T1+T2+T3 versus T4.

Figures 1A-C show the sensitivity and specificity obtained in each study per imaging technique

for the three dichotomizations, with corresponding estimates of sensitivity and specificity ob-

tained by the bivariate approach. In all data sets EUS was found to be more accurate than MRI

(see table 2); by MRI significant numbers of patients were incorrectly categorized as T4, while

they were either T1, T2 or T3 (p < 0.001) compared to EUS.

Table 2. Bivariate pooled sensitivity and specificity estimates for the three dichotomizations

EUS MRI

specificity sensitivity specificity sensitivity

T1 vs. T2+T3+T4 0.86 (0.74-0.93)

0.99 (0.97- 0.995)

0.69 (0.34-0.90)

0.99 (0.96-0.998)

T1+T2 vs. T3+T4 0.84 (0.76-0.89)

0.94 (0.91-0.96)

0.75 (0.57-0.87)

0.91 (0.80-0.96)

T1+T2+T3 vs. T4 0.99 (0.98-0.996)*

0.81(0.68-0.89)

0.97 (0.971-0.976)*

0.79 (0.31-0.97)

Values in parentheses are 95% confidence intervals. * P < 0.001.

For EUS, the Diagnostic Odds Ratios were 488 (95% CI: 145-1646), 75 (95% CI:37-153) and 523

(95% CI:183-1492) for T1 vs. T2+T3+T4, T1+T2 vs. T3+T4, and T1 vs. T2+T3+T4, respectively.

For MRI, the Diagnostic Odds Ratios were 244 (95% CI: 29-2033), 29 (95% CI: 8-105) and 132

(95% CI: 16-1122) respectively.

Both for EUS and MRI, and for all three dichotomizations of the T-system, the random-effects

model fitted the data better than the corresponding fixed-effects model; the Akaike Information

Criterion (AIC) values of the random-effects model were lower. A lower AIC value indicates a

better fit (18).

��

Even in the random-effects model, some parameters of the covariance matrix were difficult to

estimate, as computationally it tooks many iterations. For example, as one can see in Figure 1

(A and B) there was much more between study variations in specificity than in sensitivity when

comparing T1 vs. T1+T2+T3 and T1+T2 vs. T3+T4. As a result, the variance of the logit-sensitivity

( ) was difficult to estimate when comparing T1 vs. T2+T3+T4 and T1+T2 vs. T3+T4. Figure 1C

shows that there was much more variation between studies in sensitivity than in specificity when

comparing T1+T2+T3 vs. T4. Therefore, the variance of the logit-specificity ( ) was difficult to

estimate when comparing T1+T2+T3 vs. T4.

Moreover, the covariance between logit-sensitivity and logit-specificity ( ) was difficult to

estimate in all cases: in none of the analyses we found that the covariance was significantly dif-

ferent from zero.

0 0.2 0.4 0.6 0.8 1

Speci�city

EUSMRIEstimate EUSEstimate MRI

0 0.2 0.4 0.6 0.8 1

Speci�city

0 0.2 0.4 0.6 0.8 1

Speci�city

T1 vs T2+T3+T4 T1+T2 vs T3+T4 T1+T2+T3 vs T4

Figure 1A. The distribution of sensitivity and specificity for T1 vs. T2+T3+T4 Summary sensitivity and specificity for EUS were 0.99 (95% CI:0.97-0.995) and 0.86 (95% CI:0.74-0.93)

Summary sensitivity and specificity for MRI were 0.99 (95% CI: 0.96-0.998) and 0.69 (95% CI: 0.34-0.90)

��0

Chapter 9

0 0.2 0.4 0.6 0.8 1

Speci�city

0 0.2 0.4 0.6 0.8 1

Speci�city

0 0.2 0.4 0.6 0.8 1

Speci�city

0 0.2 0.4 0.6 0.8 1

Speci�city

0 0.2 0.4 0.6 0.8 1

Speci�city

0 0.2 0.4 0.6 0.8 1

Speci�city

Figure 1B. The distribution of sensitivity and specificity for T1+T2 vs. T3+T4Summary sensitivity and specificity for EUS were 0.94 (95% CI:0.91-0.96) and 0.84 (95% CI:0.76-0.89)

Figure 1C. The distribution of sensitivity and specificity for T1+T2+T3 vs. T4Summary sensitivity and specificity for EUS were 0.81 (95% CI:0.68-0.89) and 0.99 (95% CI:0.98-0.996)

��

MUlTIVaRIaTe RanDoM-effeCTs aPPRoaCH

In the bivariate random-effects approach three different analyses were required. We extended the

bivariate approach to a multivariate model with which we can discriminate between all T-stages,

and hence report on correct staging, but also on understaging and overstaging of tumor.

1. PrincipleIn this example of rectal cancer staging four T- stages are reported. So in rectal cancer staging, the

entire 4 x 4 table is relevant. Instead of only two proportions, as in the bivariate case (i.e. sensitiv-

ity, and specificity), we now consider all 16 proportions. For any particular study, the 4-by-4 table

will contain 16 proportions (see table 3), where p11 is defined as p11=n11/n1, and p21=n21/n1,

and p12=n12/n2, and so on. Notice that we require that p11+p21+p31+p41=1, and similarly for the

other true T-stages according to reference standard. Hence, there are only 12 “independent”

proportions, because ρ11 + ρ21 + ρ31 + ρ41 =1, similar ρ12 + ρ22 + ρ32 + ρ42 = ρ13 + ρ23 + ρ33 + ρ43 =

ρ14 + ρ24 + ρ34 + ρ44= 1.

Table 3. Example of a 4 by 4 table containing the 16 proportions

Reference standard

T1 T2 T3 T4

EUS or MRI T1 p11 p12 p13 p14

T2 p21 p22 p23 p24

T3 p31 p32 p33 p34

T4 p41 p42 p43 p44

Sum 1 1 1 1

Similar to the bivariate meta-analysis, we assume that these 12 “independent” proportions vary

between studies according to a 12-variate distribution. Here also it is awkward to model pro-

portions, because these may vary only between zero and 1, and therefore we modeled the

“logit”-transformed proportions. The “logit”-transform is slightly more complex when there are

four categories. For the ith true T-stage (according to the reference standard) the proportion

pji of patients scored in T-stage j by EUS or MRI is modeled as , where a4i= 0. In

this notation, we chose the T4 category by EUS or MRI as the reference category (details see

appendix 2).

The 12 transformed parameters aji were assumed to follow a normal distribution with means

(µ11 ,..., µji, ..., µ34), and 12-variate covariance matrix Σ. This covariance matrix is symmetric and

��

Chapter 9

has 12 variances on the diagonal and (12*11/2=) 66 different covariances on the 132 off-diago-

nal cells (see also appendix 2).

Unless many different studies are available it may be difficult to estimate all of these covarianc-

es, and some constraints may be required, for instance all covariances can be set to zero or might

be assumed to be equal. Variation due to chance is incorporated by evaluating the multinomial

likelihood that out of n1 true T1-stages according to the reference standard n11 correct T1-assess-

ments were observed by EUS/MRI, and n21, n31, and n41 incorrect T2-, T3-, and T4-stages. Similar

multinomial likelihoods for the n2, n3, and n4 true T2-, T3-, and T4-stages observed by reference

standard review can be defined. The parameters are estimated by maximizing the product of

these four multinomial likelihoods integrated over the 12-variate normal distribution. This is done

in SAS proc NLmixed. An example of the SAS syntax is provided in the appendix 3.

2. Outcomes multivariate random-effects approachThis multivariate random-effects approach can produce the following relevant results:

1. Summary estimates of the means (µ11 ,..., µji, ..., µ34) will be obtained, and from these all

average proportions (ρji) of the 4-by-4 table can be calculated by anti-“logit” transform.

2. Other measures can be derived from the calculated proportions ρji. Specificity of T1 vs.

T2+T3+T4 can be calculated as ρ11, and sensitivity as (ρ22+ρ23+ρ24+ρ32+ρ33+ρ34+ρ42+ρ43+ρ44)/3,

and sensitivity and specificity of T1+T2 vs. T3+T4, and of T1+T2+T3 vs. T4 can be calculated in a

similar fashion.

3. Accuracy of EUS/MRI staging is defined as (ρ11+ρ22+ρ33+ρ44)/4, and the multivariate diag-

nostic odds ratio may be defined as ρ11*ρ22*ρ33*ρ44/[(1-ρ11)(1-ρ22)(1-ρ33)(1-ρ44)].

3. Example multivariate random-effects approach:We reanalyzed the dataset (EUS and MRI in rectal cancer staging) as previously reported, using

the multivariate random-effects approach. The results obtained by the multivariate approach on

EUS and MRI are listed in tables 4 and 5 respectively.

With the multivariate random-effects approach EUS was found to be more accurate than MRI

in staging (80% [95% CI: 77%-83%] vs. 72% [95% CI: 64%-80%), but the differences were not

statistically significant. MRI seemed more often to overstage than EUS: T1 cancers were more

often seen by MRI as T2 or even T3 (12% vs. 2%), and T2 cancers were more often seen as T3

(42% vs. 28%). With respect to staging T3- and T4-cancers EUS- and MRI-results were more

comparable. The multivariate Odds Ratios were 426 (95% CI: 99-737) and 59 (95% CI: 14-244)

for EUS and MRI respectively.

Both for US and for MRI the AIC values of the random-effects models were significantly lower

than the AIC values of the corresponding fixed-effects models; this points to significant differ-

ences between the studies with respect to T-staging. These differences are illustrated in Figure 2.

The fractions of correctly identified T1-stage cancers by EUS in the different studies are plotted;

there is a considerable difference between studies since the fractions varied between 0.17 and

1.00. The fractions correctly identified T2-, T3-, and T4-stage cancers by EUS, and MRI also varied

considerably between studies. Also the fractions of T1-stage cancers that were classified as T2 by

��

EUS in the different studies are plotted, and here too there were noticeable differences between

studies (the fractions varied between 0.00 and 0.50), and the same applied for almost all other

incorrect classifications.

Table 4. Staging results of EUS obtained by the multivariate approach

T stage on reference standard

T1 T2 T3 T4

T stage on EUS

uT1 0.88 (0.82-0.91)

0.08(0.04-0.10)

0.02 (0-0.03)

0.00(0-0)

uT2 0.10 (0.07-0.15)

0.63 (0.59-0.71)

0.08 (0.05-0.11)

0.01(0-0.04)

uT3 0.02 (0.01-0.04)

0.28 (0.22-0.34)

0.90 (0.89-0.93)

0.20 (0.12-0.28)

uT4 0 (0-0.01)0.01

(0-.015)0.01

(0-0.02)0.78

(0.69-0.87)

Proportions per stage are presented with 95% confidence intervals in parentheses

Table 5. Staging results of MRI obtained by the multivariate approach

T stage on reference standard

T1 T2 T3 T4

T stage on MRI

mT1 0.71 (0.55-0.81)

0.03 (0.01-0.08)

0 (0-0.01)

0 (0-0.005)

mT2 0.17 (0.10-0.29)

0.54 (0.44-0.67)

0.11 (0.07-0.17)

0.02 (0-0.11)

mT3 0.12 (0.05-0.22)

0.42 (0.29-0.52)

0.85 (0.79-0.90)

0.20 (0.07-0.40)

mT4 0 (0-0)

0 (0-0.03)

0.04 (0.02-0.07)

0.78(0.57-0.91)

Proportions per stage are presented with 95% confidence intervals in parentheses

As with the bivariate approach some of the parameters of the multivariate random-effects models

were difficult to estimate, especially the parameters associated with cells with low numbers.

There was, for instance, not a single patient with a T4-cancer who was diagnosed as T1 by EUS/

MRI. The associated parameter was (correctly) estimated to be minus infinity, and the variance

representing differences between studies with respect to this specific cell of the four-by-four

table was estimated as zero. Some other cells contained only 2 patients from two studies, and

the variances of the associated parameters were also estimated as zero. Just as with the previ-

ous bivariate analyses the correlations between the 12 random-effects were small (they varied

between –0.15 and +0.02 for EUS, and –0.16 and +0.13 for MRI).

��

Chapter 9

-1.00 0.50 0.00 0.50 1.00

Proportion

Incorrectly

staged as

Correcly

staged as

-1.00 0.50 0.00 0.50 1.00

Proportion

Incorrectly

staged as

Correcly

staged as

Left: proportions of T1-stage cancers incorrectly classified as T2-stage by EUS; there were notice-able differences between studies, with proportions varied between 0.00 and 0.50.

Right: proportions of correctly identified T1-stage cancers by EUS in the different studies; there is also considerable difference between studies, since the fractions varied between 0.17 and 1.00.

Figure 2 shows the proportions of T1-stage cancers incorrectly classified as T2-stage by EUS and the proportions of correctly identified T1-stage cancers by EUS in the different studies.

��

CoMPaRIson of boTH aPPRoaCHes

The results from the multivariate approach were used to calculate the sensitivity and specificity for

the three dichotomizations. The results are summarized in table 6, which also includes the results

from the repeated bivariate approach.

Table 6. Comparison of the results of the bivariate and multivariate approaches

EUS MRI

multivariate bivariate multivariate bivariate

spec/sens spec/sens spec/sens spec/sens

T1 vs. T2+T3+T4 0.89/0.98 0.86/0.99 0.71/0.97 0.69/0.99

T1+T2 vs. T3+T4 0.87/0.91 0.84/0.94 0.73/0.94 0.75/0.91

T1+T2+T3 vs. T4 0.99/0.81 0.99/0.81 0.98/0.78 0.97/0.79

spec: specificity; sens: sensitivity

DIsCUssIon

The proportions of patients correctly staged, understaged and overstaged for each stage were

obtained by the multivariate approach. EUS was more accurate in correct staging compared

to MRI. When the results of the multivariate model were used to calculate the sensitivity and

specificity for the three dichotomizations (T1 vs. T2+T3+T4, T1+T2 vs. T3+T4, and T1+T2+T3

vs. T4), the findings were comparable with the results of the bivariate analyses for the three

dichotomizations.

For meta-analysis of diagnostic accuracy studies several approaches exist: a bivariate random-

effects approach [7-9] and a hierarchical summary receiver operating characteristic (HSROC)

model [10]. These models are very close related [11].

We extended the bivariate approach to a multivariate approach for analyzing staging data.

This multivariate approach has four additional advantages. Firstly, the multivariate approach en-

tails only a single statistical analysis, and does not require decisions on dichotomizations of data.

This will reduce chance findings by reducing the impact of subjective decisions in the statistical

analysis. Secondly, the multivariate analysis does provide all cell probabilities and the confidence

intervals of all cells of the 4-by-4 table, and with that information it is much easier to pinpoint

the amount of under- and overstaging, and how much studies vary in that aspect. The third

advantage of the multivariate approach is that studies that only report the results of particular

dichotomizations, say T1+T2 vs. T3+T4, can still be used in the meta-analysis, and this applies in

general for all dichotomizations. In contrast, with the repeated bivariate model the study who

only reported the results of T1+T2 vs. T3+T4 cannot be used when analyzing other dichotomiza-

tions. The fourth advantage is that the multivariate analysis requires one single statistical analysis

��6

Chapter 9

and afterwards all these parameters could be used to calculate sensitivity and specificity of any

dichotomizations.

As argued, the multivariate model can be extended with covariates, and these can be incor-

porated in several ways: (i) the regression parameters may be fixed or vary between the studies,

and (ii) the regression parameters may be different but they may also be similar for the three

possible dichotomizations, T1 vs. T2+T3+T4, T1+T2 vs. T3+T4, and T1+T2+T3 vs. T4), since the

staging-system is likely measured on an ordinal scale.

The calculations were performed in the SAS system with the NLmixed procedure. We found

the program to be robust, but a severe limitation was that the calculations took a very long time

for the multivariate analyses. The calculations took 48 hours on a PC, or more. We therefore

implemented the multivariate model in the Winbugs program which uses a Bayesian algorithm.

This took much less computing time, typically about two hours. Another advantage of the Bayes-

ian approach is that it is much easier to calculate the confidence intervals of the probabilities as

reported in tables 4 and 5. The Winbugs-implementation can be obtained from the first author.

From the clinical point of view, understaging and overstaging are important issues, as the

choice of the treatment is mostly influenced by the invasiveness of the tumor. For example,

patients with T1-stage rectal cancer will mostly undergo transanal endoscopic microsurgery

(TEM), a minimally invasive procedure [19, 20], whereas patients with more advanced (T2, T3

and T4-stages), in general, undergo total mesorectal excision (TME), with or without (extended)

radiotherapy [21-23]. Another example, in cervical cancer staging, overstaging of a T2A tumor

and understaging of a T2B tumor would lead to inappropriate treatment of patients. Patients

with early stage tumor in general undergo surgery, while patients with advanced stage tumor will

undergo more complex treatment [24-27].

There are numerous types of cancer in which T-stages (e.g. esophageal cancer, prostate can-

cer) are important for selecting the appropriate therapeutic procedures and accurate staging is

required [28-33]. In the literature, imaging data are presented per stage and we feel these should

be summarized properly, without dichotomizing data or selecting data in order to calculate sen-

sitivity and specificity [34, 35]. The multivariate random-effects approach described above could

also be used to summarize data on lymph node status (N-status), as these results are also reported

per stage, for example N0, N1, N2 and N3 stages in lung cancer [36].

In conclusion, for summarizing ordinal or nominal data in diagnostic accuracy studies, this

multivariate random-effects approach is a very helpful meta-analytic method. Because of the

increased interest in meta-analysis for evidence-based guideline development, an adequate statis-

tical analysis should be used to summarize the staging findings.

��7

References

1. Cook DJ, Greengold NL, Ellrodt AG, Weingarten SR. The relation between systematic reviews and practice guidelines. Ann Intern Med 1997;127:210-216.

2. Cook DJ, Mulrow CD, Haynes RB. Systematic reviews: synthesis of best evidence for clinical decisions. Ann Intern Med 1997;126:376-380.

3. Gallagher EJ. Systematic reviews: a logical methodological extension of evidence-based medicine. Acad Emerg Med 1999;6:1255-1260.

4. Deville WL, Buntinx F, van der Windt DA, et al. Didactic guidelines for conducting systematic reviews of studies evaluating the accuracy of diagnostic tests. 2001. Knottnerus JA, editor. The evidence Base of diagnosis. London: BMJ Publishing Group.

5. Deeks JJ. Systematic reviews in health care: Systematic reviews of evaluations of diagnostic and screening tests. BMJ 2001;323:157-162.

6. Khan KS. Systematic reviews of diagnostic tests: a guide to methods and application. Best Pract Res Clin Obstet Gynaecol 2005;19(1):37-46

7. van Houwelingen HC, Arends LR, Stijnen T. Advanced methods in meta-analysis: multivariate approach and meta-regression. Stat Med 2002;21:589-624.

8. Arends LR, Voko Z, Stijnen T. Combining multiple outcome measures in a meta-analysis: an application. Stat Med 2003;22:1335-1353.

9. Reitsma JB, Glas AS, Rutjes AW, Scholten RJ, Bossuyt PM, Zwinderman AH. Bivariate analysis of sen-sitivity and specificity produces informative summary measures in diagnostic reviews. J Clin Epidemiol 2005;58:982-990.

10. Rutter CM, Gatsonis CA. A hierarchical regression approach to meta-analysis of diagnostic test accuracy evaluations. Stat Med 2001;20:2865-2884.

11. Harbord RM, Deeks JJ, Egger M, Whiting P, Sterne JA. A unification of models for meta-analysis of diag-nostic accuracy studies. Biostatistics 2006

12. Steele SR, Martin MJ, Place RJ. Flexible endorectal ultrasound for predicting pathologic stage of rectal cancers. Am J Surg 2002;184:126-130.

13. Garcia-Aguilar J, Pollack J, Lee SH, et al. Accuracy of endorectal ultrasonography in preoperative staging of rectal tumors. Dis Colon Rectum 2002;45:10-15.

14. Herzog U, von Flue M, Tondelli P, Schuppisser JP. How accurate is endorectal ultrasound in the preopera-tive staging of rectal cancer? Dis Colon Rectum 1993;36:127-134.

15. Hadfield MB, Nicholson AA, MacDonald AW, Farouk R, Lee PW, Duthie GS et al. Preoperative staging of rectal carcinoma by magnetic resonance imaging with a pelvic phased-array coil. Br J Surg 1997;84:529-531.

16. Drew PJ, Farouk R, Turnbull LW, Ward SC, Hartley JE, Monson JR. Preoperative magnetic resonance stag-ing of rectal cancer with an endorectal coil and dynamic gadolinium enhancement. Br J Surg 1999;86:250-254.

17. Bipat S, Glas AS, Slors FJ, Zwinderman AH, Bossuyt PM, Stoker J. Rectal cancer: local staging and assess-ment of lymph node involvement with endoluminal US, CT, and MR imaging--a meta-analysis. Radiology 2004;232:773-783.

18. Akaike, H. A new look at the Statistical Model Identification. IEEE Trans Automat Control 1974; 19:716-23

19. Langer C, Liersch T, Markus P, et al. Transanal endoscopic microsurgery (TEM) for minimally invasive resec-tion of rectal adenomas and “Low-risk” carcinomas (uT1, G1 - 2). Z Gastroenterol 2002;40:67-72.

20. Burghardt J, Buess G. Transanal endoscopic microsurgery (TEM): a new technique and development dur-ing a time period of 20 years. Surg Technol Int 2005;14:131-137

21. Marijnen CA, Nagtegaal ID, Kapiteijn E, et al. Radiotherapy does not compensate for positive resection margins in rectal cancer patients: report of a multicenter randomized trial. Int J Radiat Oncol Biol Phys 2003;55:1311-1320.

��

Chapter 9

22. Goldberg S, Klas JV. Total mesorectal excision in the treatment of rectal cancer: a view from the USA. Semin Surg Oncol 1998;15:87-90.

23. Wiig JN, Carlsen E, Soreide O. Mesorectal excision for rectal cancer: a view from Europe. Semin Surg Oncol 1998;15:78-86.

24. Loizzi V, Cormio G, Loverro G, Selvaggi L, Disaia PJ, Cappuccini F. Chemoradiation: A new approach for the treatment of cervical cancer. Int J Gynecol Cancer 2003;13:580-586.

25. Panici PB, Cutillo G, Angioli R. Modulation of surgery in early invasive cervical cancer. Crit Rev Oncol Hematol 2003;48:263-270.

26. duPont NC, Monk BJ. Chemotherapy in the management of cervical carcinoma. Clin Adv Hematol Oncol 2006;4:279-286.

27. Duenas-Gonzalez A, Cetina L, Mariscal I, de la GJ. Modern management of locally advanced cervical carcinoma. Cancer Treat Rev 2003;29:389-399.

28. Kelsen DP, Ginsberg R, Pajak TF, et al. Chemotherapy followed by surgery compared with surgery alone for localized esophageal cancer. N Engl J Med 1998;339:1979-1984.

29. Hulscher JB, van Sandick JW, de Boer AG, et al. Extended transthoracic resection compared with limited transhiatal resection for adenocarcinoma of the esophagus. N Engl J Med 2002;347:1662-1669.

30. Hulscher JB, Tijssen JG, Obertop H, van Lanschot JJ. Transthoracic versus transhiatal resection for carci-noma of the esophagus: a meta-analysis. Ann Thorac Surg 2001;72:306-313.

31. Mazhar D, Ngan S, Waxman J. Improving outcomes in early prostate cancer: Part II--neoadjuvant treat-ment. BJU Int 2006;98:731-734.

32. Mazhar D, Ngan S, Waxman J. Improving outcomes in early prostate cancer: Part I--adjuvant treatment. BJU Int 2006;98:725-730.

33. Pisansky TM. External-beam radiotherapy for localized prostate cancer. N Engl J Med 2006;355:1583-1591.

34. Engelbrecht MR, Jager GJ, Laheij RJ, Verbeek AL, van Lier HJ, Barentsz JO. Local staging of prostate cancer using magnetic resonance imaging: a meta-analysis. Eur Radiol 2002;12:2294-2302.

35. van Westreenen HL, Westerterp M, Bossuyt PM, et al. Systematic review of the staging performance of 18F-fluorodeoxyglucose positron emission tomography in esophageal cancer. J Clin Oncol 2004;22:3805-3812.

36. Cerfolio RJ, Ojha B, Bryant AS, Raghuveer V, Mountz JM, Bartolucci AA. The accuracy of integrated PET-CT compared with dedicated PET alone for the staging of patients with nonsmall cell lung cancer. Ann Thorac Surg 2004;78:1017-1023.

��

aPPenDIX 1

Numbers of patients with EUS T-stages and T-stages obtained by reference standard

Study T1 by reference standard

T2 by reference standard

EUS T1 T2 T3 T4 T1 T2 T3 T4 T1 T2 T3 T4 T1 T2 T3 T4

Herzog 1993 19 3 0 0 1 17 8 0 0 0 67 0 0 0 0 2

Nielsen 1993 0 0 0 0 0 2 3 0 0 0 14 0 0 0 2 7

Houvenaeghel 1993 2 3 1 1 2 5 1 0 2 2 12 0 0 0 0 1

Hulsmans 1994 1 0 2 0 1 4 16 1 0 1 25 0 0 0 1 3

Kim 1994 1 0 0 0 0 3 1 0 0 1 25 1 0 0 1 1

Adams 1995 59 0 0 0 1 2 0 0 0 0 0 0 0 0 0 0

Starck 1995 1 1 0 0 0 8 1 0 0 2 21 0 0 0 0 0

Fedyaev 1995 0 0 0 0 0 30 3 0 0 2 41 2 0 0 3 28

Kaneko 1996 3 1 0 0 0 9 1 0 0 3 21 0 0 0 0 0

Pegios 1996 6 1 0 0 0 13 3 0 0 3 51 1 0 0 7 15

Osti 1997 3 0 0 0 0 11 7 0 0 4 32 0 0 0 0 6

Sailer 1997 43 10 0 0 1 12 16 0 0 1 61 5 0 1 2 8

Hunerbein 1997 17 1 0 0 0 19 4 0 1 3 18 0 0 1 0 6

Maier 1997 4 2 0 0 0 15 6 0 0 2 45 0 0 0 4 2

Akasu 1997 32 1 1 0 4 19 11 0 0 4 78 2 0 0 3 9

Massari 1998 13 0 1 0 0 16 2 0 2 2 32 0 0 0 0 7

Scialpi 1999 0 0 0 0 0 0 0 0 0 0 16 1 0 0 1 3

Barbaro 1999 0 0 0 0 0 9 1 0 0 1 15 0 0 0 0 2

Carmody 2000 11 4 0 0 1 7 2 0 0 0 11 0 0 0 0 0

Gualdi 2000 2 0 0 0 0 5 5 0 0 1 13 0 0 0 0 0

Blomqvist 2000 1 1 3 1 4 3 4 0 0 4 25 0 0 0 0 3

Akahoshi 2000 9 1 0 0 0 2 1 0 0 5 20 0 0 0 0 1

Akasu 2000 74 4 2 0 8 40 26 0 0 6 149 0 0 0 0 0

Hunerbein 2000 16 0 0 0 0 10 1 0 0 1 1 0 0 0 0 1

Steele 2002 5 5 2 0 0 7 5 1 0 3 12 1 0 0 3 1

Garcia-Aquilar 2002 226 24 0 0 24 104 37 2 3 25 92 2 0 0 2 4

Chapter 9

Numbers of patients with MRI T-stages and T-stages obtained by reference standard

Study T1 by reference standard

MRI T1 T2 T3 T4 T1 T2 T3 T4 T1 T2 T3 T4 T1 T2 T3 T4

Okizuka 1993 11 1 1 0 2 3 1 0 0 5 7 0 0 0 0 2

Schnall 1994 8 3 0 0 0 6 4 0 0 0 15 0 0 0 0 0

Murano 1995 2 1 2 0 0 3 2 0 0 2 9 0 0 0 0 1

Pegios 1996 16 1 0 0 0 5 1 0 0 0 4 0 0 0 0 2

Hadfield 1997 2 3 2 0 0 2 2 0 0 6 17 3 0 0 1 0

Vogl 1997 5 1 0 0 0 4 1 0 0 0 5 0 0 0 0 4

Drew 1999 0 2 2 0 1 2 10 1 0 2 7 1 0 0 1 0

Gualdi 2000 2 0 0 0 0 6 4 0 0 0 14 0 0 0 0 0

Blomqvist 2000 0 2 4 0 0 4 7 0 1 2 25 2 0 0 3 0

Kim 2000 3 1 0 0 0 20 17 0 0 15 141 6 0 1 1 12

Hunerbein 2000 14 0 0 0 1 9 1 0 0 0 2 0 0 0 0 1

�6�

aPPenDIX 2

The proportions, ρ11, ρ21, ρ31, ρ41, are transformed into:

ρ11 = , ρ21 = ,

ρ31 = , ρ41 = ,

depending on the parameters: a11, a21, a31. The proportions (ρ12, ρ22, ρ32, ρ42), (ρ13, ρ23, ρ33, ρ43)

and (ρ14, ρ24, ρ34, ρ44), are transformed similarly depending on parameters (a12, a22, a32), (a13, a23,

a33) and (a14, a24, a34) respectively. We will assume that in our model, these 12 parameters (a11,

a21, a31, a12, a22, a32, a13, a23, a33, a14, a24, a34) follow a 12–variate normal distribution with means

(µ11, µ21, µ31, µ12, µ22, µ32, µ13, µ23, µ33, µ14, µ24, µ34) and covariance matrix ∑.

This covariance matrix is symmetric and has therefore 12 variances on the diagonal and (12*11/2)

66 different covariances on the 132 off-diagonal cells; the variances representing differences

between studies; σ5,8 representing covariance between parameters 5 and 8, similarly σ10, 12 rep-

resenting the covariance between parameters 10 and 12.

�6�

Chapter 9

aPPenDIX 3

Technical description of the model in which covariances are assumed to be zero. A copy of the

full SAS program is available on request from the first author.

proc NLmixed data=data df=1000;

p11=exp(a11)/(1+exp(a11)+exp(a21)+ exp(a31));

p41=1 /(1+exp(a11)+exp(a21)+ exp(a31));

p42=1 /(1+exp(a12)+exp(a22)+ exp(a32));

p43=1 /(1+exp(a13)+exp(a23)+ exp(a33));

p44=1 /(1+exp(a14)+exp(a24)+ exp(a34));

ll = n11*log(p11)+n12*log(p12)+n13*log(p13)+n14*log(p14)+

n21*log(p21)+n22*log(p22)+n23*log(p23)+n24*log(p24)+

n31*log(p31)+n32*log(p32)+n33*log(p33)+n34*log(p34)+

n41*log(p41)+n42*log(p42)+n43*log(p43)+n44*log(p44);

model n11 ~ general(ll);

random a11 a21 a31 a12 a22 a32 a13 a23 a33 a14 a24 a34 ~

normal ([µ11, µ21, µ31, µ12, µ22, µ32, µ13, µ23, µ33, µ14, µ24, µ34],

[s1,0,s2,0,0,s3,0,0,0,s4,0,0,0,0,s5,0,0,0,0,0,s6,0,0,0,0,0,0,s7,0,0,0,0,0,0,0,s8,0,0,0,0,0,0,0,0,s

9,0,0,0,0,0,0,0,0,0,s10,0,0,0,0,0,0,0,0,0,0,s11,0,0,0,0,0,0,0,0,0,0,0,s12]) subject=nr;

Data are entered into the SAS system as they are given in appendix 1: n11, n21, n31, and 41

denote the numbers of patients with T1-stage cancer identified by EUS/MRI (separately) as T1,

T2, T3, or T4, respectively, and n12, n22, n32, n42 are the numbers of patients with T2-stage

cancer but identified as T1, T2, T3, or T4 by EUS/MRI.

�6�

The variable ll denotes the sum of the four multinomial log likelihoods associated with the true

T1-, T2-, T3-, and T4-stages, and the variables p11 .... p44 are the associated cell probabilities of

the 4-by-4 table. The parameters a11 ... a34 are the “logit”-transformed parameters of the cell

probabilities, which are assumed to be normally distributed with means µ11 ... µ34, and variances

s1 ... s12. In the example above all covariances are fixed to be zero, but this is not a necessary

requirement. We used different reference categories than in the main text of the paper; here we

used the most prevalent categories as references.

�6�

Chapter 9

Reference appendix 1

1. Herzog U, von Flue M, Tondelli P, Schuppisser JP. How accurate is endorectal ultrasound in the preopera-tive staging of rectal cancer? Dis Colon Rectum 1993;36:127-134.

2. Nielsen MB, Pedersen JF, Christiansen J. Rectal endosonography in the evaluation of stenotic rectal tumors. Dis Colon Rectum 1993;36:275-279.

3. Houvenaeghel G, Delpero JR, Giovannini M, et al. Staging of rectal cancer: a prospective study of digital ex-amination and endosonography before and after preoperative radiotherapy. Acta Chir Belg 1993;93:164-168.

4. Hulsmans FJ, Tio TL, Fockens P, Bosma A, Tytgat GN. Assessment of tumor infiltration depth in rectal cancer with transrectal sonography: caution is necessary. Radiology 1994;190:715-720.

5. Kim NK, Choi JS, Sohn SK, Min JS. Transrectal ultrasonography in preoperative staging of rectal cancer. Yonsei Med J 1994;35:396-403.

6. Adams WJ, Wong WD. Endorectal ultrasonic detection of malignancy within rectal villous lesions. Dis Colon Rectum 1995;38:1093-1096.

7. Starck M, Bohe M, Fork FT, Lindstrom C, Sjoberg S. Endoluminal ultrasound and low-field magnetic reso-nance imaging are superior to clinical examination in the preoperative staging of rectal cancer. Eur J Surg 1995;161:841-845.

8. Fedyaev EB, Volkova EA, Kuznetsova EE. Transrectal and transvaginal ultrasonography in the preoperative staging of rectal carcinoma. Eur J Radiol 1995;20:35-38.

9. Kaneko K, Boku N, Hosokawa K, et al. Diagnostic utility of endoscopic ultrasonography for preoperative rectal cancer staging estimation. Jpn J Clin Oncol 1996;26:30-35.

10. Pegios W, Vogl J, Mack MG, et al. MRI diagnosis and staging of rectal carcinoma. Abdom Imaging 1996;21:211-218.

11 Osti MF, Padovan FS, Pirolli C, et al. Comparison between transrectal ultrasonography and computed to-mography with rectal inflation of gas in preoperative staging of lower rectal cancer. Eur Radiol 1997;7:26-30.

12. Sailer M, Leppert R, Kraemer M, Fuchs KH, Thiede A. The value of endorectal ultrasound in the assessment of adenomas, T1- and T2-carcinomas. Int J Colorectal Dis 1997;12:214-219.

13. Hunerbein M, Schlag PM. Three-dimensional endosonography for staging of rectal cancer. Ann Surg 1997;225:432-438.

14. Maier AG, Barton PP, Neuhold NR, Herbst F, Teleky BK, Lechner GL. Peritumoral tissue reaction at tran-srectal US as a possible cause of overstaging in rectal cancer: histopathologic correlation. Radiology 1997;203:785-789.

15. Akasu T, Sugihara K, Moriya Y, Fujita S. Limitations and pitfalls of transrectal ultrasonography for staging of rectal cancer. Dis Colon Rectum 1997;40(10 Suppl):S10-S15.

16. Massari M, De Simone M, Cioffi U, Rosso L, Chiarelli M, Gabrielli F. Value and limits of endorectal ultraso-nography for preoperative staging of rectal carcinoma. Surg Laparosc Endosc 1998;8:438-444.

17. Scialpi M, Rotondo A, Angelelli G. Water enema transvaginal ultrasound for local staging of stenotic rectal carcinoma. Abdom Imaging 1999;24:132-136.

18. Barbaro B, Schulsinger A, Valentini V, Marano P, Rotman M. The accuracy of transrectal ultrasound in predicting the pathological stage of low-lying rectal cancer after preoperative chemoradiation therapy. Int J Radiat Oncol Biol Phys 1999;43:1043-1047.

19. Carmody BJ, Otchy DP. Learning curve of transrectal ultrasound. Dis Colon Rectum 2000;43:193-197.

20. Gualdi GF, Casciani E, Guadalaxara A, d’Orta C, Polettini E, Pappalardo G. Local staging of rectal cancer with transrectal ultrasound and endorectal magnetic resonance imaging: comparison with histologic find-ings. Dis Colon Rectum 2000;43:338-345.

21. Blomqvist L, Machado M, Rubio C, et al. Rectal tumour staging: MR imaging using pelvic phased-array and endorectal coils vs endoscopic ultrasonography. Eur Radiol 2000;10:653-660.

�6�

22. Akahoshi K, Kondoh A, Nagaie T, et al. Preoperative staging of rectal cancer using a 7.5 MHz front-loading US probe. Gastrointest Endosc 2000;52:529-534.

23. Akasu T, Kondo H, Moriya Y, et al. Endorectal ultrasonography and treatment of early stage rectal cancer. World J Surg 2000;24:1061-1068.

24. Hunerbein M, Pegios W, Rau B, Vogl TJ, Felix R, Schlag PM. Prospective comparison of endorectal ultra-sound, three-dimensional endorectal ultrasound, and endorectal MRI in the preoperative evaluation of rectal tumors. Preliminary results. Surg Endosc 2000;14:1005-1009.

25. Steele SR, Martin MJ, Place RJ. Flexible endorectal ultrasound for predicting pathologic stage of rectal cancers. Am J Surg 2002;184:126-130.

26. Garcia-Aguilar J, Pollack J, Lee SH, et al. Accuracy of endorectal ultrasonography in preoperative staging of rectal tumors. Dis Colon Rectum 2002;45:10-15.

27. Okizuka H, Sugimura K, Ishida T. Preoperative local staging of rectal carcinoma with MR imaging and a rectal balloon. J Magn Reson Imaging 1993;3:329-335.

28. Schnall MD, Furth EE, Rosato EF, Kressel HY. Rectal tumor stage: correlation of endorectal MR imaging and pathologic findings. Radiology 1994;190:709-714.

29. Murano A, Sasaki F, Kido C, et al. Endoscopic MRI using 3D-spoiled GRASS (SPGR) sequence for local staging of rectal carcinoma. J Comput Assist Tomogr 1995;19:586-591.

30. Hadfield MB, Nicholson AA, MacDonald AW, et al. Preoperative staging of rectal carcinoma by magnetic resonance imaging with a pelvic phased-array coil. Br J Surg 1997 April;84(4):529-31.

31. Vogl TJ, Pegios W, Mack MG, et al. Accuracy of staging rectal tumors with contrast-enhanced transrectal MR imaging. AJR Am J Roentgenol 1997;168:1427-1434.

32. Drew PJ, Farouk R, Turnbull LW, Ward SC, Hartley JE, Monson JR. Preoperative magnetic resonance stag-ing of rectal cancer with an endorectal coil and dynamic gadolinium enhancement. Br J Surg 1999;86:250-254.

33. Kim NK, Kim MJ, Park JK, Park SI, Min JS. Preoperative staging of rectal cancer with MRI: accuracy and clinical usefulness. Ann Surg Oncol 2000;7:732-737.

2C h a p t e r

Summary and Conclusions

�6�

summary and Conclusions

sUMMaRY

The aim of the work summarized in this thesis was to collect primary findings and to summarize

evidence for diagnosing and staging of several gynecological and gastrointestinal malignancies by

means of either ultrasonography (US), computed tomography (CT), magnetic resonance imaging

(MRI) or positron emission tomography with 18-fluorodeoxyglucose (FDG-PET). The results of our

analyses can be used for making evidence-based recommendations for clinical practice.

In chapter 2 lymph nodes detection by MRI in patients with vulva tumor was evaluated. As

lymphatic spread remains an important prognostic factor in these patients and no uniform results

are described in the literature, two observers retrospectively analyzed the accuracy of MRI for

lymph node detection in 60 patients with vulva carcinoma. The MRI findings were compared

to pathological results of sentinel node procedures or lymphadenectomy specimen. Sensitivity,

specificity, positive and negative predictive values were 52%, 85%, 46%, and 87% for observer

1 and 52% 89%, 52%, and 89% for observer 2.

We concluded that at this stage no role exists for standard MRI in evaluating lymph node

involvement in patients with vulva carcinoma.

In chapter 3 we performed a systematic review on the diagnostic performance of CT and

MRI in staging of cervical carcinoma. The diagnostic performance of both modalities in the evalu-

ation of parametrial invasion, bladder and rectum invasion, and lymph node involvement were

reported.

Sensitivity estimates for parametrial invasion were 74% (95% CI: 68%-79%) for MRI and 55%

(95% CI: 44%-66%) for CT, and for lymph node involvement, 60% (95% CI: 52%-68%) and 43%

(95% CI: 37%-57%), respectively. MRI and CT had comparable specificities for parametrial inva-

sion and lymph node involvement.

For bladder invasion and rectum invasion the sensitivities for MRI were 75% (95% CI: 66%-

83%) and 71% (95% CI: 53%-83%), respectively, both higher (not significant) compared than for

CT. The specificity in evaluating bladder invasion for MRI was significantly higher compared with

CT: 91% (95% CI: 83%-95%) for MRI and 73% (95% CI: 52%-87%) for CT. The specificities for

rectum invasion were comparable.

On the basis of our systematic review, the following clinical practice guidelines could be for-

mulated for staging cervical carcinoma. In clinically early-stage cancer, the prevalence of disease

spreading outside the cervix is low and therefore the additional value of MRI is limited. In more

advanced disease, MRI can play an important role, as clinical staging has significant limitations in

advanced disease.

In chapter 4 we compared the diagnostic value of endoluminal US (EUS), CT and MRI for local

staging and assessment of lymph node involvement in patients with rectal cancer by means of a

meta-analysis.

For muscularis propria invasion, EUS and MRI had similar sensitivities. Specificities were 86%

(95% CI: 80%-90%) and 69% (95% CI: 52%-82%) respectively. For perirectal tissue invasion,

sensitivity of EUS was 90% (95% CI: 88%-92%); significantly higher than that of CT (79% [95%

CI: 74%-84%]) and that of MRI (82% [95% CI: 74%-87%]). Specificities were comparable. For

�6�

adjacent organ invasion and lymph node involvement, estimates for EUS, CT, and MRI were

comparable.

The summary ROC curve for EUS in detecting perirectal tissue invasion showed better diagnos-

tic accuracy than for CT and MRI. Summary ROC curves for lymph node involvement showed no

differences in accuracy.

On the basis of the results of this meta-analysis, EUS seems to be a better diagnostic imaging

test for local staging than CT and MRI. However, EUS cannot identify the mesorectal fascia and

possible tumor involvement, while this is possible with MRI. Mesorectal fascia identification is im-

portant in determining the spread of tumors in patients considered for total mesorectal excision.

EUS could be helpful in selecting patients for available therapeutic strategies, such as transanal

endoscopic microsurgery. The identification of lymph nodes with EUS, CT, and MRI remains a

major point of concern.

In chapter 5 we systematically reviewed the evidence on the diagnostic value of US, CT and

MRI for diagnosis and determining resectability of pancreatic adenocarcinoma. The aim of this

study was to obtain summary estimates of US, conventional CT, helical CT, and MRI for the diag-

nosis and determination of resectability of pancreatic adenocarcinoma.

For diagnosis, sensitivities of US, conventional CT, helical CT, and MRI were 76%, 86%, 91%,

and 84% and specificities were 75%, 79%, 85%, and 82%, respectively. Sensitivities for MRI and

US were significantly lower than with helical CT (P = 0.04 and P = 0.0001).

For determining resectability, sensitivities of US, conventional CT, helical CT, and MRI were

83%, 82%, 81%, and 82% and specificities were 63%, 76%, 82%, and 78% respectively. Specific-

ity of US was significantly lower than with helical CT (P = 0.011).

Based on the high sensitivity for diagnosis of helical CT compared with MRI and US and the

high specificity for resectability compared with US, we feel that helical CT is preferable as imaging

modality for the diagnosis and assessment of resectability of pancreatic adenocarcinoma.

In chapter 6 we summarized the findings of a survey performed on the management of

patients with colorectal liver metastases in the Netherlands. In this survey we documented the

extent of variation in diagnosis and treatment strategies. This survey showed substantial variation

in the diagnostic and therapeutic work-up of patients with colorectal liver metastases. This varia-

tion reflects either under- or over-utilization of diagnosis and treatment options. Evidence-based

guidelines taking into account the available evidence, experience and availability could solve this

variation (presented in chapter 8).

In chapter 7 the results of a meta-analysis on the diagnostic accuracy of CT, MRI and FDG-PET

for detection of colorectal liver metastases were presented. The aim of this study was to obtain

estimates of sensitivity of CT, MRI and FDG-PET for detection of colorectal liver metastases on

per-patient and per-lesion basis.

Sensitivity estimates on a per-patient basis for non-helical CT, helical CT, 1.5 T MRI, and FDG-PET

were 60.2%, 64.7%, 75.8%, and 94.6%, respectively; FDG-PET was the most accurate modality.

On a per-lesion basis, sensitivity estimates for non-helical CT, helical CT, 1.0 T MRI, 1.5 T MRI,

and FDG-PET were 52.3%, 63.8%, 66.1%, 64.4%, and 75.9%, respectively; non-helical CT had

the lowest sensitivity. Estimates of gadolinium-enhanced MRI and superparamagnetic iron oxide

(SPIO)-enhanced MRI were significantly better, compared with non-enhanced MRI (P = 0.019 and

P < 0.001, respectively) and with helical CT with ≤ 45 g of iodine (P = 0.02 and P < 0.001, respec-

tively). For lesions ≥ 1 cm, SPIO-enhanced MRI was the most accurate modality (P < 0.001).

The choice between helical CT performed with > 45 g of iodine and MRI with a gadolinium-

based contrast agent or SPIO should also depend on availability and expertise and not on diag-

nostic accuracy only. The role of FDG-PET for the detection of colorectal liver metastases at this

moment is limited. FDG-PET will therefore mainly be used as an additional imaging modality for

detection of extrahepatic disease.

In chapter 8 we summarized an evidence-based guideline developed for diagnosis and treat-

ment of patients with colorectal liver metastases. The most important recommendations for

diagnosis and treatment are:

For synchronous liver metastases, imaging should comprise spiral CT or MRI. For evaluation of

lung metastases, imaging can be limited to chest radiography.

For detection of metachronous liver metastases, ultrasonography could be performed as initial

modality, if the entire liver is adequate visualized. In doubtful cases or potential candidates for

surgery, CT or MRI should be performed as additional imaging.

For evaluation of extrahepatic disease, abdominal and chest CT could be performed. FDG-PET

could be valuable in patients selected for surgery based on CT (liver/abdomen/chest), for identify-

ing extrahepatic disease.

Surgical resection is the treatment of choice with a 5-years survival of 30%-40%. Variation in

selection criteria for surgery is caused by equivocal literature data concerning surgical margins

<10 mm, presence of extrahepatic disease and the role of (neo)-adjuvant therapy. A substantial

variation also exists in surgical techniques (e.g. anatomic vs. non-anatomic). To minimize variation

in selection criteria, selection should be performed according to the guideline and preferable in

qualified centers.

If resection is not possible, due to extensive disease, palliative chemotherapy is recommended.

Systemic chemotherapy with fluoropyrimidine first-line chemotherapy (5-FU/Leucovorin) com-

bined with irinotecan or oxaliplatin should be considered as standard regimens.

Radiofrequency ablation, isolated hepatic perfusion, portal vein embolization, and intra-arterial

chemotherapy are considered experimental and should only be performed as part of a clinical

research protocol.

In chapter 9 we described a new multivariate random-effects approach for meta-analysis of

staging data, one that allows calculating estimates of correct staging, under and overstaging.

In cancer, the differentiation between disease and non-disease is not the only issue. Correct

staging, understaging and overstaging of the tumor are equally important. When using existing

meta-analytic approaches, such as the bivariate random-effects approach, data should be di-

chotomized for calculation of sensitivity and specificity. In a four category T-stage staging system,

three dichotomies can be made: T1 vs. T2+T3+T4, T1+T2 vs. T3+T4 and T1+T2+T3 vs. T4). These

dichotomies lose information on understaging and overstaging.

By using above mentioned multivariate random-effects approach, estimates of correct staging,

understaging and overstaging were calculated. We also calculated summary estimates of sensitiv-

�7�

ity and specificity for the three dichotomizations. For all these aims a subset of data sets previously

analyzed by the bivariate approach for comparing Endoluminal US (EUS) and MRI in staging rectal

cancer was used (chapter 4).

EUS was more accurate than MRI in staging (80% vs. 72%), but the differences were not

statistically significant. MRI especially seemed to more often overstage than EUS: T1 cancers were

more often seen by MRI as T2 or even T3 (12% vs. 2%), and T2 cancers were more often seen as

T3 (42% vs. 28%).

The sensitivity and specificity estimates of EUS or MRI for the three dichotomizations, calcu-

lated from the results of the multivariate approach, were comparable with the sensitivity and

specificity estimates of EUS or MRI obtained by the bivariate random-effects analyses.

We believe that the multivariate random-effects approach can be a very helpful, informative

and precise meta-analytic method for summarizing ordinal or nominal data in diagnostic accuracy

studies.

ConClUsIons anD fUTURe ReseaRCH

The work in this thesis illustrates that performing diagnostic studies of sufficient methodological

quality and systematically reviewing the literature are both valuable and essential tools in making

evidence-based recommendations for clinical practice.

MethodologyIn general, systematic reviews of diagnostic accuracy studies go through a number of key stages:

defining a clinical question, performing comprehensive literature search, defining explicit selec-

tion criteria to identify relevant studies, assessing the methodological quality of included studies,

assessing data on the diagnostic accuracy, explore differences among study results, and synthe-

size study results, if possible in a quantitative way through meta-analysis. At each level various

problems may arise, e.g. several studies may never appear in print (publication bias), studies may

be selectively included in reference lists, their methodological quality may be suboptimal or the

papers suffer from incomplete data reporting. Although advanced statistical approaches exist to

overcome some of these problems, a number of issues in analyzing the evidence need careful

consideration:

Like any other systematic review, systematic reviews of diagnostic accuracy studies are threat-

ened by publication-bias. In general, funnel plots are used to investigate publication bias in ran-

domized controlled trials; the log odds ratio is plotted against the sample size or precision (recip-

rocal of the standard error). Various statistical tests such as Begg’s rank correlation [1], Egger’s

regression test [2] and Macaskill’s regression [3] have been proposed to objectively study plot

asymmetry. Song et al [4] supposed that the funnel plots and Begg’s and Egger’s tests could also

be used for reviews of diagnostic tests. Deeks et al [5] recently however reported that the Begg’s,

Egger’s and Macaskill’s tests are misleading if used in reviews of diagnostics accuracy studies and

proposed a new approach: effective sample size funnel plots and associated regression tests for

�7�

asymmetry. Future research should focus on a more uniform approach on both studying publica-

tion bias and on handling publication bias in meta-analysis of diagnostic accuracy studies.

Many studies on diagnostic accuracy have major shortcomings in design or conduct. Several

authors showed the influence of shortcomings in methodological criteria on the diagnostic value

of a test [6-8]. Important criteria that may affect the diagnostic value of the test are: retrospective

study design, non-blinded reading of imaging and reference standard results, use of different ref-

erence standards (differential verification), the absence of details about the diagnostic criteria for

the diagnostic test and insufficient information about the study population. In our meta-analyses,

we assessed these criteria by the QUADAS tool [9] and studied the effect of these criteria on

the diagnostic accuracy and adjusted where needed. However it is important to remember that

studies may be assessed as having poor methodological quality either because they were poorly

conducted or poorly reported.

In many studies on diagnostic accuracy, the study design is not or incompletely reported.

The research methods, characteristics of study population and test procedures are often poorly

reported in primary research. A set of minimal reporting standards for diagnostic research has

recently been developed and introduced (Standards for Reporting of Diagnostic Accuracy initia-

tive [10] to improve the quality of the reporting of diagnostic studies. Without complete and ac-

curate reporting we cannot correctly identify potential sources of heterogeneity between studies.

Moreover adequate reporting of the study design will facilitate reviewing the evidence and the

implementation of the findings from the literature. In all, there is strong need: 1) to improve the

methodological quality of diagnostic accuracy studies and 2) to better reporting of studies.

The statistical methodology for meta-analysis of diagnostic accuracy studies is a long way

behind compared to the methodology for meta-analysis of the results of randomized controlled

trials [11-13]. In recent years, the importance of meta-analyses of diagnostic accuracy studies for

evidence-based guidelines is increasingly recognized. All existing methods use dichotomous data

(in terms of sensitivity and specificity). However, in cancer, the differentiation between disease

and non-disease is not the only issue. Correct staging, understaging and overstaging are equally

important. In chapter 9, we describe an approach how to analyze this type of data. Future meta-

analysis of staging data should therefore use a proper approach to summarize these findings in

stead of dichotomizing these data.

Clinical/radiological aspectsImprovements in imaging of several abdominal and pelvic malignancies areas are required to

improve the management of these patients.

In many cancers, such as in gynecological cancer and rectal cancer, evaluation of lymph nodes

is important. The presence or absence of malignant lymph nodes will alter management. So far,

the detection of malignant lymph nodes by any imaging modalities seems to be disappointing

(Chapters 2-4). Recent advantages in lymph node specific MRI contrast agents have shown im-

provements in the differentiation between benign and malignant lymph nodes. In several initial

studies the use of Ultrasmall SuperParamagnetic Iron Oxide particles (USPIO) led to improved

sensitivity of MRI in the detection of malignant lymph nodes [14-18]. Macrophage transport of

�7�

particles will result in accumulation of USPIO in the normal or hyperplastic lymph nodes. In malig-

nant lymph nodes however, macrophages are being replaced by metastatic cells that will prevent

accumulation of the iron particles in (area’s of) the lymph nodes. This technique can be used to

discriminate benign and malignant lymph nodes. At this moment the use of these iron particles

for the detection of malignant lymph nodes is in phase III clinical trials [19, 20].

Another major improvement in radiological imaging is the introduction of combined PET-com-

puted tomography (PET-CT) scanners. In the past few years, PET-CT imaging has increasingly been

used for the diagnosis, staging, monitoring of response and restaging of malignant diseases.

The success of this emerging modality has primarily been attributed to its ability to combine the

advantages of both PET and CT imaging while minimizing their weaknesses. PET is a functional

imaging modality, which is mainly used in clinical oncology to detect primary and metastatic

disease by assessing the metabolic activity of cells [21-25]. Images from dedicated PET scanners

are characterized by long scan durations, which increase patient motion. Moreover there is a lack

of an anatomic framework to correlate the functional information depicted in the PET image.

On the other hand, CT is an anatomic imaging modality that is characterized by high acquisition

speed, high resolution, relatively low noise, and excellent tissue differentiation. In isolation, CT

offers little functional information and depends largely on size and morphology in differentiating

tumor from normal structures [26, 27]. By combining a PET and a CT scanner in a single imaging

device, the disadvantages of one imaging modality could be offset by the other, leading to a

hybrid imaging device with great potential impact on diagnostic clinical oncology [28-32]. Numer-

ous reports have shows the usefulness of PET-CT in tumor diagnosis and staging, (pre-treatment),

monitoring response (post-treatment) and in the follow-up for detection of recurrence [33-36].

However clear-cut indications for the use of PET-CT are not defined for all oncological areas.

Future research should focus on these indications, and the role of PET-CT with respect to other

imaging techniques.

Diagnostic imaging is playing a major role in the management of patients with cancer in the

abdomen. To improve the management of these patients, several new techniques (MRI with liver

specific contrast agents, introduction of PET-CT, lymph node specific contrast agents) are being

developed and evaluated in the literature. The diagnostic value of these techniques should be

evaluated to determine the role and place in the management of these patients in order to de-

veloped evidence-based guidelines. This is mainly done by systematically reviewing the literature

and/or by meta-analysis. If so, it is important to use proper statistical approaches to summarize

the findings of the literature.

�7�

References

1. Begg CB, Mazumdar M. Operating characteristics of a rank correlation test for publication bias. Biometrics 1994;50:1088-1101.

2. Egger M, Davey SG, Schneider M, Minder C. Bias in meta-analysis detected by a simple, graphical test. BMJ 1997;315:629-634.

3. Macaskill P, Walter SD, Irwig L. A comparison of methods to detect publication bias in meta-analysis. Stat Med 2001;20:641-654.

4. Song F, Khan KS, Dinnes J, Sutton AJ. Asymmetric funnel plots and publication bias in meta-analyses of diagnostic accuracy. Int J Epidemiol 2002;31:88-95.

5. Deeks JJ, Macaskill P, Irwig L. The performance of tests of publication bias and other sample size effects in systematic reviews of diagnostic test accuracy was assessed. J Clin Epidemiol 2005;58:882-893.

7. Whiting P, Rutjes AW, Reitsma JB, Glas AS, Bossuyt PM, Kleijnen J. Sources of variation and bias in studies of diagnostic accuracy: a systematic review. Ann Intern Med 2004;140:189-202.

8. Rutjes AW, Reitsma JB, Di Nisio M, Smidt N, van Rijn JC, Bossuyt PM. Evidence of bias and variation in diagnostic accuracy studies. CMAJ 2006 February 14;174(4):469-76.

10. Bossuyt PM, Reitsma JB, Bruns DE, et al. Towards complete and accurate reporting of studies of diagnostic accuracy: The STARD Initiative. Radiology 2003;226:24-28.

11. Rutter CM, Gatsonis CA. A hierarchical regression approach to meta-analysis of diagnostic test accuracy evaluations. Stat Med 2001 October 15;20(19):2865-84.

12. Littenberg B, Moses LE. Estimating diagnostic accuracy from multiple conflicting reports: a new meta-analytic method. Med Decis Making 1993;13:313-321

14. Bellin MF, Lebleu L, Meric JB. Evaluation of retroperitoneal and pelvic lymph node metastases with MRI and MR lymphangiography. Abdom Imaging 2003;28:155-163

15. Harisinghani MG, Saini S, Weissleder R, et al. MR lymphangiography using ultrasmall superparamagnetic iron oxide in patients with primary abdominal and pelvic malignancies: radiographic-pathologic correla-tion. AJR Am J Roentgenol 1999;172:1347-1351.

16. Keller TM, Michel SC, Frohlich J, et al. USPIO-enhanced MRI for preoperative staging of gynecological pelvic tumors: preliminary results. Eur Radiol 2004;14:937-944.

17. Kim JH, Beets GL, Kim MJ, Kessels AG, Beets-Tan RG. High-resolution MR imaging for nodal staging in rectal cancer: are there any criteria in addition to the size? Eur J Radiol 2004;52:78-83.

18. Rockall AG, Sohaib SA, Harisinghani MG, et al. Diagnostic performance of nanoparticle-enhanced mag-netic resonance imaging in the diagnosis of lymph node metastases in patients with endometrial and cervical cancer. J Clin Oncol 2005;23:2813-2821.

19. Anzai Y, Piccoli CW, Outwater EK, et al. Evaluation of neck and body metastases to nodes with ferumox-tran 10-enhanced MR imaging: phase III safety and efficacy study. Radiology 2003;228:777-788.

20. Harisinghani MG, Saksena MA, Hahn PF, et al. Ferumoxtran-10-enhanced MR lymphangiography: does contrast-enhanced imaging alone suffice for accurate lymph node characterization? AJR Am J Roentgenol 2006;186:144-148.

21. Bomanji JB, Costa DC, Ell PJ. Clinical role of positron emission tomography in oncology. Lancet Oncol 2001;2:157-164.

�7�

22. Delbeke D. Oncological applications of FDG PET imaging: brain tumors, colorectal cancer, lymphoma and melanoma. J Nucl Med 1999;40591-40603.

23. Gambhir SS, Czernin J, Schwimmer J, Silverman DH, Coleman RE, Phelps ME. A tabulated summary of the FDG PET literature. J Nucl Med 2001;42(5 Suppl):1S-93S.

24. Hustinx R, Benard F, Alavi A. Whole-body FDG-PET imaging in the management of patients with cancer. Semin Nucl Med 2002;32:35-46.

25. Smith TA. FDG uptake, tumour characteristics and response to therapy: a review. Nucl Med Commun 1998;19:97-105.

26. Beigelman-Aubry C, Hill C, Guibal A, Savatovsky J, Grenier PA. Multi-detector row CT and postprocessing techniques in the assessment of diffuse lung disease. Radiographics 2005;25:1639-1652.

27. Rydberg J, Liang Y, Teague SD. Fundamentals of multichannel CT. Radiol Clin North Am 2003 May;41(3):465-74.

28. Antoch G, Stattaus J, Nemat AT, et al. Non-small cell lung cancer: dual-modality PET/CT in preoperative staging. Radiology 2003;229:526-533.

29. Lardinois D, Weder W, Hany TF, et al. Staging of non-small-cell lung cancer with integrated positron-emis-sion tomography and computed tomography. N Engl J Med 2003;348:2500-7.

30. Gould MK, Kuschner WG, Rydzak CE, et al. Test performance of positron emission tomography and computed tomography for mediastinal staging in patients with non-small-cell lung cancer: a meta-analysis. Ann Intern Med 2003;139:879-892.

31. Goerres GW, von Schulthess GK, Steinert HC. Why most PET of lung and head-and-neck cancer will be PET/CT. J Nucl Med 2004;45 Suppl 1:66S-71S.

32. Mawlawi O, Pan T, Macapinlac HA. PET/CT imaging techniques, considerations, and artifacts. J Thorac Imaging 2006;21:99-110.

33. Amit A, Beck D, Lowenstein L et al. The role of hybrid PET/CT in the evaluation of patients with cervical cancer. Gynecol Oncol 2006;100:65-69.

34. Antoch G, Saoudi N, Kuehl H, et al. Accuracy of whole-body dual-modality fluorine-18-2-fluoro-2-deoxy-D-glucose positron emission tomography and computed tomography (FDG-PET/CT) for tumor staging in solid tumors: comparison with CT and PET. J Clin Oncol 2004;22:4357-4368.

36. Israel O, Mor M, Guralnik L, et al. Is 18F-FDG PET/CT useful for imaging and management of patients with suspected occult recurrence of cancer? J Nucl Med 2004;45:2045-2051.

2C h a p t e r

Samenvatting en Conclusies

�7�

saMenVaTTInG

Het onderzoek in dit proefschrift had als doel het verzamelen van primaire resultaten en het sa-

menvatten van het wetenschappelijke bewijs ten aanzien van diagnose en stadiering van verschil-

lende gynaecologische en gastrointestinale maligniteiten door middel van echografie, computer

tomografie (CT), Magnetische Resonantie Imaging (MRI) en Positron Emissie Tomografie met

18-fluorodeoxyglucose (FDG-PET). De resultaten van onze analyses kunnen een bijdrage leveren

aan het ontwikkelen van evidence-based richtlijnen voor de klinische praktijk.

In hoofdstuk 2 is de diagnostische waarde van MRI voor de detectie van lymfeklieren in

patiënten met vulvatumor bepaald. Omdat de aanwezigheid van maligne lymfeklieren een be-

langrijke prognostische factor is in deze patiënten en er geen uniforme resultaten in de literatuur

beschreven zijn ten aanzien van preoperatieve detectie van de klieren, hebben twee beoorde-

laars, retrospectief en onafhankelijk van elkaar, MRI onderzoeken van 60 patiënten met vulva

tumor geëvalueerd op de aanwezigheid van maligne lymfeklieren. De MRI bevindingen werden

vergeleken met histopathologische bevindingen, verkregen middels sentinel node procedure

of lymfadenectomie. Sensitiviteit, specificiteit, positief voorspellende en negatief voorspellende

waarde waren respectievelijk 52%, 85%, 46%, en 87% voor beoordelaar 1 and 52% 89%, 52%,

en 89% voor beoordelaar 2.

De conclusie van dit onderzoek is dat er op dit moment geen rol is voor MRI bij de evaluatie

van lymfeklieren in patiënten met vulva tumor.

In hoofdstuk 3 zijn de resultaten van een systematische review naar de diagnostische waarde

van CT en MRI bij het stadieren van cervix carcinoom gerapporteerd. De diagnostische waarden

(sensitiviteit en specificiteit) van beide modaliteiten voor het identificeren van invasie in parame-

tria, blaas en rectum en voor de detectie van maligne lymfeklieren werden bepaald.

Sensitiviteiten van MRI en CT voor het beoordelen van invasie in parametria waren 74% (95%

CI: 68%-79%) en 55% (95% CI: 44%-66%) respectievelijk. Voor de detectie van maligne lymfe-

klieren, 60% (95% CI: 52%-68%) en 43% (95% CI: 37%-57%), respectievelijk. De specificiteiten

van beide modaliteiten voor het beoordelen van parametria invasie en lymfeklieren waren verge-

lijkbaar.

Voor het evalueren van invasie van de blaas en het rectum, waren de sensitiviteiten van MRI,

75% (95% CI: 66%-83%) en 71% (95% CI: 53%-83%), respectievelijk; beide (niet significant)

hoger vergeleken met de waarden van CT. De specificiteit van MRI voor het evalueren van blaas

invasie was significant hoger vergeleken met die van CT: 91% (95% CI: 83%-95%) en 73% (95%

CI: 52%-87%). De specificiteiten van beide modaliteiten voor de beoordeling van rectum invasie

waren vergelijkbaar.

Op basis van de bevindingen van onze systematische review, werden de volgende aanbeve-

lingen voor het stadieren van cervix carcinoom gedaan. In patiënten met klinische laagstadia

tumoren, is de prevalentie van invasie buiten de cervix laag en daarom de additionele waarde

van MRI ook beperkt. Echter in patiënten met hoogstadia tumoren kan MRI een belangrijke rol

spelen; klinisch onderzoek heeft namelijk beperkingen ten aanzien van het bepalen van invasie

buiten de cervix.

samenvatting en Conclusies

�7�

In hoofdstuk 4 hebben we de diagnostische waarden van endoluminale echografie (EUS), CT en

MRI voor het beoordelen van lokale en regionale (lymfeklieren) stadiering van rectum tumoren

vergeleken middels een meta-analyse.

Voor het beoordelen van muscularis propria invasie, hadden EUS en MRI vergelijkbare sensitivi-

teiten. Specificiteiten waren 86% (95% CI: 80%-90%) en 69% (95% CI: 52%-82%) respectievelijk.

Voor het beoordelen van de invasie van het perirectale weefsel, was de sensitiviteit van EUS 90%

(95% CI: 88%-92%); significant hoger dan die van CT en van MRI (79% [95% CI: 74%-84%] en

82% [95% CI: 74%-87%]). Specificiteiten waren vergelijkbaar. Voor de beoordeling van invasie

van aangrenzende organen/structuren en voor de detectie van maligne lymfeklieren hadden alle

modaliteiten vergelijkbaren sensitiviteiten en specificiteiten.

Ook de summary ROC curve van EUS bij het bepalen van invasie van het perirectale weefsel

liet een betere diagnostische accuratesse zien vergeleken met CT en MRI. Summary ROC curven

voor de detectie van lymfeklieren lieten geen verschillen zien in accuratesse tussen de drie moda-

liteiten.

Op basis van de resultaten van deze meta-analyse, lijkt EUS een betere diagnostische modali-

teit te zijn voor het bepalen van lokale stadiering dan CT en MRI. Echter EUS is niet in staat om

de mesorectale fascie te identificeren, terwijl dit wel mogelijk is met MRI. De identificatie van de

mesorectale fascie is belangrijk voor het bepalen van de afstand van de tumor tot deze fascie,

in patiënten die potentieel in aanmerking komen voor totale mesorectale excisie (TME). EUS

kan echter wel gebruikt worden voor de selectie van patiënten voor beschikbare therapeutische

strategieën, zoals transanale endoscopische microchirurgie (TEM) en de TME. De identificatie van

lymfeklieren met EUS, CT, en MRI blijft een belangrijk probleem.

In hoofdstuk 5 hebben wij het beschikbare wetenschappelijke onderzoek naar de diagnosti-

sche waarde van echografie (US), CT en MRI bij de diagnose en het bepalen van resectabiliteit van

pancreas carcinoom systematisch samengevat. Het doel van dit onderzoek was om diagnostische

uitkomstmaten te verkrijgen van echografie, conventionele CT, spiraal CT en MRI voor de diag-

nose en het bepalen van resectabiliteit van pancreas adenocarcinoom.

Voor de diagnose waren de sensitiviteiten van US, conventionele CT, spiraal CT, en MRI respec-

tievelijk 76%, 86%, 91%, en 84% en de specificiteiten respectievelijk 75%, 79%, 85%, en 82%.

De sensitiviteiten van MRI en US waren significant lager dan die van spiraal CT (P = 0.04 en P =

0.0001).

Voor het bepalen van resectabiliteit waren de sensitiviteiten van US, conventionele CT, spiraal

CT, en MRI respectievelijk 83%, 82%, 81%, en 82% en specificiteiten respectievelijk 63%, 76%,

82%, en 78%. Specificiteit van US was significant lager dan die van spiraal CT (P = 0.011).

Vanwege de hoge sensitiviteit van spiraal CT voor de diagnose van pancreascarcinoom verge-

leken met MRI en US en de hoge specificiteit van spiraal CT voor het bepalen van resectabiliteit

vergeleken met US, kan spiraal CT als modaliteit van keuze beschouwd worden voor zowel de

diagnose en het bepalen van de resectabiliteit van pancreas adenocarcinoom.

In hoofdstuk 6 hebben wij de resultaten van een enquête gepresenteerd. In de enquête werd

gevraagd naar de diagnostiek en behandeling van patiënten met colorectale levermetastasen. Uit

de resultaten van de enquête bleek dat er in Nederland veel variatie bestond in zowel de diagnose

��0

als behandeling van deze patiënten. Deze enquête diende ook om factoren die de implementatie

van de te ontwikkelen richtlijn in de weg zouden kunnen staan, op te sporen. Een evidence-based

richtlijn waarin rekening wordt gehouden met beschikbare evidence, de expertise en beschikbaar-

heid kan de variatie verhelpen (gepresenteerd in hoofdstuk 8).

In hoofdstuk 7 worden de resultaten van een meta-analyse gepresenteerd. In deze meta-

analyses werden de diagnostische uitkomstmaten van CT, MRI en FDG-PET voor de detectie van

colorectale levermetastasen vergeleken. Het doel van dit onderzoek was om schattingen van

sensitiviteit van CT, MRI en FDG-PET op per-patient en per-laesie basis te verkrijgen.

Op per-patient basis waren de sensitiviteiten van conventionele CT, spiraal CT, 1.5 T MRI,

en FDG-PET 60.2%, 64.7%, 75.8%, en 94.6%, respectievelijk; FDG-PET was de meest accurate

modaliteit.

Op per-laesie basis, waren de sensitiviteiten van conventionele CT, spiraal CT, 1.0 T MRI, 1.5 T

MRI, en FDG-PET, respectievelijk 52.3%, 63.8%, 66.1%, 64.4%, en 75.9%; conventionele CT had

de laagste sensitiviteit. Sensitiviteiten van MRI met gadolinium en MRI met superparamagnetische

ijzer oxide (SPIO) waren significant beter, vergeleken met MRI zonder contrast (P = 0.019 en P <

0.001, respectievelijk) en vergeleken met spiraal CT met ≤ 45 g Jodium (P = 0.02 en P < 0.001, res-

pectievelijk). Voor laesies ≥ 1 cm, was MRI met SPIO de meest accurate modaliteit (P < 0.001).

De keuze tussen spiraal CT met > 45 gr Jodium of MRI met gadolinium of MRI met SPIO, moet

ook afhankelijk zijn van de beschikbaarheid en expertise en niet alleen van de diagnostische ac-

curatesse. De rol van FDG-PET voor de detectie van colorectale levermetastasen op dit moment

is beperkt. FDG-PET zal daarom vooral als additionele imaging modaliteit voor de detectie van

extrahepatische afwijkingen worden toegepast.

In hoofdstuk 8 vatten we een evidence-based richtlijn samen, die ontwikkeld is voor de diag-

nose en behandeling van patiënten met colorectale levermetastasen. De belangrijkste aanbeve-

lingen zijn als volgt:

Voor de detectie van synchrone levermetastasen wordt geadviseerd om een spiraal CT of MRI

van de lever te verrichten. Voor de beoordeling van de longen wordt geadviseerd om beeldvor-

mende diagnostiek te beperken tot een X-thorax.

Voor het aantonen van metachrone levermetastasen kan beeldvorming beperkt blijven tot

echografie, als de lever als geheel echografisch goed te beoordelen is. Als dit niet het geval is, of

als irresectabiliteit niet kan worden vastgesteld, dient er aanvullend onderzoek middels Spiraal CT

of MRI verricht te worden.

Voor evaluatie van de extrahepatische afwijkingen, kan een abdominale en thoracale CT ver-

richt worden. FDG-PET kan een bijdrage leveren in patiënten die geselecteerd zijn voor resectie

op basis van CT (lever/abdomen/thorax), voor de identificatie van aanvullende extrahepatische

afwijkingen.

Resectie is de behandeling van keuze met een 5-jaars overleving van 30%-40%. Er blijkt echter

een enorme variatie in prognostische factoren te bestaan. Over marges < 10 mm bestaat er

geen duidelijkheid. Over het beleid bij aanwezigheid van extrahepatische metastasen bestaan

er twijfels. Er is een enorme variatie waarneembaar in de technische uitvoering van resecties

(anatomische vs. niet-anatomische). Duidelijke resultaten over de effectiviteit van combinatiebe-

��

handelingen met (neo)-adjuvante chemotherapie ontbreken. Om de variatie in selectie criteria

te minimaliseren, dienen patiënten geselecteerd te worden zoals beschreven in de richtlijn en bij

voorkeur in centra met veel ervaring.

Als resectie niet mogelijk is, gezien de uitgebreidheid en/of niet-resectabele metastasen, wordt

palliatieve chemotherapie geadviseerd. Voor systemische therapie wordt een fluoropyrimidine be-

vattende eerstelijns chemotherapie (zowel orale 5-FU prodrugs als intraveneus 5-FU/Leucovorin)

gecombineerd met oxaliplatin of irinotecan als standaardbehandeling beschouwd.

Radiofrequente ablatie, geisoleerde lever perfusie, vena-porta embolisatie en intra-arteriele

chemotherapie worden als experimentele behandelingstechnieken beschouwd en dienen bij

voorkeur in onderzoeksverband uitgevoerd te worden.

In hoofdstuk 9 hebben wij een nieuw methode voor het analyseren van stadiering data be-

schreven. Met dit multivariaat random-effects model kunnen schattingen van correcte stadiering

en de mate van onder- en overstadiering berekend worden.

Bij kanker is de differentiatie tussen ziekte en niet-ziekte niet het enige punt van belang, maar

ook de correcte stadiering en de mate van onder- en overstadiering zijn belangrijk. Bij bestaande

meta-analytische methoden, zoals het bivariaat random-effects model, dient data gedichotomi-

seerd te worden, alvorens sensitiviteit en specificiteit berekend kan worden. Bijvoorbeeld in 4-ca-

tegorie stadiering data, kunnen drie dichotomieën worden verkregen: T1 vs. T2+T3+T4, T1+T2 vs.

T3+T4 en T1+T2+T3 vs. T4). Hierbij gaat informatie met betrekking tot onder- en overstadiering

verloren.

Bovengenoemd multivariaat random-effects model werd gebruikt om schattingen van cor-

recte stadiering en van onder- en overstadiering te berekenen. Ook werden schattingen van

sensitiviteit en specificiteit voor de drie dichotomieën berekend. Voor deze doelen werd een

subset van data (data vanaf 1993) die eerder voor de vergelijking van Endoluminale US (EUS) en

MRI voor de stadiering van rectum tumoren (chapter 4) was geanalyseerd volgens het bivariaat

random-effects model opnieuw geanalyseerd.

EUS was accurater dan MRI in correcte stadiering (80% vs. 72%), maar de verschillen waren

niet statistisch significant. Overstadiering met MRI was groter vergeleken met EUS: middels MRI

werden T1 tumoren vaker als T2/T3 gestadieerd (12% vs. 2%), en T2 tumoren werden ook vaker

als T3 gestadieerd (42% vs. 28%).

De sensitiviteit en specificiteit van EUS en MRI voor de drie dichotomieën, berekend met de

resultaten van het multivariaat random-effects model, waren vergelijkbaar met de sensitiviteit en

specificiteit verkregen met bivariaat random-effects analyses.

Het multivariaat random-effects model kan een behulpzame, informatieve en precieze metho-

de zijn voor het analyseren van ordinale of nominale data voor meta-analyses van diagnostische

accuratesse studies.

��

ConClUsIes en ToeKoMsTIG onDeRZoeK

Het onderzoek beschreven in dit proefschrift laat zien dat zowel het uitvoeren van primaire diag-

nostische studies van voldoende methodologische kwaliteit en het systematisch samenvatten van

de literatuur, beiden waardevolle en essentiële middelen zijn voor het formuleren van evidence-

based aanbevelingen voor de klinische praktijk.

MethodologieOver het algemeen, zijn er een aantal essentiële stappen in het uitvoeren van systematische

reviews van diagnostische accuratesse studies: het definiëren van een klinische vraag, het uitvoe-

ren van een uitgebreide literatuur zoekstrategie, het definiëren van criteria voor het selecteren

van relevante artikelen, het scoren van methodologische kwaliteitskenmerken, het verzamelen

van data over de diagnostische accuratesse, het verklaren van heterogeniteit in studie resulta-

ten, het analyseren van studie resultaten en als het mogelijk is het analyseren van data in een

kwantitatieve methode “meta-analyses”. Op elk niveau kunnen er problemen zijn: bijvoorbeeld,

studies die nooit gepubliceerd zijn (publicatie bias), studies waarvan de methodologische kwaliteit

suboptimaal is of studies die niet alle relevante data (incompleet data) rapporteren. Ondanks dat

er geavanceerde statistische methoden zijn om bovengenoemde problemen te verhelpen, zijn er

een aantal issues, waar aandacht aan besteed dient te worden:

Zoals in elke systematische review, kan ook in systematische reviews van diagnostische accu-

ratesse studies sprake zijn van publicatie bias. Over het algemeen worden in gerandomiseerde

gecontroleerde trials, funnel plots gecreëerd voor het bestuderen van publicatie bias; de log odds

ratio wordt afgezet tegen het aantal patiënten of tegen de precisie (inverse van de standaard

error). Verschillende statistische testen zoals Begg’s rank correlatie [1], Egger’s regressie test [2]

en Macaskill’s regressie test [3] zijn in de literatuur beschreven om de plot asymmetrie objectief

te beoordelen. Song et al [4] suggereren dat funnel plots en Begg’s en Egger’s testen ook kun-

nen worden gebruikt voor het bestuderen van publicatie bias in reviews van diagnostische testen.

Deeks et al [5] echter rapporteren dat de Begg’s, Egger’s en Macaskill’s testen misleidend zijn als

ze worden toegepast in reviews van diagnostische accuratesse studies en komen met een nieuw

voorstel, de zogeheten effective sample size funnel plots en hieraan geassocieerde regressie

testen voor asymmetrie. Door de enorme diversiteit aan methoden, dient in de toekomst meer

aandacht besteed te worden aan het ontwikkelen van een uniform model voor het bestuderen

van publicatie bias en hoe om te gaan met publicatie bias in reviews van diagnostische accura-

tesse studies.

Veel diagnostische accuratesse studies hebben tekortkomingen in de methodologische ka-

rakteristieken. Verschillende auteurs hebben aangetoond dat deze tekortkomingen vaak effect

hebben op de diagnostische waarde van een bepaalde test [6-8]. Belangrijke karakteristieken die

een effect kunnen hebben op de diagnostische waarde van een test zijn: retrospective studie

design, niet geblindeerde beoordeling van imaging en referentie standaard bevindingen, toepas-

sen van verschillende referentie standaarden (differentiele verificatie), het ontbreken van details

over de diagnostische criteria van een diagnostische test en onvoldoende beschrijving van de

��

studie populatie. In onze meta-analyses, hebben we al deze karakteristieken gescoord volgens

de QUADAS methode [9] en hebben we bestudeerd wat de effecten van tekortkomingen op de

diagnostische waarde waren en waar nodig gecorrigeerd. Het is echter belangrijk om te weten

dat studies welke gescoord worden als methodologisch slechte studies, niet per se suboptimaal

uitgevoerd zijn, maar dat dit ook veroorzaakt kan worden door suboptimale rapportage van de

studies.

In veel studies over diagnostische accuratesse, wordt de uitvoering van de studie incompleet

gerapporteerd. Recent is er een lijst ontwikkeld waarin wordt aangegeven wat er minimaal ge-

rapporteerd dient te worden in een diagnostische accuratesse studie (Standards for Reporting

of Diagnostic Accuracy initiative [10]); dit voor het verbeteren van rapportage van diagnostische

studies. Zonder complete en accurate reportage, is het heel lastig om de oorzaken van de hete-

rogeniteit tussen studieresultaten te bestuderen. Ook kan adequate reportage van studie design

karakteristieken een positieve bijdrage leveren aan het verzamelen van wetenschappelijk bewijs

en de implementatie van deze bevindingen uit de literatuur in de praktijk. Concluderend, het is

belangrijk om 1) de methodologische kwaliteit van diagnostische accuratesse studies te verbete-

ren en 2) de rapportage van studies te verbeteren.

De statistische methodologie van meta-analyses van diagnostische accuratesse studies loopt

achter op de ontwikkeling van de methodologie van meta-analyses van gerandomiseerde gecon-

troleerde trials [11-13]. In de afgelopen jaren is het belang van meta-analyses van diagnostische

accuratesse studies voor het ontwikkelen van evidence-based richtlijnen enorm toegenomen.

Echter, alle bestaande methoden maken gebruik van dichotome data (voor de berekening van

sensitiviteit en specificiteit). Echter, bij patiënten met een tumor is het onderscheid tussen zieken

en niet-zieken niet alleen belangrijk, maar ook correcte stadiering, alsook de mate van onder- en

overstadiering. In hoofdstuk 9, beschrijven we een methode waarmee stadiering data geanaly-

seerd kan worden. Toekomstige meta-analyses van stadiering data dient middels een geschikte

methode geanalyseerd te worden in plaats van deze data dichotomiseren.

Klinische/radiologische aspectenOntwikkelingen op het gebied van imaging van abdominale en pelvine aandoeningen hebben

vooral als doel het verbeteren van de zorg van patiënten met deze aandoeningen.

Bij patiënten met bijvoorbeeld gynaecologische en rectum tumoren, is de evaluatie van lymfe-

klieren belangrijk. De aan- of afwezigheid van maligne lymfeklieren kan leiden tot het veranderen

van het beleid. Tot nu toe is de detectie of maligne lymfeklieren middels een beeldvormende

modaliteit teleurstellend (Hoofdstukken 2-4). Recente ontwikkelingen op het gebied van lym-

feklierspecifiek MRI contrast middelen (Ultrasmall SuperParamagnetic Iron Oxide=USPIO) laten

verbeteringen zien in het onderscheid tussen benigne en maligne lymfeklieren bij MRI. Verschil-

lende pilot studies rapporteren een hoge sensitiviteit voor de detectie van maligne lymfeklieren

als USPIO wordt toegediend [14-18]. Macrofagen in lymfeklieren zorgen voor transport van US-

PIO partikels, wat resulteert in accumulatie van deze partikels in normale of in hyperplastische

lymfeklieren. In maligne lymfeklieren echter wordt normaal lymfklierweefsel met macrofagen

vervangen door metastasen en vindt er geen of nauwelijks accumulatie van de partikels plaats.

��

Hierdoor kan er onderscheid gemaakt worden tussen benigne en maligne klieren. Op dit moment

worden de diagnostische waarde van toediening van USPIO partikels onderzocht in fase III studies

[19, 20].

Een andere belangrijke ontwikkeling op het gebied van radiologische imaging is de introductie

van de gecombineerde PET-CT scanners. In de afgelopen jaren is de toepassing van PET-CT voor

de diagnose, stadiering, monitoren van respons en restadiering van maligniteiten toegenomen.

Het succes van deze gecombineerde techniek komt primair tot stand door de combinatie van de

voordelen van afzonderlijke PET en CT technieken in combinatie met het minimaliseren van de

beperkingen van de technieken. PET is een functionele imaging modaliteit, welke vooral wordt

toegepast in de klinische oncologie voor de detectie van primaire tumor en metastasen, waarbij

de toegenomen metabolische activiteit van tumorweefsel wordt vastgesteld [21-25]. Echter, de

scan tijd van PET is lang, welke gepaard gaat met beweging van patiënten. Ook ontbreekt er een

anatomisch frame om de functionele informatie verkregen met het PET plaatje te correleren. CT

is juist een anatomische imaging modaliteit, gekarakteriseerd door hoge acquisitie snelheid, hoge

resolutie, relatief lage ruis en goede weefseldifferentiatie. CT verschaft echter weinig informatie

over functie en is erg afhankelijk van vorm en grootte voor het maken van onderscheid tussen

tumor en normale structuren [26, 27]. Door het combineren van PET en CT scanner in een enkele

modaliteit, konden de tekortkomingen van de ene modaliteit worden gecompenseerd door de

voordelen van de andere modaliteit [28-32]. Veel onderzoeken hebben het belang van de toe-

passing van PET-CT bij de diagnose en stadiering van tumor, het monitoren van respons en in de

follow-up voor de detectie van recidief [33-36] aangetoond. Echter duidelijke indicaties voor de

toepassing van PET-CT ontbreken nog voor veel toepassingen. Toekomstig onderzoek dient meer

aandacht te besteden aan deze indicaties, en aan de plaats van PET-CT in relatie tot de plaats van

andere imaging modaliteiten.

Diagnostiek middels radiologische technieken speelt een belangrijke rol bij de behandeling van

patiënten met tumoren in het abdomen en het bekken. Om het beleid voor deze patiënten te ver-

beteren, zijn verschillende nieuwe technieken (MRI met lever specifiek contrastmiddel, introductie

van PET-CT scanners, lymfeklier specifiek contrast middelen) ontwikkeld en ook geëvalueerd in

de literatuur. De diagnostische waarde van deze technieken dient geëvalueerd te worden om te

bepalen wat de rol en plaats van deze technieken in het beleid van deze patiënten is; dit alles

met als doel het ontwikkelen van evidence-based richtlijnen. De rol en plaats van deze technieken

wordt vooral bepaald door het systematisch reviewen van de literatuur en/of door het uitvoeren

van meta-analyses. Daarom is het dus belangrijk om geschikte statistische methoden te gebruiken

voor het analyseren van de bevindingen uit de literatuur.

��

Referenties