The Search for Meaning-Symptoms and Transvaginal Sonography Screening for Ovarian Cancer
←
→
Page content transcription
If your browser does not render page correctly, please read the page content below
Original Article The Search for Meaning—Symptoms and Transvaginal Sonography Screening for Ovarian Cancer Predicting Malignancy Edward J. Pavlik, PhD1; Brook A. Saunders, MD1; Stacey Doran, BS1; Katherine W. McHugh, BA1; Frederick R. Ueland, MD1; Christopher P. DeSimone, MD1; Paul D. DePriest, MD1; Rachel A. Ware, MD1; Richard J. Kryscio, PhD2; and John R. van Nagell, Jr., MD1 BACKGROUND: The mortality rate of ovarian cancer is greater than that of all other major gynecologic malignancies. Detecting ovarian cancer at an early and curable stage long has been an objective of oncolo- gists. Recently, it was reported that certain symptom patterns are informative for the presence of ovarian malignancy. In this article, the authors report on how symptoms and ultrasound predict ovarian malignancy. METHODS: Two hundred seventy-two women who were participating in annual transvaginal sonography (TVS) screening were selected from among 31,748 women who were enrolled. Symptom results were corre- lated with ultrasound and surgical pathology findings. RESULTS: TVS performed better than symptoms analysis for detecting malignancies (sensitivity, 73.3% vs 20%), and symptoms analysis performed better for distinguishing benign tumors (specificity, 91.3% vs 74.4%). The use of TVS and symptoms analysis in se- ries resulted in poorer identification of malignancy (sensitivity, 16.7%) but improved the ability to distin- guish benign tumors (specificity, 97.9%). Decisions using either symptoms or TVS combined in parallel had small increases in sensitivity (þ3.3%) and had coordinated, small decreases in specificity (5.8%). CONCLU- SIONS: Symptoms did identify ovarian malignancies, but not as well as TVS. The current findings indicated that: 1) tumors that are negative by both ultrasound and a symptoms index are likely to be benign (speci- ficity, >97%), and 2) adding symptoms information that has weight equal to the weight of ultrasound only slightly improves the discrimination of malignancy (sensitivity increase, þ3.3%). Thus, a major benefit in dis- criminating malignancy was achieved through ultrasound, whereas the absence of symptoms in conjunction with an abnormal ultrasound (characterized by a low morphology index) indicated that the mass was be- nign and that surgery may not be required. Finally, informative symptoms can be expected to be absent in 80% of patients with ovarian malignancies. Cancer 2009;115:3689–98. V C 2009 American Cancer Society. KEY WORDS: ovary, screening, symptoms, ultrasound. Corresponding author: Edward J. Pavlik, PhD, Division of Gynecologic Oncology, Department of Obstetrics and Gynecology, University of Kentucky Medical Center, 800 Rose Street, Lexington, KY 40536; Fax: (859) 323-1018; epaul1@uky.edu 1 Division of Gynecologic Oncology, Department of Obstetrics and Gynecology, the University of Kentucky Chandler Medical Center-Markey Cancer Center, Lexington, Kentucky; 2Department of Biostatistics, the University of Kentucky Chandler Medical Center-Markey Cancer Center, Lexington, Kentucky See editorial on pages 3606–8, this issue. Received: September 12, 2008; Revised: November 4, 2008; Accepted: November 26, 2008 Published online: July 14, 2009 V C 2009 American Cancer Society DOI: 10.1002/cncr.24407, www.interscience.wiley.com Cancer August 15, 2009 3689
Original Article Ovarian cancer remains the fourth leading cause of cystic ovarian tumor with a solid or papillary projection 1 cancer death in US women. This year, the lives of over into its lumen.20 Morphology indexing has been useful in 15,000 women will be claimed by ovarian cancer in the predicting the risk of malignancy21-24 and was performed United States alone.2 Pelvic examination is notably inac- according to the classification of Ueland et al.20 Each tu- curate in detecting subtle changes in ovarian size and mor- mor was given a score of from 1 to 10 according to phology, particularly in postmenopausal women.3 increasing morphologic complexity and volume, as out- Although ovarian cancer has been perceived as a ‘‘silent lined in Figure 1, with increasing numerical scores for killer’’ that produces few specific symptoms, recent studies septa, papillary projections, solid areas, and extratumoral have indicated that certain symptoms are significantly free fluid. Each final morphology index (MI) score is the more common in women with ovarian cancer than in sum of the volume score (1-5) and the structure score (1- women in the general population.4,5 These findings have 5). resulted in the design of a symptoms index6 that report- Patients with ovarian cancer on frozen section or edly has utility for identifying early stage ovarian cancer, patients with obvious metastatic disease at laparoscopy which often is curable by conventional therapy.7,8 It underwent immediate exploratory laparotomy and stag- appears, then, that the symptoms index could have utility ing. Tumors were classified histologically according to the in a screening context to detect ovarian cancer at an earlier World Health Organization system and were staged and more curable stage.9-17 according to the International Federation of Gynecology Since 1987, the University of Kentucky Ovarian and Obstetrics system. Cancer Screening Project has examined the efficacy of an- The selection group consisted of those members of nual transvaginal sonography (TVS) as a screening the source group who underwent surgery after an method for ovarian cancer, providing free screening to over 31,000 women who participate in this program. The results of this screening study have been reported recently.18 The current report applies the symptoms index analysis6 to this large study group as a validation study to determine whether its addition improves the performance of ovarian cancer screening. MATERIALS AND METHODS Patients enrolled in the University of Kentucky Ovarian Cancer Screening Project from January 1987 to June 2008 composed the source group (n ¼ 31,748). Approval was received from the University of Kentucky Institu- tional Review Board. Eligibility, exclusions, instrumenta- tion, protocol, criteria for designating an abnormality, and data collected were as recently reported.18 Partici- pants in the screening program received free annual screening after a normal result (ie, 74.2% were scheduled 11-13 months after a normal screen, and 7.8% were scheduled even earlier, whereas 17.9% were scheduled later). After an abnormal screening result, free repeat screening was scheduled at 4 to 6 weeks. Criteria for ab- normality included ovarian volume (values >2 standard FIGURE 1. Morphology index schematic is shown. deviations above the mean volume of normal ovaries in premenopausal and postmenopausal women)19 and any 3690 Cancer August 15, 2009
Symptoms & TVS Screening for Ovarian CA/Pavlik et al abnormal TVS finding (n ¼ 450) and included women level. Analysis of receiver operating characteristic (ROC) with either malignant or benign tumors. The study group curves also was performed.25-27 was formed from the selection group and was made up of women in the selection group who had returned confident responses on the symptoms index questionnaire (n ¼ 272; RESULTS 60.4%). Excluded from this group were 178 women who There were 31,748 women who enrolled in the University had died (1 stage II and 3 stage III ovarian tumors), had with- of Kentucky ovarian screening study from January 1987 drawn from the study, who were unwilling to take the symp- to June 2008, and a family history of ovarian cancer was toms survey, or who were not confident in their answers. documented in 22.5% of these women. Symptoms Index Analysis Clinical Characteristics of the Women The symptoms survey in the form published by Goff et al6 Selected for Study on the Basis of Surgery was used and occupied an entire 8.5 11-inch page Related to TVS Screening printed in landscape format. The only deviations from the The candidates who were selected for the current study all symptoms survey published by Goff et al6 were that 1) were involved actively in the University of Kentucky dark-contrast symptoms space separators replaced the Ovarian Screening Program and had undergone surgery open white spaces used by Goff et al6, and 2) a single con- because of an abnormal TVS result. The selection group fidence assessment query was added. The dark-contrast contained 450 women who underwent surgery related to space separators improved the readability of the form and a TVS finding, and a family history of ovarian cancer was were intended to prevent visual errors arising from row or documented in 123 of those women (27.4%). Members column cross-over. The confidence query question was of the selection group who had confident responses to the ‘‘How confidently did you answer these questions?’’ symptoms questionnaire defined the study group of 272 Response choices for the query were 0 ¼ no confidence, women. Twenty-seven percent of women in the study 1 ¼ minimally sure, 2 ¼ more than minimally sure, 3 ¼ group reported a family history of ovarian cancer. The pretty sure, 4 ¼ sure, and 5 ¼ absolutely sure of accuracy.’’ clinical characteristics of the patients screened are shown The response query was used to identify recall bias or in Table 1. There were no significant differences between responses that were affected by the respondent’s memory. the selection group and the study group with respect to Thirteen individuals without malignancies were excluded age, gravidity, parity, weight, height, body mass index because they did not have a confidence response >3, and (BMI), body surface area, CA 125, family history of ovar- 165 individuals were excluded because they did not wish to ian, breast, or colon cancer, use of hormone-replacement participate or because they could not be contacted. The cri- therapy, or nulliparity (Table 1). There was no significant teria reported by Goff et al for any specific symptom (pelvic difference between the histopathologies of the selection pain, abdominal pain, increased abdominal size, bloating, group and the study group (ie, responders to the symp- feeling full, and difficulty eating), frequency (>12 times toms questionnaire) (Table 2). per month), and duration (12 days per month with a duration
Original Article Table 1. Clinical Characteristics of the Selection Group (n¼450), the Study Group (n¼272), and the Nonparticipating Group (n¼178) Selection Group Study Group Nonparticipating Group Variable* Mean6SEM Range No. (%) Mean6SEM Range No. (%) Mean6SEM Range No. (%) Age, y 58.40.5 32-89 450 58.40.6 32-89 272 58.40.8 34-84 178 Gravidity 2.50.08 0-12 450 2.60.1 0-12 272 2.40.1 0-7 178 Parity 2.20.07 0-10 450 2.30.1 0-10 272 2.10.1 0-6 178 Weight, pounds 163.31.7 80-350 450 164.32.2 99-350 272 161.82.6 80-300 178 Height, inches 64.50.12 54-72 450 64.50.2 54-72 272 64.50.2 57-71 178 BMI, lbs/in2 27.60.3 15.1-57.8 450 27.70.4 18-58 272 27.40.4 15-51 178 BSA, m2 1.80.01 1.3-2.8 450 1.80.01 1.4-2.8 272 1.80.02 1.2-2.5 178 CA 125, U/mL 37.111 1-1500 148 36.913.9 1-1550 113 3711.9 2-279 35 Family history Ovarian cancer 123 (27) 73 (27) 50 (28) Breast cancer 196 (44) 116 (43) 81 (45) Colon cancer 100 (22) 70 (26) 30 (17)y History of HRT 145 (32) 99 (36) 55 (26) Nulliparous 58 (13) 30 (11) 28 (16) SEM indicates standard error of the mean; BMI, body mass index; BSA, body surface area; HRT, hormone-replacement therapy. * There were no significant differences in age, parity, gravidity, weight, height, BMI, BSA, or CA 125; P < .9; Student t test) or in family history, use of HRT, or nulliparity (chi-square value, 3.285; P¼.511). y Significantly different (P¼.052). Table 2. Histologic Findings in the Selection Group Table 3. Summary of Reported Symptoms (n¼272)* (n¼450) and the Study Group (n¼272)* No. of Patients (%) No. of Patients (%) Symptom Goff Positive Goff Negative Finding Selection Study Group Group Back pain NA 85 (31.3) Frequent urination NA 84 (30.9) Primary ovarian cancer 49 (10.9) 30 (11) Urinary urgency NA 80 (29.4) Malignant 32 (7.1) 15 (5.5) Fatigue NA 79 (29) GC/LMP 17 (3.8) 15 (5.5) Constipation NA 74 (27.2) Serous cystadenoma 192 (42.7) 114 (41.9) Abdominal bloating 14 (5.1) 65 (23.9) Endometrioma 30 (6.7) 18 (6.6) Pelvic pain 15 (5.5) 59 (21.7) Mucinous cystadenoma 24 (5.7) 14 (5.1) Pain with intercourse NA 59 (21.7) Cystic teratoma 21 (4.7) 12 (4.4) Indigestion NA 58 (21.3) Hemorrhagic cyst 9 (2) 6 (2.2) Leg swelling NA 58 (21.3) Fibroma/thecoma/Brenner tumor 33 (7.7) 22 (8.1) Increased abdominal size 15 (5.5) 50 (18.4) Leiomyomata 19 (4.2) 16 (5.9) Irregular menses NA 50 (18.4) Hydrosalpinx/paratubal 29 (6.4) 16 (5.9) Diarrhea NA 49 (18) Other 44 (9.4) 24 (8.8) Difficulty breathing NA 45 (16.5) Bleeding after menopause NA 33 (12.1) Nausea/vomiting NA 32 (11.8) GC/LMP, granulosa cell tumor/tumor with low malignant potential. Bleeding with intercourse NA 31 (11.4) * No significant differences were observed between the selection group ver- Palpable abdomen NA 28 (10.3) sus the study group (chi-square statistic, 5.616: P¼.898). Feels full quickly 9 (3.3) 21 (7.7) Weight loss NA 21 (7.7) Abnormal eating 3 (1.1) 14 (5.1) (Table 3). Approximately the same number of women NA indicates not applicable. * Based on the symptoms survey in the form published by Goff et al.6 reported having no symptoms (n ¼ 124; 45.6%) as reported symptoms that were not considered positive (n ¼ women with ovarian cancer in the study group who 121; 44.5%) on the Goff symptoms index (Table 4). Of the responded to the symptoms questionnaire, 6 women (20%) 27 women who were positive on the Goff symptoms index, satisfied the Goff et al criteria for being symptoms-positive, 6 women had ovarian cancers (4 malignancies and 2 tumors and 80% did not. The study group was very similar to the of low malignant potential) (Table 5). Thus, of the 30 selection group with respect to tumor stage, and 75% had 3692 Cancer August 15, 2009
Symptoms & TVS Screening for Ovarian CA/Pavlik et al Table 4. Response to Symptoms Questionnaire (n¼272) Table 6. Tumor Stage Reported Symptoms No. of Patients (%) No. of Patients (%) Reported experiencing no symptoms 124 (45.6) Group Early Stage Late Stage Reported symptoms not 121 (44.5) (I/II) (III) considered positive Selection group (n¼450) 37 (75.5) 12 (24.5) on the symptoms index Study group (n¼272) 23 (76.7) 7 (23.3) Reported symptoms 27 (9.9) Informative symptoms 6 (100) 0 (0) considered positive on positive: TP the Goff symptoms index TP indicates true positive. Table 5. Histologic Findings in Women With a Positive Symptoms Index (n¼27) Table 7. Predictive Ability of the Symptoms Index on Initial or Subsequent Screening Visits (n¼272)* Histologic Finding No. of Women Primary ovarian cancer 6 No. of Patients Malignant 4 Prediction First Subsequent GC/LMP 2 Status Encounter Encounters Simple serous cyst 11 Cystic teratoma 1 True positive 4 2 Hemorrhagic cyst 1 False positive 12 9 Fibroma/thecoma/Brenner tumor 1 True negative 116 105 Leiomyomata 2 False negative 9 15 Hydrosalpinx/paratubal 2 Other 3 * There were no statistically significant differences (chi-square statistic, 2.779; P¼.669). GC/LMP, granulosa cell tumor/tumor with low malignant potential. early stage disease (Table 6). All of the women who had between the TP rate (TPR) ‘‘benefits’’ and the FP rate true-positive (TP) results identified by informative symp- (FPR) ‘‘costs,’’ was used to examine the effectiveness of toms had early stage disease (Table 6). TVS alone and when combined with the symptoms index Because symptoms that persisted for >12 months (Fig. 2). The TPR and the FPR were defined as noted by could be more likely to occur on the first screening encounter Suojanen with regard to correctness.28,29 Screening per- than on subsequent annual return screens, results originating formance was graphed at and above each MI score with from the entry screen were compared with results originating TVS alone in the selection group (n ¼ 450) and in the on subsequent screens. There were no significant differences study group (n ¼ 272) (Fig. 2, open squares). A line of no in TP, false-positive (FP), true-negative (TN), or false-nega- discrimination describing a random guess or outcome (ie, tive (FN) results that were identified on the entry screen ver- flipping a coin) is included on Figure 2 as a dashed diago- sus subsequent screens (Table 7). Consequently, no effect on nal line. For the study group, the distance from the line of the symptoms reporting was observed because of the possibil- no discrimination was greatest at MI scores >4 to 5, a ity of symptoms persisting for >12 months on the first visit, relation that was mirrored in the selection group (Fig. 2). supporting the examination of symptoms results without Combining the symptoms index with TVS screening regard to the number of screening visits. moved the ROC space plot dramatically closer to the line of no discrimination, drastically reducing both the area Identification of Malignancy: Comparisons under the ROC curve and the area between the ROC With the Symptoms Index Alone and With TVS curve and the line of no discrimination (Fig. 2, open Does the symptoms index in conjunction with circles). By using a logistic regression analysis, the MI transvaginal sonography improve the identification score alone (Fig. 2, open squares) significantly (P
Original Article chosen negative result calculated at 0.816. Adding the symptoms results (Fig. 2, open circles) resulted in an in- significant improvement (P ¼ .08; AUC ¼ 0.820) on this model. The performance of symptoms alone for identifying malignancy (Table 8, Group A) was examined in relation to TVS alone in the study group using the 2 dichotomiza- tions shown in Figure 2 that had the best ROC perform- ance (ie, Group B [MI score 4] and Group C [MI score 5]) (Table 8). TVS (based on either the Group B or the Group C dichotomization) had much higher sensitivity FIGURE 2. Receiver operating characteristic analysis of trans- and lower specificity than the symptoms index. Combin- vaginal sonography (TVS) and symptoms analysis is shown. The performance of TVS alone is represented by solid circles ing the symptoms index with TVS (Table 8, Groups D for the selection group and by open squares for the study and E) in series improved specificity but reduced sensitiv- group. Open circles represent the discrimination of malignancy by both TVS and symptoms together. The points were plotted ity. The results reported here with the symptoms index with respect to malignancy index values greater than or equal (Table 8, Group A) resulted in lower sensitivity (Table 8, to the number above each symbol. The diagonal dashed line Group H) and higher specificity than reported by Goff (line of no discrimination) describes a random guess or out- come. Ranges for sensitivity and specificity were obtained from et al6. Because sensitivity and specificity are derived inde- the 2007 report by Goff et al6 and were used for maximal and pendently of each other, we examined the range of sensi- minimal calculations (the box marked Published Symptoms Data). Solid squares at a true-positive (TP) rate (TPR) between tivities and specificities reported by Goff et al6 (Table 8, 0.8 and 0.9 near origin of the false-positive (FP) rate (FPR) Group H) in an ROC curve context, Figure 2 (the boxed superimpose the overall results from TVS in the entire Univer- window identified as ‘‘Published Symptoms Data sity of Kentucky screening group (for sensitivity and specificity source values, see Table 9, Group J). Sensitivity ¼ TP/(TP þ [Goff]’’). The published symptoms results underper- FN) ¼ the positive rate ¼ TPR. FPR ¼ 1specificity. TPR and formed TVS alone, and a portion of these results reached FPR were defined according to Suojanen.28,29 and crossed below the line of no discrimination. The Table 8. Performance of Either the Symptoms Index or Transvaginal Sonography for Identifying Malignancies When Used Together in the Study Group (n¼272) Group* TP TN FP FN No. Sensitivity, Specificity, PPV, NPV, ACC, %† %‡ % % % A: Symptoms positive 6 221 21 24 272 20 91.3 22.2 90.2 83.5 B: TVS, MI4 27 135 107 3 272 90 55.8 20.1 97.8 59.6 C: TVS, MI5 22 180 62 8 272 73.3 74.4 26.2 95.7 74.3 D: Symptoms positive and MI4 5 235 7 25 272 16.7 97.1 41.7 90.4 88.2 E: Symptoms positive and MI5 5 237 5 25 272 16.7 97.9 50 90.5 89 F: Symptoms positive or MI4 28 121 121 2 272 93.3 50 18.8 98.4 54.8 G: Symptoms positive or MI5 23 164 78 7 272 76.7 67.8 22.8 95.9 68.8 Reference basis H: Goff 20076 — — — — 233§ 33-47 61-75 — — — I: TVS: van Nagell 200718 51 24,954 313 9 25,327 85 98.7 14 -27.1| 99.96 98.7 J: TVS: Up to June 2008 62 31,287 388 11 31,748 84.9 98.8 13.8-20.6| 99.96 99.8 TP indicates true positive; TN, true negative; FP, false positive; FN, false negative; PPV, positive predictive value (TP/[TPþFP]); NPV, negative predictive value (TN/[TNþFN]); ACC, accuracy ([TPþTN]/[TPþTNþFPþFN]); TVS, transvaginal sonography; MI, morphology index. * Screening findings were dichotomized on the basis of an MI4 versus an MI5. Patients who had an MI4 made up Groups B, D, and F; and patients who had TVS findings with an MI5 made up Groups C, E, and G. y Sensitivity¼TP/(TPþFN)¼TP rate. z Specificity¼TN/(TNþFP). § The number of patients in the group that received ultrasound. | The PPV lower value was for the entire screening group and the higher value in range was for the period from 2000 to the present, when patients did not undergo surgery for simple cysts. 3694 Cancer August 15, 2009
Symptoms & TVS Screening for Ovarian CA/Pavlik et al Table 9. Age and the Performance of Either the Symptoms Index or Transvaginal Sonography for Identifying Malignancies Group* TP TN FP FN Sensitivity, Specificity, PPV, NPV, ACC, % % % % % Premenopausal (N¼45) A: Symptoms positive 1 36 6 2 33.3 85.7 14.3 94.7 33.3 B: TVS, MI4 2 25 17 1 66.7 59.5 10.5 96.2 66.7 C: TVS, MI5 2 32 10 1 66.7 76.2 16.7 97 66.7 D: Symptoms positive and MI4 0 39 3 3 0 92.9 0 92.9 0 E: Symptoms positive and MI5 0 40 2 3 0 95.2 0 93 0 F: Symptoms positive or MI4 3 22 20 0 100 52.4 13 100 100 G: Symptoms positive or MI5 3 28 14 0 100 66.7 17.6 100 100 Postmenopausal (N¼227) H: Symptoms positive 5 185 15 22 18.5 92.5 25 89.4 18.5 I: TVS, MI4 25 110 90 2 92.6 55 21.7 98.2 92.6 J: TVS, MI5 20 148 52 7 74.1 74 27.8 95.5 74.1 K: Symptoms positive and MI4 5 196 4 22 18.5 98 55.6 89.9 18.5 L: Symptoms positive and MI5 5 197 3 22 18.5 98.5 62.5 90 18.5 M: Symptoms positive or MI4 25 99 101 2 92.6 49.5 19.8 98 92.6 N: Symptoms positive or MI5 20 136 64 7 74.1 68 23.8 95.1 74.1 TP indicates true positive; TN, true negative; FP, false positive; FN, false negative; PPV, positive predictive value (TP/[TPþFP]); NPV, negative predictive value (TN/[TNþFN]); ACC, accuracy ([TPþTN]/[TPþTNþFPþFN]); TVS, transvaginal sonography; MI, morphology index. * Pair-wise chi-square tests were not significant at P < .05 (PAH¼.03311; PBI¼.3593; PCJ¼.5795; PDK¼.1053; PEL¼.2228; PFM¼.6299; PGN¼.8399). performance of TVS for ovarian cancer screening is sum- formation with TVS MI scores (Table 8, Groups D and marized in terms of the large group published results F) indicated that, when both TVS and symptoms crite- (Table 8, Group I) and subsequent accrual results up to ria are met, there is a reduction of FP results. This June 2008 (Table 8, Group J), both of which indicate reduction, although it improves the positive predictive better sensitivities and specificities than those produced value (PPV), is at the expense of identifying TPs so that by the symptoms index alone (Table 8, Group A) or in sensitivity suffers (Table 8). When either TVS or symp- combination with TVS (Table 8, Groups D and E). toms criteria are met (Table 8, Groups F and G), sensi- Finally, decision making was examined on the basis of tivity is affected positively, but specificity is meeting either symptoms or TVS criteria in parallel. In compromised, with the ability to distinguish malignant this context, TVS results for scores of both MI scores from benign (PPV) becoming similar to that of TVS 4 and MI scores 5 were explored to broadly test the alone (Table 8, Groups B and C). information added by symptoms. Only a minor increase Finally, when age was overlaid on these considera- in sensitivity occurred when either TVS MI scores 4 tions, the symptoms index alone appeared to have more or the symptoms index was applied in parallel (Table 8, sensitivity in premenopausal women (Table 9, Group A) Group F vs Group B). Similarly, a minor increase in but less specificity than in postmenopausal women (Table sensitivity occurred when either TVS MI scores 5 or 9, Group H). Conversely, TVS alone appeared to have the symptoms index was applied in parallel (Table 8, more sensitivity in postmenopausal women (Table 9, Group G vs Group C) with concomitant decreases in Groups I and J) with slightly higher specificity in premeno- specificity when symptoms were applied with TVS MI pausal women (Table 9, Groups B and C). When combin- scores 4 or 5. To explain how these findings were ing symptoms criteria in series with TVS, specificity made, 1 TP entered Group F (Table 8) because of a pos- appeared high in both premenopausal women (Table 9, itive symptoms index, and that case was not included in Groups D and E) and postmenopausal women (Table 9, Group B (Table 8) because of an MI score
Original Article group (Table 9, Groups F and G vs Groups M and N). >97%). Second, adding in parallel symptoms informa- However, differences between premenopausal and post- tion with equal weight to that of ultrasound (either/or) menopausal women were only apparent, because none only slightly improved the discrimination of malignancy were significant (P < .05; chi-square test). (1 additional TP with a sensitivity increase of þ3.3%). These results strongly indicate that the major screening benefit in discriminating malignancy is achieved with DISCUSSION ultrasound findings, whereas symptoms information can Our experience from nearly 20 years of screening has posi- aid in reducing surgery for women who have benign con- tioned us to reach certain conclusions regarding the bene- ditions that generate an ultrasound abnormality. fits and limitations of TVS as a screening method for A distinction between the data reported here and ovarian cancer, especially in relation to safety, patient those reported by Goff et al6 is that they differ in composi- acceptability, time-efficiency, costs, and interobserver var- tion: the Goff et al study was composed of 37% early stage iation.30-32 In this regard, we believed that it would be disease versus 75% in the current study. This variation in beneficial to determine how symptoms information could stage distribution alone could account for differences improve TVS screening for ovarian cancer. It is notewor- between the reports in sensitivity and specificity, particu- thy that, as the basis for this study, we were in a position larly if symptoms are more apparent in late-stage ovarian to draw on a group of patients all of whom had a sufficient cancer. We have used exclusion based on reported patient TVS abnormality that led to surgery and histologic evalu- uncertainty to constrain recall bias, whereas Goff et al6 ation of the ovarian tumor removed. used symptoms queries limited to the present. Even more For a screening test to be effective, sensitivity, speci- significant is the overall difference in the selection of the ficity, PPV, and negative predictive value should be study group. The study group that we used in the current high.33 With periodic use, screening should decrease stage report was composed of women from the general popula- at detection and should increase disease-specific survival tion who were actively participating in the University of in the screened population.34 TVS may be questioned as a Kentucky Ovarian Screening Program. In contrast, Goff screening method for ovarian cancer because of its moder- et al6 constructed a study group that surveyed 149 women ate sensitivity and relatively low PPV.18 Although TVS with ovarian cancer (55 early stage, 88 late-stage, 6 screening accurately predicted the presence of cancer unknown stage), 225 high-risk women who were enrolled involving the ovary in 62 asymptomatic women, there in an ovarian cancer early detection study, and 233 were 11 patients who developed ovarian cancer within 12 women with conditions indicating that they should months of a normal scan (Table 8, Group J, FN column), receive an ultrasound. It is entirely possible that this con- so that the addition of information from a symptoms index struction does not mirror the same cross-section of would have the potential of reducing both the FN and the women with and without symptoms that we observed in FP screening results. Such a reduction would have its great- our large screening effort in the general population. In est impact by increasing the PPV so that screening would addition, Daly and Ozols35 have presented a commentary better differentiate malignant from benign lesions. The on issues that may involve limitations of the control group results reported herein when both TVS and symptoms cri- used by Goff et al.6 An additional consideration is pre- teria are met, indicate a reduction of FP results. This reduc- sented in the 2008 report by Anderson et al that combines tion, although it improves the PPV, is at the expense of the Goff symptoms index with CA 125 to improve detec- identifying TP results, so that sensitivity suffers (Table 8). tion of ovarian cancer.36 In that report,36 sensitivity and When either TVS or symptoms criteria are met, sensitivity specificity of the symptoms index alone were considerably is affected positively, but specificity is compromised, and higher than first reported.6 The ranges reported initially the ability to distinguish malignant from benign (PPV) were 33% to 47% sensitivity and 61% to 75% specific- becomes similar to that of TVS alone (Table 8). ity6; whereas, in the recent report,36 the ranges were The clinical significance of the results reported here 52.1% to 74.8% for sensitivity and 83.6% to 91.9% for is that a screen that is negative by both ultrasound and the specificity. These differences raise several questions. First, symptoms index is highly likely to be benign (specificity, because the sample for the recent study is described as a 3696 Cancer August 15, 2009
Symptoms & TVS Screening for Ovarian CA/Pavlik et al subgroup of the previous study, indicating that different 6 malignancies in 27 women who were positive for in- study groups were not used, what accounts for the formative symptoms (22%) and 24 women with malig- increased sensitivity and specificity in the recent report? nancies that were absent informative symptoms among 30 Second, if the subgroup that was reported36 differed women who had malignancies (80%). It is possible that the intrinsically from the originating group from which it was informative symptoms coincided only serendipitously with drawn,6 does this indicate that there will be considerable malignancy; however, this can be neither substantiated nor variability in the results obtained with the symptoms refuted by the current study. Moreover, in the study index? Although there are differences in study design, our reported here, all women had a successful TVS. Performing findings indicate that the evaluation of informative symp- a definitive TVS should be the first line of response when toms as identified by Goff and collaborators6 can be useful encountering a patient who has informative symptoms. in helping distinguish benign disease when used in con- Unfortunately, TVS may not be reliable in clinical settings junction with TVS. Symptoms information cannot be if a large uterus is present or if the patient’s BMI is exces- ignored and is an important element in communication sive. In these cases, thorough assessment of a patient with between patient and physician. The efforts of Goff et al6 informative symptoms using other diagnostic testing is have done much to erode the idea that ovarian cancer has essential. It is important to educate patients that informa- no symptoms until it is advanced. In this regard, efforts tive symptoms should not be ignored and that the degree still are needed to define how this information frequents to which symptoms are a resultant indicator of early stage and affects the general population. ovarian malignancy has yet to be determined. With regard to the clinical outlook, the following In the absence of early detection, most patients with considerations are evident: 1) TVS alone performs better ovarian cancer will continue to present with advanced- than informative symptoms evaluation in identifying stage disease in which the cost of treatment is high and the malignancy, whereas informative symptoms evaluation in cure rate is low.40 We intend to continue to use symptoms concert with TVS improves sensitivity only modestly. 2) analysis prospectively in combination with TVS to evalu- The frequency of informative symptoms cannot be dis- ate more fully the performance of screening for ovarian missed because, even when they are present in only 20% cancer. As specific biomarkers are added to TVS ovarian of ovarian malignancies, the risk for these patients is screening protocols that include symptoms analysis, the higher than that for patients without informative symp- protection afforded by annual screening from ovarian can- toms. 3) The evaluation of informative symptoms is facile, cer mortality should improve. in that it is simple, quick, and inexpensive. Taken together, these considerations point in a clear Conflict of Interest Disclosures clinical direction. TVS performs with credibility in Supported by grants from the Telford Foundation, and the women who are both symptom-free and in those who Department of Health and Human Services, Commonwealth of Kentucky. report symptoms. By itself, TVS performs as well as or better than mammography,37 mammography-assisted by References sonography,38 or mammography combined with mag- netic resonance imaging,39 even with regard to PPV. In 1. Jemal A, Murray T, Ward E, et al. Cancer statistics 2005. CA Cancer J Clin. 2005;55:10-30. this sense, TVS lives up to the performance standards 2. Jemal A, Siegel R, Ward W, et al. Cancer statistics, 2008. used and accepted for screening. In a population in which CA Cancer J Clin. 2008;58:71-96. informative symptoms accompany malignancy in 20% of 3. Ueland FR, DePriest P, DeSimone C, et al. The accuracy of cases, informative symptoms evaluation in parallel equally examination under anesthesia and transvaginal sonography in weighted with TVS (Table 8, Groups F and G) carries a evaluating ovarian size. Gynecol Oncol. 2005;99:400-403. 3% to 4% improvement in detection (sensitivity). How- 4. Goff BA, Mandell L, Muntz HG, Melancon CH. Ovarian carcinoma diagnosis. Results of a national ovarian cancer ever, for the additional positive case that is detected, the survey. Cancer. 2000;89:2068-2075. data reported here indicate that from 4 to 14 additional 5. Goff BA, Mandelm LS, Melacon CH, Muntz H. Fre- women without malignancy will undergo surgery. The in- quency of symptoms of ovarian cancer in women present- formation that needs to be emphasized is that we observed ing to primary care clinics. JAMA. 2004;291:2705-2712. Cancer August 15, 2009 3697
Original Article 6. Goff BA, Mandel LS, Drescher CW, et al. Development of 22. DePriest PD, Shenson D, Fried A, et al. A morphology an ovarian cancer symptom index. Cancer. 2007;109:221-227. index based on sonographic findings in ovarian cancer. 7. Young RC, Walton LA, Ellenberg SS. Adjuvant therapy in Gynecol Oncol. 1993;51:7-11. stage I and II epithelial ovarian cancer: results of 2 prospec- 23. Bailey CL, Ueland FR, Land GL, et al. Malignant potential tive randomized trials. N Engl J Med. 1990;322:1021-1027. of small cystic ovarian tumors in postmenopausal women. 8. Ries LA, Kosary CL, Hankey BF, et al, eds. SEER Cancer Gynecol Oncol. 1998;69:3-7. Statistics Review, 1973-1995. Bethesda, Md: National Can- 24. Modesitt SC, Pavlik EJ, Ueland FR, et al. Risk of malig- cer Institute; 1998. nancy in unilocular ovarian cystic tumors less than 10 cen- 9. Campbell S, Bhan V, Royston P, et al. Transabdominal timeters in diameter. Obstet Gynecol. 2003;102:594-599. ultrasound screening for early ovarian cancer. BMJ. 25. Zweig MH, Campbell G. Receiver-operating characteristic 1989;299:1363-1367. (ROC) plots: a fundamental evaluation tool in clinical 10. van Nagell JR, Higgins RV, Donaldson ES, et al. Transva- medicine. Clin Chem. 1993;39:561-577. ginal sonography as a screening method for ovarian cancer: 26. Pepe MS. The Statistical Evaluation of Medical Tests for a report of the first 1000 cases screened. Cancer. Classification and Prediction. New York, NY: Oxford Uni- 1990;65:573-577. versity Press; 2003. 11. Jacobs IJ, Bridges J, Reynolds C, et al. Multimodal approach 27. Fawcett T. An introduction to ROC analysis. Pattern Recog- to screening for ovarian cancer. Lancet. 1988;2:268-271. nit Lett. 2006;27:861-874. 12. Bourne TH, Campbell S, Reynolds KM, et al. Screening 28. Suojanen JN. False false positive rates [letter]. N Engl J for early familial ovarian cancer with transvaginal ultraso- Med. 1999;341:131. nography and colour blood flow imaging. BMJ. 29. [Noauthors listed]Correction: false false positive rates [let- 1993;306:1025-1029. ter]. N Engl J Med. 1999;341:624. 13. DePriest PD, Gallion HH, Pavlik EJ, Kryscio RK, van 30. Pavlik EJ, van Nagell JR, DePriest PD, et al. Participation Nagell JR. Transvaginal sonography as a screening method in transvaginal ovarian cancer screening: compliance, corre- for the detection of early ovarian cancer. Gynecol Oncol. lation factors, and costs. Gynecol Oncol. 1995;57:395-400. 1997;65:408-414. 31. Pavlik EJ, Johnson TL, DePriest PD, Andrykowski MA, 14. Jacobs I, Skates SJ, MacDonald N, Menon U, Rosenthal A, Kryscio RJ, van Nagell JR. Continuing participation sup- Prys Davies A. Outcome of a pilot randomised controlled ports ultrasound screening for ovarian cancer. Ultrasound trial of ovarian cancer screening. Lancet. 1999;253:1207- Obstet Gynecol. 2000;15:354-364. 1210. 32. Higgins RV, van Nagell JR, Woods CH, Thompson EA, 15. Sato S, Yokoyama Y, Sakamoto T, et al. Usefulness of mass Kryscio RJ. Interobserver variation in ovarian measurements screening for ovarian carcinoma using transvaginal ultraso- using transvaginal sonography. Gynecol Oncol. 1990;39:69-71. nography. Cancer. 2000;89:582-588. 33. Prorok PC. Evaluation of screening program for the early 16. van Nagell JR, DePriest P, Reedy M, et al. The efficacy of detection of cancer. Statistical methods for cancer studies transvaginal sonographic screening in asymptomatic women (edited by RG Cornell). Natl Cancer Inst Stat Textbk at risk for ovarian cancer. Gynecol Oncol. 2000;77:350-356. Monogr. 1984;51:267-328. 17. Buys SS, Partridge E, Greene MH, et al. Ovarian cancer 34. Hulka BS. Cancer screening: degrees of proof and practical screening in the Prostate, Lung, Colorectal and Ovarian application. Cancer. 1989;62:1776-1789. (PLCO) Cancer Screening Trial: findings from the initial 35. Daly MB, Ozols RF. Symptoms of ovarian cancer—where screen of a randomized trial. Am J Obstet Gynecol. 2005;193:1630-1639. to set the bar? JAMA. 2004;291:2755-2756. 18. van Nagell JR, DePriest PD, Ueland FR, et al. Ovarian 36. Anderson MR, Goff BA, Lowe KA, et al. Combining a cancer screening with annual transvaginal sonography. symptoms index with CA 125 to improve detection of Findings of 25,000 women screened. Cancer. 2007; ovarian cancer. Cancer. 2008;113:484-489. 109:1887-1896. 37. Humphrey LL, Hefland M, Chan BKS, Woolf SH. Breast can- 19. Pavlik EJ, DePriest PD, Gallion HH, et al. Ovarian volume cer screening: a summary of the evidence for the US Preventive related to age. Gynecol Oncol. 2000;77:410-412. Services Task Force. Ann Intern Med. 2002; 137:347-360. 20. Ueland F, DePriest P, Pavlik E, Kryscio R, van Nagell JR. 38. Elmore JG, Armstrong K, Lehman CD, Fletcher SW. Preoperative differentiation of malignant from benign ovar- Screening for breast cancer. JAMA. 2005;293:1245-1256. ian tumors: the efficacy of morphology indexing and Dopp- 39. Lehman CD, Gatsonis C, Kuhl CK, et al. MRI evaluation ler flow sonography. Gynecol Oncol. 2003;91:46-50. of the contralateral breast in women with recently diag- 21. Sassone M, Timor-Tritsch I, Artner A, Westhoff C, Warren nosed breast cancer. N Engl J Med. 2007;356:1295-1303. WB. Transvaginal sonographic characterization of ovarian 40. Cooper AL, Nelson DF, Doran S, et al. Long-term survival disease: evaluation of a new scoring system to predict ovar- and cost of treatment in patients with stage IIIC epithelial ian malignancy. Obstet Gynecol. 1991;78:70-76. ovarian cancer. Current Women’s Health Reviews. 2009;5:44-50. 3698 Cancer August 15, 2009
You can also read