Until recently, there were no medications available with an FDA approved indication for the treatment of OCD. The first medication to receive such an indication, clomipramine (Anafranil) (CMI), was initially introduced in Europe as an antidepressant in 1966. Early reports of the efficacy of CMI in OCD and accumulating evidence in support of the serotonergic hypothesis on the etiology of OCD (
Flament et al. 1987;
Goodman et al. 1989a) led to several other serotonin reuptake inhibitors (SRIs) being tried as a treatment for this disorder. There is now empirical evidence from multi-center, double-blind, placebo-controlled trials supporting the efficacy of clomipramine (CMI) (clomipramine
clomipramine Collaborative Study Group 1991), fluoxetine (Prozac) (FLX) (
Tollefson et al. 1994), sertraline (Zoloft) (SER) (
Greist et al. 1995a), paroxetine (Paxil) (PAR) (
Wheadon et al. 1993), and fluvoxamine (FLV) (Luvox) (
Rasmussen et al. 1998) in the treatment of OCD. The FDA granted an indication for the treatment of OCD for CMI in 1989, FLX in 1990, FLV in 1994, and SER and PAR in 1996.
Treatment of choice: drugs, behavior therapy, or both?
In spite of the success of SRIs, a substantial proportion of OCD patients (roughly 30%) remain clinically unchanged after an adequate trial (
Rasmussen et al. 1993). Still others (8–15%) discontinue treatment due to side effects (
Greist et al. 1995b). Many treatment responders still have significant levels of symptomatology after treatment. For example, in the multi-center CMI study, mean scores on the Yale-Brown Obsessive Compulsive Scale (YBOCS) (
Goodman et al. 1989b) dropped from 26 to 15 after 10 weeks of treatment, which is in the mild, but still clinical range of severity (
Clomipramine Collaborative Study Group 1991). In addition, relapse rates following discontinuation of SRIs are high, i.e., in one study 89% of patients treated with CMI relapsed within 7 weeks after drug discontinuation even after 1 year of therapy at adequate dosages (
Pato et al. 1988); in another, patients treated successfully with PAR relapsed an average of 62.9 days following placebo substitution (
Steiner et al. 1995).
For behavior therapy, there are also problems to consider. Exposure therapy may involve considerable discomfort as patients expose themselves to anxiety-provoking situations while at the same time refraining from performing the rituals that would reduce their anxiety. Estimates on the percent of patients who complete ERP and are helped by it range from 67% to 90% (
Hafner et al. 1981;
Foa et al. 1992), and estimates on drop-out rates range from 20% to 25% (
Rachman and Hodgson 1980;
Hafner et al. 1981;
McDonald et al. 1988). Thus, this method may help only 50% of patients with OCD (
Hafner et al. 1981). Exposure therapy may be less successful for patients with obsessions alone (
Marks 1981), although more recent results based on specific identification of mental rituals are better (
Salkovskis and Westbrook 1989). In addition, patients with high levels of co-morbid depression do not appear to habituate during treatment (
Marks 1981). Finally, approximately 25% of patients refuse behavioral treatment (
Foa et al. 1983;
McDonald et al. 1988), making the overall percentage of patients helped even lower (
Baer and Minichiello 1990).
Given the advantages and disadvantages of each treatment, information is needed on their relative efficacy in order to guide treatment decisions. Several meta-analyses have been published comparing the relative efficacy of behavior therapy with pharmacologic interventions in OCD.
Christensen et al. (1987) conducted a meta-analysis comparing several treatment approaches, including several types of psychosurgery. Using combined observer and self-ratings, the effect size for ERP was 2.34, compared with 1.41 for tricyclic medication, 1.12 for psychosurgery, and 0.24 for placebo. Results provide some estimate of the relative efficacy of SRIs and ERP, although interpretations are tentative, due to the combining of the effect size for CMI, the only available SRI at the time, with the effect sizes of other non-SRI tricyclics, and the omission of data on the other four currently available SRIs (SER, FLX, FLV, PAR).
A second meta-analysis was published by
van Balkom and colleagues (1994). Effect size was reported separately for self-rated and assessor-rated outcome measures. On self-ratings, behavior therapy was significantly more effective than serotonergic antidepressants, and combined treatment was significantly more effective than antidepressants alone. However, on assessor ratings, no significant differences were found between the three treatment conditions.
While the van Balkom et al. study includes more recent information than the Christensen et al. study, it contains comparisons for only three of the five currently available SRIs (SER was not considered an SRI and thus was not included in the comparative analysis).
Abramowitz (1997) criticized the study on methodological grounds, i.e., all effect sizes were computed using pre- to post-test data, even for studies where a control group was used. Effect sizes of pre-to-post designs have been found to be significantly larger than other designs, due to the smaller variability that results from pooling the variance from the same patients.
Three meta-analyses examined the relative efficacy between the SRIs without comparison to ERP.
Stein and colleagues (1995) examined effect sizes separately for placebo-controlled (
n=12), all controlled (placebo-controlled and medication-controlled,
n=22), and all trials (controlled and uncontrolled,
n=28). Using ANOVA techniques, they found CMI had a significantly greater effect size (1.64) than FLX (0.51) in placebo-controlled trials, and significantly greater effect size (1.71) than FLX (1.08) and FLV (1.14) in all controlled trials. In all trials (controlled and uncontrolled), no significant difference was found between the SRIs; however, when this sample was restricted to studies with 50 or more subjects (CMI, FLX, and SER studies only), CMI had a significantly greater effect size than FLX (effect sizes not reported). While the study provides information on the relative efficacy of the SRIs, the use of ANOVA techniques to compare differences in effect sizes between classes has been criticized due to the violation of the assumption of homogeneity of variance (
Hedges and Olkin 1985;
Stevens 1986).
Hedges and Olkin (1985) note that due to differences in sample size, the error variance between studies can vary by a factor of 10 or 20, and that ANOVA does not provide information on whether the effect size estimates within each class are homogeneous.
Piccinelli and colleagues (1995) found CMI had a larger effect size (1.41) than FLX (0.57), FLV (0.57) or SER (0.52), which did not differ from each other. Their analyses were restricted to randomized, double-blind, medication- or placebo-controlled trials. Their analyses between the SRIs was also restricted to studies using the YBOCS or the National Institute of Mental Health Obsessive-Compulsive Rating Scale (NIMH-OC). A total of 12 studies was included in this analysis.
Greist and colleagues (1995b) conducted a meta-analysis of the four multi-center, placebo-controlled trials for CMI (
n=520), FLX (
n=355), FLV (
n=320), and SER (
n=325). Effect sizes were calculated by subtracting the endpoint drug change scores from the endpoint placebo change scores and dividing by the pooled change standard deviations. CMI had a significantly larger effect size (1.48) than FLX (0.69), FLV (0.50) or SER (0.35), which did not differ from each other. In addition to the large sample sizes, this study had the advantage of similar treatment designs and outcome measures, with several of the investigators involved in more than one trial.
Finally,
Abramowitz (1997) examined the effect sizes of the SRIs versus placebo (no statistical comparisons between individual SRIs were reported). He also examined the effect sizes of ERP compared to relaxation, cognitive therapy, and to its component parts (i.e., exposure alone and response prevention alone). ERP was not statistically compared to the SRIs. Studies were limited to randomized trials with multiple treatments or control groups. He also examined the relationship between effect size and the degree to which the study was unblinded, measured by the difference in the proportion of patients reporting side effects in drug versus placebo groups (“side effect contrast”). On clinician ratings, CMI had the largest effect size (1.31), followed by FLV (1.28), FLX (0.68) and SER (0.37). Interestingly, side effect contrast was significantly correlated with effect size [
r(21)=0.62,
P<0.01], with CMI having the largest side effect contrast [the correlation between effect size and side effect contrast for CMI was
r(10)=0.92,
P<0.001]. He speculates that part of the superiority of CMI may be attributable to the unblinding of the CMI patients due to side effects, an idea previously discussed by
Fisher and Greenberg (1993) in the context of the problems maintaining the blind in double-blind pharmaceutical trials. ERP was significantly better than relaxation, but not better than cognitive therapy or ERP’s individual components.
The purpose of the current study is to provide a comprehensive quantitative summary of the treatment literature on the relative efficacy of SRIs and ERP in the treatment of OCD. It differs from the previous studies in the following ways: 1) it provides the first comparative quantitative analysis of all the currently available SRIs and ERP for the treatment of OCD; 2) it is comprehensive, in that all studies (in both the published and unpublished literature) were included regardless of methodological features. This enabled an empirical examination of the relationship between methodological qualities and effect size; and 3) statistical procedures were employed to control for the differences in variability resulting from different sample sizes between studies, allowing for statistical comparison between treatment groups, and control of extraneous sources of variance (such as methodological differences between studies).
The study is designed to address the following questions:
1.
What is the relative efficacy between ERP and individual serotonergic antidepressants?
2.
What is the relative efficacy between ERP, serotonergic antidepressants as a whole, and the combination of ERP and serotonergic antidepressants (ERP/SRI)?
3.
To what extent do methodological variables (i.e., method of effect size calculation, random versus non-randomized assignment, use of control group, or publication source) impact effect size?