Data collected in the United States and Western Europe demonstrate shortcomings in the primary care management of depression
(1–
3). Many patients with significant depressive disorders go unrecognized
(4), and recognition may not lead to effective treatment
(3,
5). Among patients initiating either pharmacotherapy or psychotherapy, few receive levels of treatment consistent with research evidence and expert guidelines
(3,
6). Overall, less than one-quarter of primary care patients with depressive disorders receive adequate acute-phase treatment
(1,
3,
5).
Among depressed primary care patients, nontreatment or undertreatment has been associated with specific patient, physician, and health system characteristics. Patient characteristics include less severe depression
(7), male sex
(8,
9), and minority ethnicity
(10–
12). Associations between age and antidepressant treatment have been inconsistent
(8,
9). Rates of depression treatment also vary widely among physicians
(13–
15), with some evidence for higher rates among physicians with more thorough training or favorable attitudes toward treatment of mental disorders
(8,
16). Recognition and treatment of depression appear more frequent in primary care settings with greater continuity of care and more personal doctor-patient relationships
(8,
17). Financial barriers (absence of insurance coverage, higher out-of-pocket costs) are associated with lower rates of antidepressant treatment
(18) and specialty mental health utilization
(19,
20).
This report uses data from the Longitudinal Investigation of Depression Outcomes (LIDO) study to examine treatment received by depressed patients in six diverse primary care sites. The LIDO project extended previous work by collecting systematic data on use of specialty mental health care, dose of antidepressant treatment, and patient-reported barriers to treatment. We use these data to examine three questions: What proportion of primary care patients with current depressive disorder receives specific treatment (pharmacotherapy or psychotherapy)? What patient characteristics are associated with receipt of treatment? What barriers might account for nontreatment or undertreatment of depression?
Method
The LIDO project was a longitudinal study of depressive symptoms, quality of life, and health services use among primary care patients in Barcelona (Spain), Be’er Sheva (Israel), Melbourne (Australia), Porto Alegre (Brazil), St. Petersburg (Russia), and Seattle (United States). Methods are described in detail elsewhere
(21,
22) and are summarized here. At each site, investigators identified one or more primary care clinics considered typical of local primary health care delivery (e.g., not academic or teaching clinics, no specific focus on mental health care, generally representative in sociodemographic characteristics). As described in earlier publications
(21,
23), clinics varied widely in style of primary care practice, typical treatment of depression, and availability of specialty mental health care. Characteristics of participating clinics are summarized in
Table 1.
At each site, consecutive adult (age 18–75 years) visitors were invited to participate in a study of physical health, emotional health, and quality of life. After providing informed consent, participants completed a screening questionnaire that included the Center for Epidemiologic Studies Depression (CES-D) Scale
(24), the 12-Item Short-Form Health Survey Functional Status Questionnaire
(25), and questions regarding treatment for depression (e.g., “Any medications or therapy for depression?”) in the last 3 months. Patients were considered eligible for a second-stage diagnostic assessment on the basis of the following criteria: CES-D Scale score of 16 or greater, no depression treatment in the past 3 months, and no plans to move from the area during the next 12 months. In primary care samples, this CES-D Scale threshold has shown sensitivity of approximately 80% and specificity of approximately 75% in identifying current major depression
(26). Each site continued screening until approximately 200 patients were enrolled in the longitudinal study (described later in this section).
Table 1 displays the proportion of patients at each site with CES-D Scale scores ≥16 and the proportion of “screen-positive” patients excluded because of recent depression treatment. Across all sites, 18,456 patients completed screening and 4,662 were eligible for the second-stage diagnostic interview. All participants provided written informed consent for participation in the diagnostic assessment and longitudinal study.
The second-stage assessment included the depressive disorders module of the Composite International Diagnostic Interview
(27), the anxiety subscale extracted from the Hopkins Symptom Checklist 90 (SCL-90)
(28), a checklist of chronic physical conditions, a six-item yes/no questionnaire regarding perceived barriers to receiving health care, questions regarding frequency and quantity of alcohol use
(29), and other measures not described here. The Composite International Diagnostic Interview is a completely structured diagnostic assessment instrument developed for use in cross-national epidemiologic studies. Of the 4,662 eligible patients, 2,359 (51%) completed the diagnostic assessment and 49% either refused or failed to keep assessment appointments. Compared to those completing the baseline assessment, nonparticipants were slightly younger (mean of 40.5 years versus 41.6 years), more likely to be male (35% versus 31.4%), less educated (mean of 11.2 years versus 11.8 years), and less distressed as measured by the CES-D Scale score (mean of 24.7 versus 26.4) or the 12-Item Short-Form Health Survey mental component score (mean of 43.7 versus 42.6). While all of these comparisons were statistically significant (p<0.01 for all, t tests for continuous measures or chi-square tests for proportions), all were small in practical or clinical terms. Of those completing the baseline assessment, 1,193 patients (51%) met DSM-IV criterion A for current major depressive episode and were invited to enroll in the longitudinal study; 1,117 (94% of those eligible) agreed to do so.
For all patients enrolled, the treating primary care physician received a letter reporting that the research interview “indicated a probable diagnosis of depression,” but no specific treatments were recommended.
All participants in the longitudinal study were asked to complete reassessments after 6 weeks, 3 months, 9 months, and 12 months. The 3- and 9-month assessments included a detailed questionnaire regarding health service use during the prior 3 months. This Resource Use Questionnaire
(30) assessed names and doses of all medications and all use of specialty mental health services (outpatient visits, day treatment or day hospital, and inpatient services) for two 3-month periods (months 1 through 3 and months 7 through 9 after enrollment). Of the patients who entered the longitudinal study, 93% completed the 3-month assessment and 87% completed the 9-month assessment.
The minimum criteria for potentially effective antidepressant doses were based on the lower end of the dosing ranges recommended in the U.S. Agency for Health Care Policy and Research guidelines
(31) for treatment of depression in primary care (e.g., 75 mg/day for imipramine, 20 mg/day for fluoxetine). For drugs not included in the Agency for Health Care Policy and Research guidelines, equivalent doses were estimated by study investigators (
Appendix 1). For specialty mental health care, a “potentially effective” level of treatment was defined as three or more visits in a 3-month period, slightly below the lowest level of treatment found effective in primary care randomized trials
(32,
33).
Participants reporting any one of 12 chronic physical conditions were considered to have a chronic comorbid physical condition. Participants scoring 1.7 or higher on the SCL-90 anxiety scale were considered to have comorbid anxiety disorder. Participants reporting either binge drinking (more than 6 drinks on one occasion) at least once a month or a high level of regular alcohol use (more than 14 drinks per week for women, more than 21 drinks per week for men) were considered to have an “at risk” drinking pattern.
These analyses include data for all patients who completed the baseline, 3-month, and 9-month assessments. Comparisons of proportions across sites were made by using chi-square tests. Predictors of treatment within sites were examined by using separate logistic regression models for each site. A 5% threshold (two-sided) for statistical significance was used throughout. All analyses were performed with the SPSS software package (SPSS Inc., Chicago).
Results
The proportions of patients receiving various treatments at each site are shown in
Table 2. Rates of specialty mental health visits and pharmacotherapy varied widely, and the proportion exposed to any treatment ranged from 3% in St. Petersburg to 49% in Seattle. At no site did the proportion of patients who received potentially effective depression treatment exceed 40%. Antidepressant pharmacotherapy was more common than specialty mental health care in Barcelona and Seattle, while the opposite pattern was seen in Be’er Sheva and Melbourne. Because lack of treatment might be appropriate when depression resolves spontaneously, subgroup analyses were used to examine rates of treatment among those with persistent depressive symptoms (CES-D Scale score ≥16) after 9 months (N=345). In this subgroup, the proportion receiving any antidepressant treatment ranged from 37% in Seattle to 0% in St. Petersburg (χ
2=34.9, df=5, p<0.001), the proportion with any specialty mental health contact ranged from 31% in Melbourne to 4% in St. Petersburg (χ
2=37.6, df=5, p<0.001), and the proportion receiving any treatment ranged from 48% in Seattle to 4% in St. Petersburg (χ
2=37.2, df=5, p<0.001). Rates of benzodiazepine use were below 15% at all sites. Among those with an SCL-90 anxiety score ≥1.7, the proportion reporting any benzodiazepine use ranged from 25% in Barcelona to 0% in Porto Alegre, St. Petersburg, and Barcelona.
Table 3 displays patient-reported barriers to receiving treatment at each site. Concern about costs was the most common barrier and ranked first or second at every site. Concerns about adverse effects of medication were the second most common perceived barrier overall and were among the top three barriers at every site except St. Petersburg. Stigma-related barriers (perception of potential effects of treatment on job opportunities) were less common, with overall prevalence rates of 14%–20%. Financial barriers were reported most often in St. Petersburg, while concern about adverse medication effects was most common in Seattle. To simplify further analyses, data on barriers to treatment were collapsed into three categories: cost and travel barriers (includes “treatment is not affordable” and “treatment is a long distance away or inconvenient”), stigma-related barriers (includes “treatment is embarrassing,” “seeking treatment might compromise job opportunities,” and “family or friends discourage treatment”), and concerns about adverse effects of medication (includes “taking medication makes you feel unwell”).
Table 4 shows baseline characteristics of patients who did and did not receive antidepressant pharmacotherapy at a potentially effective dose. Of the 40 comparisons, seven were significant at the 5% level. Severity of depression was significantly associated with adequate antidepressant treatment at only one of the four sites examined. Patient-reported barriers were not consistently associated with use of antidepressant drugs within sites, and the few significant relationships were the opposite of those predicted (more self-reported barriers among those treated). If the threshold p value was adjusted to 0.00125 to account for multiple comparisons (Bonferroni correction), then none of the observed differences would be considered statistically significant.
Table 5 displays baseline characteristics of patients who did and did not make three or more visits to a specialty mental health provider. Of 40 comparisons, only three were statistically significant at the 5% level (approximately the number expected by chance). The mean CES-D Scale score was higher among treated patients at three sites, but this difference was not statistically significant at any site. If the threshold p value was adjusted to 0.00125 to account for multiple comparisons (Bonferroni correction), then none of the observed differences would be considered statistically significant.
Discussion
Among primary care patients with previously untreated depression, the proportion receiving a potentially effective treatment over the next 9 months varied widely across study sites—from 1% in St. Petersburg to 40% in Seattle. Within sites, demographic or clinical characteristics were relatively weak predictors of treatment. Judging by patients’ reports, out-of-pocket cost appears to be the most frequent barrier to receiving care.
Several limitations must be considered in interpreting these findings. First, study sites were selected for diversity (of patient populations and health care systems) and were not intended to represent any specific range of culture or health care delivery. Second, study clinics were selected to be representative of local rather than national primary health care. At best, our results generalize to cities or regions rather than countries. Third, our data regarding treatment received are limited in detail. We have no data regarding antidepressant use between the periods covered by the 3- and 9-month assessments and no data regarding the content of mental health visits (e.g., whether patients received a type of psychotherapy proven effective). Fourth, our data on barriers to care were drawn from general questions regarding medical treatment rather than questions on specific aspects of depression treatment. Fifth, we relied on patients’ recall of treatment rather than records or claims data. Previous research has found that recall of services over 3 months is generally good, but not perfect
(34). Sixth, those who declined to complete the screening or the baseline assessment might differ from those included in the sample. Finally, we would not draw any conclusions about effectiveness of treatment based on these data. Comparisons of treated and untreated patients using observational data are subject to significant bias given the bidirectional relationship between treatment and severity of illness (i.e., treatment can reduce severity of depression, but more severe depression may prompt more intensive treatment)
(35).
Our sample size (approximately 160 patients at each site, including fewer than 50 treated patients) allows only limited power to examine patient-level predictors of treatment. As shown in
Table 4 and
Table 5, modest differences (such as 5-point differences in CES-D Scale scores) did not reach the 5% level of statistical significance. While pooling data across sites would increase statistical power, we see two arguments against a pooled analysis. First, pooling would require questionable assumptions about homogeneity across sites. For example, age and sex (which might be presumed to have similar meaning across sites) have quite variable relationships to receipt of treatment, indicating that pooling is probably inappropriate. Second, the wide variation in treatment rates across sites would confound patient characteristics with site differences. For example, Seattle and Barcelona contributed the bulk of patients treated with antidepressants, while St. Petersburg and Be’er Sheva contributed almost none. A pooled analysis of the predictors of antidepressant use might tell us more about differences between primary care patients in Seattle and St. Petersburg (ecological association) than about patient characteristics predicting depression treatment (true association).
Some of the predictors (such as cost-related or stigma-related barriers) may operate primarily as site-level factors rather than as patient-level factors. While these characteristics were not significantly associated with the probability of receiving treatment within sites (
Table 4 and
Table 5), they may account for differences observed between sites. For example, cost barriers were reported most often in St. Petersburg and Porto Alegre, two sites with low rates of overall treatment and the lowest rates of specialty mental health care. As discussed earlier, our data do not allow us to definitively distinguish between ecological and true associations.
We should emphasize two important characteristics of our study sample. First, patients who reported current depression treatment at the screening assessment were excluded from the longitudinal study. As shown in
Table 1, the proportion of screen-positive patients already treated for depression was greater than 10% at all sites except Be’er Sheva. Second, primary care physicians were informed regarding each patient’s diagnosis of current depressive disorder. Therefore, our results indicate the proportion of “recognized” patients who receive treatment rather than the proportion of depressed patients normally recognized. The observed low treatment rates reflect the many points of potential failure between notifying physicians about screening results and patients’ eventual receipt of appropriate treatment: the physician may disagree with the screening result, the patient may not return, competing priorities may take precedence, the patient and physician may decide that no specific treatment is necessary, the patient may not accept the physician’s treatment recommendation, or the patient may not be able to afford recommended treatment. Our findings are consistent with those of several other studies
(36–
38) that simple feedback of depression screening results is not sufficient to ensure effective treatment or improved outcomes. Recognition of depression must be followed by affordable, appropriate, and sustained treatment.
In contrast to some previous studies
(7,
39), we found relatively low rates of benzodiazepine use among depressed primary care patients. This finding may reflect the particular characteristics of the study sites or a general trend toward decreasing use of benzodiazepines. Consistent with previous studies, rates of benzodiazepine use varied widely across sites
(7).
Given our generous criteria for potentially effective treatment, these rates should be considered upper bound estimates. For example, our dosing criteria reflected the lowest dose likely to be effective among primary care patients. Many patients require substantially higher doses. Similarly, the criterion of three or more specialty visits in 3 months represents a minimal level of care—without even considering the content of those visits. These generous criteria were intended to identify any potentially effective treatment (high sensitivity, low specificity) rather than treatment likely to be effective for most patients.
Data on barriers to treatment yielded a few unexpected findings. First, financial barriers were reported much more commonly than were barriers related to stigma. At all sites, concern about out-of-pocket costs was reported more often than any stigma-related concern. Second, concern about adverse effects of medication was the first or second most frequently reported barrier at all sites except St. Petersburg. Given low rates of treatment at many sites, it is likely that concerns about adverse effects of medication were based on preconception rather than experience. Third, rates of reported stigma-related barriers were highest in Seattle rather than in sites with lower overall treatment rates. This finding is remarkable, given that Seattle had the highest overall treatment rates and that stigmatization of depression or other mental disorders is often thought to be greater outside the United States and Western Europe. Fourth, cost was by far the most commonly cited barrier in St. Petersburg, the site with the lowest rate of depression treatment.
We should acknowledge that our data do not thoroughly examine stigma or cultural barriers to treatment. Doing so would require more detailed exploration of patients’ perceptions, attitudes, and explanatory models, as well as study of community residents who do not seek help in primary care.
Although study site was strongly associated with treatment, patient characteristics were relatively weak predictors. In general, higher CES-D Scale scores were associated with greater probability of using either type of treatment, but the relationship was statistically significant only in two of 12 comparisons. We also did not observe differential predictors—characteristics predicting receipt of pharmacotherapy rather than specialty mental health visits or the reverse. We cannot, of course, examine patient-level predictors of treatment at sites with very low treatment rates (e.g., we cannot examine the effect of cost barriers in St. Petersburg).
Our most striking finding was the 10-fold variation in probability of receiving treatment across study sites. Compared to this large variation, the variability within study sites was relatively small. Our data (
Table 4 and
Table 5) do not suggest that cross-site variability in patients’ demographic or clinical characteristics accounts for large cross-site differences in treatment rates. Instead, we must look to differences in health care systems to explain large variations in treatment. The WHO primary care survey suggested that cultural differences (how patients present to physicians, physicians’ attitudes regarding care of mental disorders) were important factors in the recognition and treatment of depression in primary care
(17,
40). Our data suggest that patients perceive cost barriers (out-of-pocket costs) and practical barriers (absolute availability of treatments, time or distance to treatment) as major obstacles to treatment. Attributing low rates of treatment primarily to cultural or attitudinal barriers may lead to nihilism about concrete solutions. Fortunately, cost barriers or practical barriers should be more amenable to specific policy actions (such as changes in reimbursement for mental health care or location of mental health services) than are cultural or attitudinal barriers.