Clinical context
Bipolar disorders are chronic multidimensional conditions affecting about 3–6% of the population (
1–
3). The onset of affective episodes usually occurs during adolescence or the early adult years (
4,
5) and follows an irregular course in which acute abnormal mood states alternate with periods of full or partial remission. The affective dimension is the cardinal feature of bipolar disorders but is seldom the only expression of these complex brain disorders. In addition to the full syndromal episodes that define mood disorders, patients with bipolar disorders often experience functional impairment due to interepisode subsyndromal affective symptomatology (
2,
6), comorbid nonaffective psychopathology (
7–
16) (e.g., anxiety disorders, substance misuse, and cognitive impairment), and general medical conditions (
17–
22) (e.g., obesity, migraine headache, and inflammatory disorders).
Bipolar disorder ranks as the sixth leading cause of disability worldwide and is associated with increased mortality (
23–
25). The excess mortality associated with bipolar disorder is not limited to suicide. Mortality ratios comparing patients with bipolar disorders with the general population reveal elevated death rates due to a number of general medical conditions including heart disease, stroke, and cancer (
26,
27).
Diagnosis of bipolar disorder remains an area plagued by uncertainty, despite the often dramatic psychopathology obtained from patients with bipolar disorders. Field trials, however, show that the diagnostic criteria for mania are the most reliable of the DSM-IV diagnoses tested.
The complexity of symptomatology associated with bipolar disorder often leads to confusion and frustration, which undermine confidence in treatment decisions. A basic fund of knowledge related to bipolar disorder and DSM-IV nosology is presented in the following to facilitate the processes of clinical diagnosis and management of bipolar illness. After discussion of these issues, a process is offered to guide integration of clinical knowledge and evidence from clinical trials.
Understanding DSM-IV bipolar disorder nosology
Emil Kraepelin, the father of the modern mood disorder concept, recognized the inherent unreliability of basing psychiatric diagnosis on signs and symptoms. Kraepelin differentiated manic-depressive insanity from dementia and schizophrenia on the basis of longitudinal factors, early age of onset, and course of illness more than cross-sectional symptoms (
28). He found manic-depressive illness had a peak onset during the late adolescent and early adult years and observed that periods of illness were characterized by prominent mood symptoms. Kraepelin regarded the differences between patients in which the predominant mood state was euphoria, irritability, or depression as slight in comparison with their similarities, especially the course of illness, which was marked by periods of wellness with full restitution of function. Karl Leonhard attempted to classify psychiatric illness into two large groups on the basis of whether symptoms were consistently of one type or fluctuated between opposite presentations (
29). Of the 13 bipolar-type illnesses he proposed, only the distinction of monopolar versus bipolar mood disorders was subsequently validated. The DSM-III divided mood disorders into unipolar and bipolar largely on the basis of family studies, which showed more relatives with bipolar disorders in the families of bipolar probands than in the families of unipolar probands (
30,
31).
The DSM-III also introduced operational criteria for diagnosis of mania and depression, and the occurrence of a manic episode was used as the defining characteristic of bipolar disorder. The DSM-IV nosology for mood disorders provides criteria for classification at three conceptual levels (
32). At the first level, lifetime mood disorder diagnoses, DSM-IV criteria differentiate primary mood disorders into unipolar mood disorder and bipolar mood disorder and their subtypes largely on the basis of the history of diagnosed episodes. Thus, at the second level, criteria that define the discrete abnormal mood states that DSM-IV designates as depression, hypomania, mania, or mixed episodes (for a modified summary see Table 1) are the building blocks underlying diagnosis of lifetime disorder and subtype. The diagnostic feature distinguishing bipolar disorders from other mental disorders is the occurrence of abnormal mood elevation not attributable to substance abuse or a general medical condition. Only a single such episode is required. The occurrence of major depressive episodes and hypomanias comprises the bipolar II diagnosis. If a patient previously diagnosed with bipolar II has an episode meeting the full criteria for mania, the lifetime diagnosis changes to bipolar I. The DSM-IV does not include age of onset or course of illness as diagnostic criteria for mood disorders, and no diagnostic laboratory tests are available (Table 2).
Course specifiers, a third conceptual dimension, are not thought to represent separate pathological conditions; the criteria merely describe common patterns in the course of episodes within the defined DSM-IV mood disorders (Table 3). Rapid cycling requires the occurrence of four episodes or two complete cycles over a 12-month period. Post-partum onset refers to episodes within 4 weeks after childbirth. The seasonal pattern course specifier requires a temporal relationship between onset of episodes and time of year for the majority of episodes over the patient’s lifetime and that no episodes occurred outside the expected interval over the preceding 2 years.
Challenges to prompt accurate diagnosis of bipolar disorder
Patient survey results indicate the average patient with bipolar disorder does not receive a proper diagnosis for nearly a decade after the onset of his or her first episode (
33,
34). Why is this? Bipolar disorder is diagnosed by the occurrence of pathological mood elevation, but abnormal mood elevation seldom dominates the lifetime course of bipolar disorder, and the faculty required to perceive mood elevation may be impaired by the illness (
35).
Mood elevation is subject to poor recall and even when perceived tends to be reported as briefer than reports from observers. The distinction between bipolar I and bipolar II can be difficult. Significant impairment of social or occupational function or hospitalization, the main criteria separating mania and hypomania, may be subjective or reflect non-clinical circumstances such as insurance coverage or the availability of supports.
For most patients with bipolar disorder, the clinical picture is made complex because mood episodes are but one manifestation of a complex condition. The diagnosis and treatment of bipolar disorder represent a challenge for the clinician, because the illness is irregular, comorbidity is common, reporting is done through the prism of the current mood state, and DSM-IV criteria for the illness parse mental illness based on current and past signs and symptoms of depression and mania without consideration of other dimensions of illness such as longitudinal factors and family history (Table 4).
Diagnostic assessment for patients with possible abnormal mood elevation can be greatly simplified by systematic evaluation procedures (Table 5). The Systematic Treatment Enhancement Program for Bipolar Disorder (STEP-BD) was a National Institutes of Mental Health-sponsored collaboration in which treatment effectiveness was studied in 26 centers across the United States. To better assess the effectiveness of treatments, the STEP-BD treatment centers adopted a common disease management program that consisted of standardized assessment techniques and the collaborative care approach to treatment described below. The systematic record-keeping procedures used in STEP-BD were designed for use in routine clinical practice. The STEP-BD Affective Disorder Evaluation (available at
http://www.manicdepressive.org) provides a modular template for evaluating patients, which increases diagnostic confidence by assessing longitudinal factors characteristics of the Kraepelinian conception of manic-depressive illness as well as DSM-IV bipolar disorder.
Every method of differential diagnosis for mood disorders requires characterization of the most extreme episode of mood elevation. The STEP-BD Affective Disorder Evaluation explicitly evaluates whether the most severe identifiable episode meets the criteria for mania, and practicing clinicians focusing their examination in a similar manner can determine whether patients meet the criteria for bipolar I disorder. Most patients (even those who are currently mildly to moderately ill) are able to report sufficient symptoms for the clinician to make a confident diagnosis. Collateral sources aid in the evaluation process and should be included whenever possible. Screening forms such as the Mood Disorder Questionnaire (
36) may help identify patients with bipolar disorders but are not a substitute for a thorough diagnostic evaluation. The Mood Disorder Questionnaire is a 15-item self-report inventory that can be completed and scored in less than 5 minutes.
Schema for systematic individualized care
Patients with bipolar disorder enter treatment in different phases of their illness and their presentation changes over time. The multiple dimensions of bipolar illness and the need to shift treatment priorities make it difficult to apply linear algorithms to the management of bipolar disorder.
Systematic treatment strategies, can, however, be applied to management of bipolar disorder. After diagnosis, clinical management can be conceptualized as an iterative process in which each follow-up assessment defines decision points for which interventions may be offered (
37). Figure 1 offers a general schema for managing iterative clinical care process. The three essential features of this schema are
•
defining the decision point,
•
choosing an intervention, and
The collaborative care disease management model used in STEP-BD provided a system of routine clinical measures in the form of an enhanced progress note and taught clinicians to engage patients as collaborators (
38).
STEP-BD added three key elements to the basic schema: 1) A menu of reasonable choices was used as a means of offering patients the range of interventions the clinician judged as appropriate, on the basis of his or her knowledge of the patient and knowledge of the evidence pertaining to the relevant clinical decision point. 2) Education was provided for patients and families in the form of a workbook and DVD. Such durable materials offered patients and their supports the opportunity to participate as informed collaborators in their own care. 3) Clinicians were taught simple negotiation skills. STEP-BD encouraged clinicians to use an adaptation of the paradigm of principled negotiation (
39), which aimed to increase the frequency of wise agreements between patient and care providers. The principled negotiation paradigm avoids the adversarial aspects of positional negotiation by applying external standards. In this paradigm, the ratings derived from structured patient self-reports and clinician progress notes provide standards by which the patient and clinician judge the need to revise the treatment plan. The clinician and patient actively engage in a therapeutic alliance to revise the treatment plan in accordance with measured outcomes.
Evidence: benchmarks and metrics
Implicit in the general consensus that the principles of evidence-based medicine provide the best guidance for clinical practice is the idea of offering proven treatments before unproven treatments. Use of this principle necessitates a working knowledge of medical evidence. Consumers of medical evidence can assess the clinical meaning of published studies by evaluating the quality of the evidence and by gauging the effect size of various interventions. Simple metrics are offered below to integrate these processes, basic to evidence-based medicine, into clinical practice.
Although there remain areas for which little or no high-quality data are available, the past decade produced strong growth in the knowledge base pertaining to clinical care of patients with bipolar disorder. The daunting task of transforming the accumulated evidence into useful guidance becomes more manageable and less error prone by limiting consideration to the highest quality studies. Results from studies with sufficient methodological rigor to allow valid causal inference, referred to here as category A evidence, represents the gold standard for evidence-based medicine. Category A evidence can be derived from randomized, double-blind, placebo-controlled studies with adequate sample size. Calculating sample size adequacy is a way of certifying that a study had adequate power to detect statistically significant differences, if, in fact, such differences were present. Formal power calculations can be complicated. A simple rule of thumb is, however, often sufficient to help clinicians judge the adequacy of sample size in mood disorder treatment studies. Clinical trials with fewer than 80 subjects are unlikely to meet the criteria as category A evidence.
This simple benchmark establishes lower bounds on the range of studies we include as having high-quality evidence. Another metric, effect size, is needed, because the convention of declaring results to be statistically significant, if the probability that the differences observed could be produced by chance alone is less than 5%, does not inform us whether an intervention is sufficiently robust to be clinically meaningful (
40,
41). Among the methods available to calculate effect size (
42), the easiest to use is the number needed to treat (NNT). Calculation of this metric only requires taking the reciprocal of the difference in response rates between treatment groups [NNT = 1/(Response rate
active − Response rate
placebo)]. Thus, if the response rates reported from a study were active treatment = 50% versus placebo = 30%, the reciprocal of 20% = 5. This tells us that to produce one additional positive response beyond what would be produced by placebo treatment, it is necessary to treat five patients.
As seen in Table 6, category A evidence is available to support the use of lithium, valproate, carbamazepine, lamotrigine, haloperidol, olanzapine, risperidone, quetiapine, ziprasidone, and aripiprazole. Tables 6 through 9 apply NNT analysis to high-quality studies for acute mania, bipolar depression, and maintenance.
Acute mania: summary of category a data
The category A studies for acute mania (Table 6) demonstrate the efficacy of six dopamine-blocking agents (olanzapine, ziprasidone, risperidone, haloperidol, quetiapine, and aripiprazole) and three non-dopamine-blocking agents (lithium, valproate, and carbamazepine).
This body of evidence offers important clinical guidance that may not be obvious from a review of the individual studies as follows.
1.
Patients receiving monotherapy are likely to remain acutely manic despite 3 weeks of treatment. After 3 weeks of treatment under the controlled conditions of a randomized, controlled trial, the mean mania rating scale score for subjects receiving any one of the proven antimanic agents exceeds the minimum required for study entry at baseline. This finding both highlights the need for sustained treatment and provides a rationale for combination treatment.
2.
Although there are undoubtedly individual differences in response to antimanic agents, the preponderance of accumulated evidence does not indicate important differences in overall efficacy. Nearly all direct comparisons between active agents yield no statistically significant differences in overall antimanic efficacy [lithium versus chlorpromazine (
43), haloperidol versus risperidone (
44), olanzapine versus divalproex (
45), olanzapine versus haloperidol (
46), aripiprazole versus haloperidol (
47), quetiapine versus lithium (
48), or quetiapine versus haloperidol (
49)]. Two exceptions to this pattern are noteworthy. Tohen et al. (
50) found olanzapine to have a small, but statistically significant, efficacy advantage over divalproex. This advantage was, however, at least partially offset by disadvantages in tolerability. Conversely, the comparison of aripiprazole and haloperidol reported by Vieta et al. (
47) showed no difference in efficacy, but a significant advantage for aripiprazole in overall effectiveness because of its greater tolerability.
3.
NNT analyses show that it is necessary to treat three to six patients with an antimanic agent to yield one additional responsive patient. The desire to compare results across studies by computing effect size is understandable, but making comparisons of the NNT across studies is of questionable validity. An NNT analysis does correct results for placebo response but does not overcome the methodological limitations that prevent drawing conclusions based on comparisons of treatments not available within the same randomized study. Comparing outcomes across placebo-controlled monotherapy mania studies is confounded by differences in study samples as well as study procedures. For instance, the antimanic efficacy of risperidone appears twice as robust in a study reported from a sample accessioned in India (
51) than in a sample of study sites in the United States (
52) that used nearly the same treatment protocol.
4.
Category A studies suggest that addition of a dopamine-blocking antimanic agent confers about the same increment of extra benefit over placebo whether it is administered as an adjunct to valproate or lithium or used as a monotherapy (
53,
54). Valproate was also superior to placebo as an adjunct to antipsychotic treatment (
55). The data are as yet insufficient to prove that two agents are superior to monotherapy, because the advantage of adding a second active agent has only been demonstrated in patient samples that included patients with inadequate response to prior treatment. Nonetheless, combination treatment is a reasonable approach for more severely ill patients because the preponderance of evidence from these studies shows that dropout rates are lower among subjects receiving two active treatments than among those receiving placebo and one active treatment (
56,
57).
In addition, placebo-controlled adjunct studies have established the efficacy of adding valproate to dopamine-blocking agents and the efficacy of adding risperidone, haloperidol, olanzapine, or quetiapine to the non-dopamine-blocking agents lithium or valproate. Category A placebo-controlled clinical trials comparing gabapentin, lamotrigine, topiramate, and oxcarbazepine with placebo have to date produced only negative results or failed studies.
Bipolar depression
Meta-analysis supports the adjunctive use of standard antidepressants as treatment for bipolar depression (
58); however, no standard antidepressants have shown efficacy in a category A study. Altshuler et al. (
59) found that only about 15% of patients with bipolar depression for whom an anti-depressant was prescribed met the criteria for treatment response. Category A evidence supports the use of lamotrigine (
60), olanzapine (
61), quetiapine (
62), and the combination of olanzapine and fluoxetine as treatment for acute bipolar depression (
61).
Maintenance treatment
Category A studies establishing the efficacy of lithium, lamotrigine (
63–
65), olanzapine (
66–
68), and aripiprazole (
69)for preventing recurrence of acute episodes all included randomly assigned patients who had experienced a remission of acute-phase symptoms during treatment with the study medication before randomization. This methodological issue has important clinical implications. The data from these successful maintenance studies cannot support the practice of switching from acute-phase treatments to a new maintenance treatment after resolution of an acute episode. Instead the data support continued treatment with these agents if they were part of a successful acute-phase regimen.
Questions and controversy
Controversy about bipolar disorders stems from the obscure nature of bipolar disorder itself and the unmet needs of many patients. Whether bipolar disorders are defined too narrowly or too broadly will only become clear when the pathophysiology processes underlying bipolar disorders are known. Research studies focused on the episodes of depression or mood elevation are of limited utility in helping us to grasp the nature of this lifelong multidimensional illness. Understanding the nature of heart disease led to treatments that prevent rather than treat myocardial infarction. Similarly, understanding the causes of bipolar disorder will lead to development of interventions that ameliorate the disease process itself rather than simply offer treatment for the abnormal mood episodes. It is possible that interventions that target signal transduction pathways, reduce stress, stabilize circadian rhythms, manage negative expressed emotion, or improve resilience will prevent or alter the course of bipolar disorder.
Treatment of bipolar depression remains the area of greatest unmet need, and the role of standard antidepressants may be the most controversial issue for clinicians treating bipolar disorder. The available evidence offers little support for the use of standard antidepressants, and it does not offer strong reasons to believe that standard antidepressants are generally harmful. Currently, venlafaxine is the only standard antidepressant for which a statistically significant increase in the risk of treatment-emergent affective switch has been demonstrated in a high-quality double-blind study (
70). It seems likely that questions about standard antidepressant treatment will persist until robustly effective treatments for bipolar depression become available.