Bipolar disorder is a highly heritable disorder that is characterized by recurrent episodes of mania and depression. The core features of mania are heightened incentive motivation and compulsiveness toward positive cues despite negative consequences, whereas lack of interest in positive events is characteristic for depression. These symptoms can be seen as an expression of motivational dysregulation underlying bipolar disorder.
Motivation is defined as the process of initiating, controlling, and maintaining behavior with the goal of maximizing pleasant outcomes (
1). Importantly, motivational regulation depends on intact reversal learning consisting of two processes: learning the stimulus-reinforcement contingencies and adapting behavior according to the changing stimulus-reinforcement contingencies. On a neural level, reversal learning is mediated by the medial orbitofrontal cortex associated with the representation of reward and the lateral orbitofrontal cortex known to suppress previously rewarded responses (
2) and to code the representation of punishment (
3). Furthermore, brain structures associated with the processing of positive and negative feedback, such as the amygdala, the anterior cingulate cortex, and the ventral striatum, are crucially involved in reversal learning (
4).
Increased activation of the orbitofrontal cortex has been repeatedly observed in manic, depressed, and euthymic patients with bipolar disorder during tasks examining both emotional and cognitive processing (
5,
6). Reversal learning studies have shown that manic, depressed, and euthymic patients with bipolar disorder commit more errors when reward contingencies change (
7–
11). Thus, increased orbitofrontal cortex activation and impaired reversal learning occur in bipolar disorder independent of the current mood state.
As intact reversal learning was shown to depend on the orbitofrontal cortex in healthy persons (
2), it is plausible that altered orbitofrontal activation represents the neural correlate of impaired reversal learning in bipolar disorder, but findings from neuroimaging studies in bipolar patients to date provide no clear evidence for this assumption. In euthymic bipolar patients, the number of errors during a behavioral reversal learning task was negatively correlated with activity in the orbitofrontal cortex and the striatum that was recorded in a separate word production task (
12). In contrast, no differences in neural activation were observed in a small sample of depressed patients with bipolar disorder when a reversal learning task was conducted during functional MRI (fMRI) (
13). Furthermore, impaired behavioral adaptation in response to changing reward contingencies as well as increased activation in the parietal and frontal but not the orbitofrontal brain regions during reversal learning has been observed in pediatric bipolar patients (
14).
In addition to being plausibly linked to the clinical symptoms of bipolar disorder, motivational dysregulation has been proposed as a potential endophenotype of bipolar disorder (
15). Endophenotypes were initially conceptualized to facilitate genetic analyses through the identification of simpler phenotypes that show higher penetrance (
16), but one of their most attractive aspects is that they nominate biological systems that contribute to the genetic risk architecture of mental illnesses, advancing pathophysiological understanding and potentially offering new therapeutic targets (
17). However, not only must endophenotypes be associated with a disease, be heritable, be cosegregated with the disease within families, and be state independent, but they should also occur in unaffected relatives at a higher rate than in the general population (
16).
In this study, we investigated whether the neural correlates of motivational dysregulation, assessed with a probabilistic reversal learning task (
18) during event-related fMRI in euthymic patients with bipolar I disorder and their unaffected first-degree relatives, could be an endophenotype of bipolar I disorder. We hypothesized that both bipolar patients and their relatives would show increased activation in the orbitofrontal cortex, especially during reversal of reward contingencies, and show altered activation in brain regions associated with feedback processing (the anterior cingulate cortex, the amygdala, and the striatum).
Method
Participants
We invited patients with bipolar I disorder who had already participated in epidemiological studies at the Central Institute of Mental Health in Manheim, Germany, or who had frequented local support groups and unaffected first-degree relatives of patients with bipolar I disorder to participate in the study. To recruit healthy comparison subjects' matching patients and relatives, we drew a large random sample from the registry office of the city of Mannheim and contacted these persons by mail. All persons interested in participating were screened by telephone. Exclusion criteria for all participants were age under 18, lifetime alcohol or drug abuse or dependence, history of a neurological disorder or head trauma with unconsciousness, common MRI exclusion criteria, and lack of fluency in German. Relatives and comparison subjects were also excluded if they fulfilled the criteria for any lifetime or current DSM-IV axis I mental disorder (
19) or took any psychotropic medication. Additional exclusion criteria for bipolar patients were having a diagnosis of rapid cycling or schizoaffective disorder and taking an antipsychotic medication known to influence reward processing (
20). After the screening, 22 patients with bipolar I disorder, 22 relatives, and 43 healthy comparison subjects without a family history of psychotic or affective disorders were eligible for the study (see Figure 1 in the data supplement that accompanies the online edition of this article).
Sample 1.
From the original sample, three bipolar patients and two healthy comparison subjects had to be excluded from data analysis because they exceeded the minimum movement thresholds during their fMRI scans (2 mm or 2°) or showed abnormal brain morphology. In the final sample of 19 bipolar patients and 19 comparison subjects, the two groups did not significantly differ in gender, age, years of education, marital status, current employment, intelligence, Hamilton Depression Rating Scale (HAM-D) score (
21), or consumption of caffeine, nicotine, and alcohol. Although the patients were euthymic, they scored significantly higher on subclinical scales of the Young Mania Rating Scale (
22) and the Beck Depression Inventory (BDI;
23). Two patients were diagnosed with lifetime but currently remitted panic disorder. Demographic and clinical characteristics are summarized in
Table 1.
Sample 2.
The relatives (N=22) did not differ significantly from comparison subjects (N=22) in gender, age, years of education, marital status, current employment, intelligence, or consumption of caffeine, nicotine, and alcohol. Current symptoms of depression and mania were comparable between groups (
Table 1). Nine relatives were siblings, and 13 were children of patients with bipolar I disorder. Eleven relatives were from simplex families (one case in the family), and 11 were from multiplex families (two or more cases in the family).
Procedure
The Structured Clinical Interview for DSM-IV Axis I Disorders (
25) was administered to all study participants and index cases of relatives to verify main diagnosis (for patients) and exclusion criteria (for relatives and comparison subjects). Residual mood symptoms were assessed with the Young Mania Rating Scale, the HAM-D, and the BDI. The cutoffs for euthymia were scores ≤4 on the Young Mania Rating Scale, ≤5 on the HAM-D, and ≤7 on the BDI. We ensured that patients had been euthymic for at least 2 months before testing.
We constructed a life chart with all of the patients to assess number of past depressive and manic episodes, age at illness onset, number of hospitalizations, age at first hospitalization, and time in remission. We also verified that the medication status of all patients had been stable during the past 6 months. We coded the dosage of each antidepressant and mood stabilizer (
25) and calculated the composite measure of total psychotropic medication load reflecting both dosage and variety of different medications taken. Additionally, we collected data on caffeine, nicotine, and alcohol consumption and estimated the IQ of all participants using the Culture Fair Intelligence Test (
26).
The ethics committee at the University of Heidelberg in Germany approved the study, and all participants gave written informed consent.
Reliability
For a subset of the sample (N=21), two clinicians (J.L., A.V.K.) evaluated participants with respect to current symptoms and diagnoses of mental disorders to determine interrater reliability. We had an interrater agreement of 100% for the main diagnosis of bipolar I disorder. For current symptoms, interrater reliability assessed by intraclass correlation was 0.87 for the HAM-D and 0.91 for the Young Mania Rating Scale.
Reversal Learning Task
The probabilistic reversal learning task has been described in detail elsewhere (
4,
18). Participants had to learn which of two playing cards was the “better” one based on positive and negative feedback in terms of winning or losing a small amount of money (€0.10–€1.00). Choice of the better card was followed by either positive or negative probabilistic feedback with a ratio of 8:2, whereas the choice of the other card always led to negative feedback. After five to eight consecutive choices of the better card, the other card became the better one (rule reversal) and participants had to adapt their choice of card in order to maximize their gain. For fMRI analyses, we focused on three events: 1) positive feedback (“win”), 2) negative feedback not leading to a change in the choice of card (“lose/no shift”), and 3) negative feedback leading to a change in the choice of card (“lose/shift”). In contrast to other studies (
5), we did not differentiate whether lose/shift and lose/no shift events occurred after a real rule reversal or after a probabilistic negative feedback following the correct choice of the better card, that is, because we believe that the same psychological process—adaptation of behavior in accordance with subjectively apparent changes in the environment—underlies both types of events as participants were blind to the experimental schedule.
In addition to these experimental trials, baseline trials were randomly interspersed. During baseline events, participants knew in advance which card to choose and received only a neutral feedback (“choice made”). The task was conducted in three separate functional runs with an average duration of 12 minutes. Details on timing parameters of the experiment, the types of trials, and statistical contrast are depicted in
Figure 1.
Functional Image Acquisition
All MRI sequences were performed on a 3-T whole body scanner (Magnetom Trio, Siemens Medical Solutions, Erlangen, Germany). We conducted one high-resolution T1-weighted three-dimensional MRI sequence (slice thickness=1.1 mm, field of view=256×240×176 mm3, matrix=256×240×160). For fMRI, we acquired 40 gradient-echo T2*-weighted slices (slice thickness=2.3 mm) per volume using a generalized autocalibrating partially parallel acquisition technique (acceleration factor 2) and the following parameters: TR=2,700 msec, flip angle=90°, TE=27 msec, field of view=220 mm2, matrix=96×96, slice gap=0.7 mm.
Data Analysis
Demographic characteristics, clinical data, and behavioral data were analyzed using Predictive Analytics SoftWare, version 18.0.0 (PASW Statistics 18) (
27). For the reversal learning task, the following behavioral outcome variables were calculated: average number and reaction times of the different events, number of scans, and average amount of money won. We used Student's t test (between patients and comparison subjects and between relatives and comparison subjects) and analysis of covariance (ANCOVA) with age as covariate (patients compared with relatives) to determine group differences.
Imaging data were preprocessed and analyzed according to standard procedures using SPM5 (
http://www.fil.ion.ucl.ac.uk/spm). Preprocessing involved realignment to the first scan of each run, slice-timing, coregistration, and normalization into Talairach and Tournoux stereotaxic space using Montreal Neurological Institute templates (
28). During normalization, the images were resampled every 3 mm using sinc interpolation and smoothed with a 9×9×9 mm Gaussian kernel to decrease spatial noise. Statistical analyses were carried out in the context of the general linear model. Blood-oxygen-level-dependent (BOLD) signal changes were modeled to the onset of the feedback presentation, and movement parameters calculated during realignment were included as parameters of no interest.
We computed the following contrasts: wins minus baseline to assess the main effect of reward, “lose/no shift” minus baseline to assess the main effect of punishment, and “lose/shift” minus baseline to assess rule reversal. The contrast of wins, lose/no shift events, and lose/shift events against the baseline was important, as patients and relatives might differ from comparison subjects in processing positive and negative feedback (win and lose/no shift events) and reversal learning (lose/shift events) (
2). Thus, contrasting against baseline was the only way to adequately assess the main effects of reward, punishment, and rule reversal.
For each contrast, the individual statistical parametric maps were entered in a second-level random-effects analysis to test for activation differences between bipolar patients and comparison subjects as well as between relatives and comparison subjects. We used an ANCOVA with money won, mania, and depression scores as nuisance variables to account for group differences between bipolar patients and comparison subjects in these variables. For comparison between relatives and comparison subjects, we used an ANCOVA with money won as the nuisance variable. We also compared bipolar patients and relatives using an ANCOVA with age as the covariate.
We used a region-of-interest approach (
2,
4,
18) to test our hypotheses, restricting analyses to five bilateral regions of interest: the medial and lateral orbitofrontal cortex, the amygdala, the anterior cingulate cortex, and the striatum (including the nucleus caudate, the nucleus accumbens, and the putamen), derived from the Wake Forest University PickAtlas v2.0 (
29). Hypotheses of altered activation in the five regions of interest were tested with a threshold of p<0.05, accounting for multiple comparisons by controlling for family-wise error rates. Additionally, we applied the Holm-Bonferroni method, a sequentially rejective version of the simple Bonferroni correction, which adjusts the p values that were already corrected for family-wise error rates within each region of interest according to the total number of regions of interest used in the analyses. This method resulted in increasing thresholds of p<0.010, p<0.013, p<0.017, p<0.025, and p<0.050 for the first to the fifth regions of interest in order to avoid false positive findings. For interpreting group differences, we extracted percent signal change for each event from a 3-mm
3 sphere centered on the peak voxel of the cluster showing significant group differences with MarsBaR (
30). To examine potential confounding effects of medication load, the mean contrast values were imported in PASW Statistics 18 and correlated with total medication load. We report only correlations that passed the threshold of p<0.006 (corresponding to p=0.05 after a Bonferroni correction for eight comparisons corresponding to the number of regions of interest, where group differences were observed in either sample).
Results
Behavioral Data
We did not observe significant differences in reaction times and frequency of any type of event during the reversal learning task between bipolar patients and comparison subjects, between relatives and comparison subjects, or between bipolar patients and relatives. However, relatives won significantly smaller amounts of money than comparison subjects, and bipolar patients tended to win less money than comparison subjects (
Table 2).
Functional Neuroimaging Data
Bipolar patients versus comparison subjects.
Relative to comparison subjects, euthymic bipolar patients showed significantly higher activation of the medial orbitofrontal cortex during reward and rule reversal but not during punishment. In addition, rule reversal was associated with higher activation of the right lateral orbitofrontal cortex, the right amygdala, the dorsal anterior cingulate cortex, and the putamen in euthymic bipolar patients relative to comparison subjects (see
Table 3 and
Figure 2). We observed a significant negative correlation between medication load and mean activation of the right amygdala in response to reward in patients with bipolar disorder (r=–0.46, p=0.002; see Figure 2 in the online data supplement). No other statistically significant correlations between medication load and BOLD response changes were observed.
Relatives versus comparison subjects.
The relatives of the bipolar I patients showed significantly higher activation of the medial orbitofrontal cortex in response to reward, punishment, and rule reversal relative to comparison subjects. Additionally, we observed significantly higher activation in the right amygdala in response to reward and rule reversal in the relatives compared with healthy comparison subjects (see
Table 3 and
Figure 2). Relatives who had passed the peak onset age (
31) did not differ significantly from the younger ones, and we observed no differences between relatives stemming from multiplex or simplex families.
Bipolar patients versus relatives.
In response to reward, we observed increased activation in the right amygdala in relatives compared with bipolar patients, whereas rule reversal was related to higher activation of the right lateral orbitofrontal cortex and the putamen in patients compared with relatives (
Table 3).
Discussion
To our knowledge, this is the first study to show dysfunctional activation in neural systems related to motivation and reward in euthymic bipolar I patients and unaffected first-degree relatives of bipolar I patients, and it identifies a potential neural intermediate phenotype of bipolar disorder. Specifically, increased activity of the medial orbitofrontal cortex in response to reward and rule reversal was found in both euthymic bipolar patients and their first-degree relatives compared with healthy comparison subjects. In addition, we observed increased activation in the right amygdala during rule reversal in both euthymic bipolar patients and their relatives compared with healthy comparison subjects. Such similar findings in medicated patients and unmedicated high-risk persons are of particular importance, because they suggest that the results are unlikely to be affected by medication. Interestingly, our data also suggest that increased amygdala activation in response to reward, which was observed in relatives only, may have been suppressed by psychotropic medication.
Previous studies suggest that the medial orbitofrontal cortex codes the expectation of reinforcement, which is acquired via input from the amygdala and is used to guide behavior (
32). In the present study, the amygdala and the medial orbitofrontal cortex showed greater neural activation in response to reward and decreased neural activation in response to negative feedback in all groups. It has been suggested that reduced activity of the amygdala and the medial orbitofrontal cortex in response to unexpected negative feedback occurring during rule reversal represents a prediction error signaling that a change in behavior is required (
3). Thus, we propose that heightened activation of the medial orbitofrontal cortex and the amygdala during wins in bipolar patients and unaffected relatives probably represents heightened sensitivity toward reward, whereas greater activation of the amygdala and reduced deactivation of the medial orbitofrontal cortex during rule reversal in bipolar patients and their relatives represents an attenuated prediction error signal. This attenuated prediction error signal, indicated by increased medial orbitofrontal cortex activation, was particularly pronounced in unaffected relatives during negative feedback that was not followed by a behavioral change (“lose/no shift” events). This result suggests clinical manifestations in manic bipolar patients, who continue to pursue immediate rewards despite negative consequences (
19).
Similar activation patterns were observed in patients and relatives, although some relatives had a lower risk of developing the disorder than others as they had already passed the peak age of illness onset or came from simplex rather than multiplex families. However, additional analyses revealed no differences between these subgroups. This raises the question of how the observed abnormalities might contribute to an increased vulnerability for bipolar disorder. The behavioral activation system dysregulation theory (
33) suggests that a hypersensitive behavioral activation system, which regulates approach motivation and goal-directed behavior and is dependent on the orbitofrontal cortex among other regions (
34), mediates vulnerability for bipolar disorder. This system is triggered by reward or obstructed reward, eventually leading to manic or depressive episodes. Indeed, exploratory analyses revealed positive correlations between the score on the behavioral activation system scale (
35) and neural activation in the medial orbitofrontal cortex in response to reward (r=0.36, p=0.012) and rule reversal (r=0.37, p=0.008) in bipolar patients and relatives (see Table 1 in the online data supplement).
Apart from the reported similar activation patterns in response to reward and rule reversal in bipolar patients and relatives, bipolar patients exhibited increased activity in the ventral putamen and the lateral orbitofrontal cortex during rule reversal in contrast to comparison subjects and relatives. We speculate that these neural abnormalities do not represent vulnerability but a state marker. In this context, increased activity in the lateral orbitofrontal cortex signaling punishment could represent a compensatory mechanism that aids in suppressing previously rewarded responses (
2) and potentially enables adequate performance during euthymia.
Our results are in line with studies showing impaired reversal learning on a behavioral level in bipolar patients (
7–
11), altered functioning of the amygdala and orbitofrontal cortex during emotional processing in bipolar patients (
36–
37), and the behavioral activation system dysregulation model for bipolar disorder (
33), suggesting a hypersensitive behavioral activation system as a vulnerability in bipolar disorder. Moreover, our findings extend those of previous studies by identifying the neural correlates of altered reversal learning in bipolar disorder and showing that heightened emotional reactivity in bipolar disorder is not limited to primary emotional cues but also occurs for positive and negative feedback, leading to emotional and motivational dysregulation. Most importantly, the results from patients and their relatives suggest that increased activation of the amygdala and the medial orbitofrontal cortex constitutes an endophenotype of bipolar disorder. These results are further supported by another study by our group (
38) that showed increased amygdala activation in response to reward in carriers of the risk allele of
CACNA1C rs1006737, a genome-wide supported risk variant for bipolar disorder. It is worth noting that in schizophrenia, neural responses to reward are variable (
39–
40) and are related to disease state (
39). Therefore, reward-related processing could represent a neural “point of rarity” distinguishing between these two genetically related yet clinically different disorders.
This study of motivation-related circuits as a candidate endophenotype was limited by sample sizes and sample characteristics that preclude generalizability to other ethnicities and countries. Moreover, it is possible that our use of tight exclusion criteria, designed to minimize confounds inherent in bipolar research (e.g., substance abuse, medication, and residual mood symptoms), may limit the generalizability of our findings. Further work to evaluate this possibility is needed.
Conclusion
Our results identify increased activation of the medial orbitofrontal cortex and the amygdala that was related to heightened reward sensitivity and deficient prediction error signal as a candidate brain functional endophenotype of bipolar disorder. In sum, our results support a role of motivational processing in the risk architecture of bipolar disorder and identify a new systems-level therapeutic target for the illness.
Acknowledgments
The authors thank Manuela Glasbrenner, Katja Nitsche, Christiane Soennekes, Kristina Schaffer, Claudia Stief, and Birgül Sarun for their assistance in data acquisition.