Multiple sclerosis (MS) is a neuroinflammatory disease that causes sensory and motor disturbances, diplopia, ataxia, bladder disturbance, fatigue, and cognitive dysfunction.
1,2 Among these symptoms, cognitive impairments produced by MS, including impaired processing speed, attention, memory, and executive function, may adversely affect patients’ daily lives.
3 A characteristic feature of MS is that more women are affected than men. In addition to prevalence, gender differences have been addressed in MS with respect to genetic susceptibility, clinical presentation, effects of sex hormones, immune system, and response to therapy.
2 Previous studies have proposed several links between gender effects and neuroimaging findings, such as cortical atrophy, functional connectivity, and white matter changes.
4–7 However, despite the clinical importance of cognition for overall well-being in the MS population, research on the direct association between gender and cognitive dysfunction has not been widely reported in MS.
Schoonheim et al.
7 demonstrated that male subjects with MS performed more poorly than male controls in several cognitive domains, such as executive functioning, verbal memory, processing speed, working memory, attention, and psychomotor speed, yet these domains were relatively preserved in female subjects. They also observed a stronger correlation between subcortical volume and a cognitive marker (the average Z score of a battery of cognitive tests) in males with MS. In addition, significant white matter changes, such as lower fractional anisotropy, mean diffusivity, axial diffusivity, and radial diffusivity, in diffusion tensor imaging (DTI) data have been observed in male subjects with lower cognitive Z scores but not in female subjects whose cognition was relatively intact.
6 In a functional magnetic resonance imaging (fMRI) study, male subjects demonstrated decreased resting-state functional connectivity, as well as lower network efficiency, in association with deteriorating cognitive performance,
4 especially reduced visuospatial memory. While these studies focused on the relationship between cognitive dysfunction and functional/structural changes in the brain, less emphasis has been placed on characterizing cognitive differences and gender effects in MS.
Several factors appear to be predictors of cognitive dysfunction in MS: early age at onset, male sex, secondary progressive course, neurodegeneration, and low baseline intelligence.
8 Males are more likely to develop a severe disease course, including physical disability and cognitive impairments. In MS patients, it appears that cognitive dysfunction in females is less dependent on factors such as physical disability, while cognitive decline in males is correlated with Expanded Disability Status Scale (EDSS) score, age, disease duration, and education level.
5 Another study found that female subjects performed better on the Mini-Mental State Examination, the Wisconsin Card Sorting Test (WCST), language assessment, and memory tests in the Repeatable Battery for the Assessment of Neuropsychological Status Update.
9 Thus, males appear to be particularly cognitively vulnerable in MS.
The “clinical” assessment of cognitive function in individuals with MS must consider several factors, including but not limited to which cognitive domains should be prioritized for assessment, possible physical limitations (e.g., motor difficulties in manipulating a pencil, vision problems such as oscillopsia), and the individual’s vulnerability to fatigue. The field of neuropsychology has addressed these issues with different testing approaches. Rao et al.
10 created a 20-minute screening battery with a 71% sensitivity value and a 94% specificity value in its ability to identify cognitively impaired MS patients. Benedict et al.
11 developed the Minimal Assessment of Cognitive Function in MS (MACFIMS) test battery, which distinguishes relapsing-remitting MS from secondary progressive MS. Verbal memory and executive function were most predictive of vocational status. This 90-minute battery includes an attentional working memory task (PASAT), a measure of processing speed (symbol digit modalities test), verbal and visual memory (CVLT II and BVMT-R), an executive sorting task (D-KEFS), a visual spatial task (JOL), and verbal fluency (COWAT). 3T MRI has shown that the MACFIMS subtests correlate with the number of cortical lesions, cortical volume, white matter lesions, and white matter volume.
12 The Brief International Cognitive Assessment for MS (BICAMS) is another assessment tool for examining cognition in MS that has an international validation, has guidelines for defining significant change from repeated assessments, and is endorsed by the American Academy of Neurology as a cognitive measure for MS.
13 This battery is completed in 15 minutes and includes the domains of information processing speed and verbal and visual memory.
In the present study, eight neuropsychological tests that employed executive function were administered to an MS cohort (13 males, 33 females). Since these tests all probe aspects of executive function, it is likely that performance on these individual tests will be related to (correlated with) one another, yet each test will provide some unique information. We aimed to determine whether there was an established overall pattern of neuropsychological deficits seen in MS and whether this pattern of deficits could be predicted based on disease severity and other demographic factors, including gender. Our implicit hypothesis is that a combination of gender, age, education, and clinical indices will correspond with a combination of results seen in neuropsychological tests. We employed canonical-correlation analysis (CCA), a type of machine learning method used to identify patterns in large data sets. We show that a combination of demographic (i.e., gender, age, education, alone or in combination) and clinical (i.e., disease duration, severity of disability, affective states) indices accurately predicts results of frontal lobe testing. When we investigated the specific combination of demographic factors that predicted neuropsychological test performance in MS, we found that gender had the greatest influence.
The results reported in the current study are from a larger study in which traditional tasks of executive functioning were administered as the test battery. The testing was not for clinical assessment purposes. Our original study solely assessed executive functioning and was not aimed at examining other cognitive domains such as memory or tasks found in MACFIMS or BICAMS. The test battery was selected for a specific research project that focused on executive functioning and the possible relationships between biomarkers in MS (i.e., myelin water, fMRI connectivity) and performance on neuropsychological tests that traditionally have been used to assess different aspects of executive function. The findings reported here represent a secondary analysis of the neuropsychological test results.
Materials and Methods
Participants
Forty-six relapsing-remitting MS subjects (13 males, 33 females) were enrolled in a study examining the relations between results of various MRI sequences (including DTI, myelin water imaging, lesion load, resting-state fMRI functional connectivity, and cortical thickness), cognitive performance, clinical measures, and demographic characteristics. For the purposes of the present study, we focused on clinical measures and demographic variables (
Table 1), as the imaging will be the subject of another study. All the subjects fulfilled the McDonald criteria for MS diagnosis and were recruited from the MS clinic at the UBC Hospital. Referred individuals underwent a telephone screen (BK), and persons with significant untreated depression and/or other psychiatric illness, learning disabilities, history of drug or alcohol abuse, or steroid use in the last 3 months or those who showed evidence of an active flare were excluded from this study. Participants were also screened for adequate motor skills for their ability to manipulate a pencil, which would have direct impact on some of the tasks (i.e., coding and symbol search).
Ethics approval was issued by the University of British Columbia's Research Ethics Board, and all subjects provided signed informed consent.
Test Battery
As mentioned above, data collected for the original, larger study were not based on clinical assessments or interest in other cognitive domains such as memory. Therefore, test batteries used in assessments of MS populations such as MACFIMS or BICAMS were not utilized. The focus of the larger study was on executive functioning and the traditional tests utilized to measure aspects of this cognitive domain.
All subjects underwent eight psychometric measures assessing processing speed, working memory, executive function, and attention. This battery was not selected for clinical purposes. Tests included were the Wechsler Adult Intelligence Scale-IV (WAIS-IV) subtests (digit span, arithmetic, letter number sequencing, symbol search, and coding), Verbal Letter Fluency Test, WCST, and Trail-Making Test, Parts A and B (TMT A and B). Composite index scores from the WAIS-IV were also obtained, including the Working Memory Index (WMI), which utilized scores from the digit span, arithmetic, and letter number sequencing subtests. The WAIS-IV Processing Speed Index (PSI) is based on the symbol search and coding subtests. Clinical questionnaires were also administered to examine affective status, which included the Multiscore Depression Inventory (MDI) and State-Trait Anxiety Inventory (STAI). The tests are summarized in
Table 2. The other measure administered was a widely used fatigue measure in the MS field, the Fatigue Severity Scale (FSS). The duration of the assessment was approximately one hour. Raw cognitive scores were converted to standardized scores using published normative data that consider psychometric factors such as age, education, and gender. Disability rated by the Kurtzke EDSS was determined by a neurologist at the time of recruitment or scanning.
Statistical Analyses
Two-sample t tests (two-tailed) were carried out on all scores to compare males and females. When normality did not hold, transformed scores were calculated in R (version 3.2.0), a language and environment for statistical computing. CCA
14 was done in MATLAB (MathWorks) to determine whether demographic variables were related to cognitive scores. CCA was chosen to study complex multivariate data, and it can be considered one of the original “machine learning” algorithms that are now used to explore “big data” in many fields.
15–17 The advantage of the CCA method is that it is an extension of regression that enables additional factors to be included. If we have two sets of correlated variables (e.g., the performance on the neuropsychological tests will likely be correlated, and some demographic factors such as disease duration and age will likely be correlated), CCA attempts to identify the linear combinations of these two sets of variables (i.e., vectors) that maximally correlate with each other. It is noteworthy that this will likely be a more powerful approach than regression where one could try and predict the neuropsychological performance on one test (i.e., in regression there is only one dependent variable).
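The idea of finding maximally correlated linear combinations of two variable sets can be illustrated with a minimal sketch. The study itself ran CCA in MATLAB; the Python function below is only an illustration under stated assumptions: the function name `cca`, the QR-plus-SVD formulation, and the synthetic data (46 subjects, 5 demographic variables, 8 scores) are hypothetical choices and are not taken from the study's code or data.

```python
import numpy as np


def cca(X, Y):
    """Canonical correlation analysis via QR decomposition and SVD.

    X is an (n, p) array and Y an (n, q) array; rows are subjects.
    Returns (A, B, r): canonical coefficients for X and for Y, and the
    canonical correlations sorted from largest to smallest.
    """
    X = np.asarray(X, dtype=float)
    Y = np.asarray(Y, dtype=float)
    n = X.shape[0]
    # Center each variable so that correlations are computed around the mean.
    Xc = X - X.mean(axis=0)
    Yc = Y - Y.mean(axis=0)
    # Orthonormal bases for the column spaces of the two centered sets.
    Qx, Rx = np.linalg.qr(Xc)
    Qy, Ry = np.linalg.qr(Yc)
    # Singular values of Qx' Qy are the canonical correlations.
    U, s, Vt = np.linalg.svd(Qx.T @ Qy, full_matrices=False)
    # Map back to coefficients on the original (unnormalized) variables,
    # scaled so that each canonical variate has unit sample variance.
    A = np.linalg.solve(Rx, U) * np.sqrt(n - 1)
    B = np.linalg.solve(Ry, Vt.T) * np.sqrt(n - 1)
    return A, B, np.clip(s, 0.0, 1.0)


# Synthetic illustration only: one shared latent factor links the two sets.
rng = np.random.default_rng(0)
n = 46
latent = rng.normal(size=n)
demo = rng.normal(size=(n, 5))      # stand-in for demographic variables
demo[:, 0] += latent
scores = rng.normal(size=(n, 8))    # stand-in for cognitive/affective scores
scores[:, 0] += latent

A, B, r = cca(demo, scores)
# First pair of canonical variates: one linear combination per variable set.
u = (demo - demo.mean(axis=0)) @ A[:, 0]
v = (scores - scores.mean(axis=0)) @ B[:, 0]
print("first canonical correlation:", r[0])
```

In this sketch the first columns of `A` and `B` play the role of the canonical coefficients (weightings) discussed in the text, and the correlation between `u` and `v` equals the first canonical correlation. Because the variables are not rescaled, coefficient magnitudes depend on each variable's units.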
In keeping with prior neuropsychological approaches, we analyzed timed and untimed tests separately. We performed CCA on cognitive scores divided into two groups (timed and untimed tests) and demographical variables. The timed group included four affective variables (MDI total t scores, STAI standardized state scores and trait scores, and FSS raw scores) and eight variables from the tests in which subjects were timed during performance, including three scaled scores in WAIS-IV (arithmetic, symbol search, and coding), Verbal Letter Fluency Test, and four measures of TMT (transformed TMT A scores, TMT A Z scores, transformed TMT B scores, and TMT B Z scores). Due to nonnormality, raw scores of TMT A and B were transformed by taking the square root of the reciprocal of the raw scores. The transformed score could be expressed as follows:

$$\text{transformed score} = \sqrt{\frac{1}{\text{raw score}}}$$
In the untimed group, four affective variables (same as timed group) and eight cognitive variables from the tests in which subjects were not timed were included. The eight cognitive variables were two scaled scores on WAIS-IV (digit span and letter number sequencing) and six measures on WCST (standard score of errors, perseverative response, perseverative errors, raw scores of categories completed, trials to complete first category, and failure to maintain set). Age, gender, education, EDSS scores, and disease duration were included as demographic variables in both groups.
Calculations were performed in leave-one-out fashion to control for multiple comparisons and to ensure the robustness of our results. Specifically, we excluded one subject at a time and performed the CCA analysis each time. The variability in the weightings, when each of the subjects was removed, was recorded. This gives an estimate of how much the results could change if more subjects were added.
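The leave-one-out procedure described above can be sketched as follows. This is a hedged illustration, not the study's MATLAB code: the function names, the QR-plus-SVD formulation of CCA, the sign-alignment rule, and the synthetic data are all assumptions introduced here.

```python
import numpy as np


def first_canonical_weights(X, Y):
    """Return the first pair of canonical weight vectors and their correlation."""
    Xc = X - X.mean(axis=0)
    Yc = Y - Y.mean(axis=0)
    Qx, Rx = np.linalg.qr(Xc)
    Qy, Ry = np.linalg.qr(Yc)
    U, s, Vt = np.linalg.svd(Qx.T @ Qy, full_matrices=False)
    scale = np.sqrt(X.shape[0] - 1)
    a = np.linalg.solve(Rx, U[:, 0]) * scale
    b = np.linalg.solve(Ry, Vt[0]) * scale
    return a, b, s[0]


def leave_one_out_weights(X, Y, ref_index=0):
    """Refit CCA with each subject removed in turn and collect the weights."""
    n = X.shape[0]
    a_all, b_all = [], []
    for i in range(n):
        keep = np.arange(n) != i          # drop subject i
        a, b, _ = first_canonical_weights(X[keep], Y[keep])
        # The sign of a canonical pair is arbitrary. Assumption: flip both
        # vectors so the reference X weight is non-negative in every fold,
        # which makes the folds comparable.
        if a[ref_index] < 0:
            a, b = -a, -b
        a_all.append(a)
        b_all.append(b)
    return np.array(a_all), np.array(b_all)


# Synthetic illustration only (not study data).
rng = np.random.default_rng(1)
n = 46
gender = np.zeros(n)
gender[:13] = 1.0                         # 1 = male, 0 = female, as in the text
X = np.column_stack([gender, rng.normal(size=(n, 4))])
Y = rng.normal(size=(n, 4))
Y[:, 0] -= 2.0 * gender                   # hypothetical gender effect on one score

a_all, b_all = leave_one_out_weights(X, Y)
a_mean = a_all.mean(axis=0)               # mean canonical coefficient per variable
a_sd = a_all.std(axis=0, ddof=1)          # variability across left-out subjects
print("mean weights:", a_mean)
print("sd of weights:", a_sd)
```

A weighting whose mean is large relative to its spread across folds can be read as distinguishable from zero, which is the sense in which the Results describe a variable as an important factor.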
Results
In demographic and clinical measures, only education was significantly different between males and females (
Table 1). Female subjects had more years of education than male subjects (female/male: 15.3±2.2 and 13.7±2.7, uncorrected p=0.04, df=44). Among all the scores, the following tests showed significant differences between male and female subjects: WCST categories completed (female/male: 5.5±1.1 and 3.3±1.9, uncorrected p=1.97e-05, df=44), WCST trials to complete first category (female/male: 12.7±4.4 and 38.6±38.3, uncorrected p=3.32e-04, df=44), and TMT A (female/male: 30.7±12.3 and 40.3±13.3, uncorrected p=0.02, df=44) (
Table 1), as well as Transformed TMT A (female/male: 0.2±0.0 and 0.2±0.0, uncorrected p=0.02, df=44) and B (female/male: 0.1±0.0 and 0.1±0.0, uncorrected p=0.003, df=44) scores.
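As a rough check on how such a comparison works, the education result can be approximately recomputed from the reported summary statistics. The sketch below assumes a pooled-variance (Student) two-sample t test, which is consistent with the reported df=44 for 33 females and 13 males. The function names and the Simpson's-rule integration of the t density are illustrative choices, not the study's R code, and the rounded means and standard deviations limit the precision of the result.

```python
import math


def pooled_t_from_summary(mean1, sd1, n1, mean2, sd2, n2):
    """Student two-sample t statistic and degrees of freedom from summary data."""
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df
    se = math.sqrt(pooled_var * (1.0 / n1 + 1.0 / n2))
    return (mean1 - mean2) / se, df


def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    c = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return c * (1 + x * x / df) ** (-(df + 1) / 2)


def two_tailed_p(t, df, steps=2000):
    """Two-tailed p value by integrating the density on [0, |t|] (Simpson's rule)."""
    upper = abs(t)
    h = upper / steps                      # steps must be even for Simpson's rule
    total = t_pdf(0.0, df) + t_pdf(upper, df)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * t_pdf(k * h, df)
    central = total * h / 3                # P(0 < T < |t|)
    return max(0.0, 1.0 - 2.0 * central)


# Years of education as reported in the text: females 15.3±2.2 (n=33),
# males 13.7±2.7 (n=13).
t_edu, df_edu = pooled_t_from_summary(15.3, 2.2, 33, 13.7, 2.7, 13)
p_edu = two_tailed_p(t_edu, df_edu)
print(f"t = {t_edu:.2f}, df = {df_edu}, p = {p_edu:.3f}")
```

The printed p value can be compared with the uncorrected p=0.04 reported for education. The same function could be applied to the other reported means and standard deviations, with the caveat that the values are rounded.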
CCA results showed that gender (mean canonical coefficient: timed=1.01, untimed=0.94) appears to be an important factor (i.e., weightings can be distinguished from zero) in the combination of demographic variables for both timed and untimed tests. In the untimed group, EDSS (mean canonical coefficient: 0.26) was influential in addition to gender. Age, education, and disease duration showed limited effects on both timed and untimed tests (mean canonical coefficient: timed/untimed=0.07/0.00, 0.03/−0.05, −0.02/−0.03, respectively) (
Figures 1 and
2).
Since we assigned male gender as 1 and female gender as 0 in the calculation, the variables that showed positive canonical coefficient indicated higher scores in male subjects as long as gender also showed positive weightings. On the other hand, the tests on which female subjects obtained higher scores show negative canonical coefficients. This was because the model treated female gender (0) as “baseline.” The weightings of gender indicate the effects of male compared with female in combination of demographic variables, as well as the combination of cognitive scores. In the timed group, transformed TMT A and B scores showed strongly negative effects compared with other variables (mean canonical coefficient: –28.3 and –31.8, respectively) (
Figure 1). On the contrary, the TMT A Z score demonstrated a positive mean canonical coefficient with 0.64 in the combination of cognitive scores, illustrating that there was a gender-specific pattern in timed cognitive tests where female subjects had higher transformed TMT A and B scores and male subjects had higher TMT A Z scores. In addition, cognitive scores and demographic variables were highly correlated with each other (correlation r=0.84, p<0.001 [
Figure 1]), meaning that the linear combination of all the variables in demographics and all the variables in timed cognitive scores forms a significant model to explain and predict the data.
Figure 2 shows the results from the untimed group. Male subjects had higher scores (i.e., these scores showed higher weightings among male subjects) on WCST errors, WCST perseverative errors, trials to complete first category, trait anxiety, and fatigue (mean canonical coefficient: 0.03, 0.14, 0.01, 0.02, and 0.01, respectively), while female subjects obtained higher scores (i.e., these scores were more influential on the linear model of all scores among female subjects) on WAIS-IV digit span, WAIS-IV letter number sequencing, WCST perseverative responses, WCST categories completed, failure to maintain set, and state anxiety (mean canonical coefficient: –0.03, –0.04, –0.14, –0.29, –0.06, and –0.01, respectively). This implied that female subjects completed more categories on the WCST than males, but higher perseverative responses also indicated that females were potentially more prone to cognitive inflexibility as they changed strategies less frequently. Moreover, females also performed better on the digit span and letter number sequencing tasks, meaning that they had better attentional abilities, since the scores of these two tests partially form the WMI. Finally, the two sets of variables (combination of cognitive scores and combination of demographic characteristics) were highly correlated with each other in the untimed group (correlation r=0.84, p<0.001 [
Figure 2]), indicating that the linear combination of all the variables in demographics and untimed cognitive scores forms a model explaining the data.
Discussion
With the CCA method, we have reported that 1) there were gender-specific cognitive patterns in MS and 2) some demographical variables had stronger effects on cognitive performance. More importantly, gender differences emerged with a focused test battery of assessments sensitive to aspects of executive functioning. The findings from the present study support including both untimed and timed tasks in the cognitive assessment of individuals with relapsing-remitting MS.
Gender is an Influential Factor in Cognition Among MS Subjects
Gender differences have been reported in previous studies showing that females perform better than males on memory tasks and the WCST.
4,9 Moreover, male sex has been proposed as a risk factor for the development of severe cognitive decline.
8 These studies investigated cognitive function based on the responses on cognitive tests among males and females separately. Given that cognition is a complex multidimensional entity that can be assessed within different domains,
18 it is reasonable to assume that there are multiple intercorrelated variables modulating cognition. Therefore, we investigated the
intercorrelation from combinations of demographic and cognitive variables through CCA, determining which linear combination of demographic and cognitive scores best represent the data among the two genders as a whole. CCA is an approach for exploring relations between two multivariate sets of variables and evaluating which linear combinations of variables can best explain the variability. In this study, the cognitive/affective scores and demographical indices were the two multivariate sets of variables in both groups. Since we did not normalize variables in our analysis, we could only interpret high weightings as “important factors” (i.e., significantly different from zero) rather than “the most important factors,” as they were not on the same scale. Several cognitive scores have been normalized based on normal population; therefore, it would not be reasonable to normalize all the scores again.
As shown in Figures 1 and 2, gender was a strong demographic factor in timed tests, and both gender and EDSS were influential in untimed tests, whereas the remaining variables had limited effects (weightings <0.1). Because demographic factors were taken into account in our model, these results highlight that both EDSS and gender were influential factors for cognition in our data. Our results agree with previous research showing that gender is a predictor of cognitive dysfunction in MS and that male subjects generally show worse performance.
7,8 Education and affective states can directly affect cognition.
19 However, in this study, education did not show effects on cognitive profile even though female subjects had higher education. Moreover, none of the affective variables were significantly different between the two genders, and all of them had limited weightings in CCA results. Although there were trends that the females in this study were more educated, it was not a significant difference (see
Table 1). Therefore, we concluded that education and affective status did not significantly influence cognition in our cohort.
Distinct Cognitive Patterns in Male and Female Subjects With Relapsing-Remitting MS
Instead of evaluating cognition by individual tests, we assessed the cognitive profile in MS by including more than one variable in our calculation. Due to the complexity of human cognition, different cognitive performances have relationships with demographic variables. Therefore, we included all the cognitive and demographical variables in our model and separated cognitive scores into two groups: timed and untimed groups.
Because transformed TMT scores were the square roots of the reciprocals of the raw TMT scores (and therefore run in the opposite direction to the raw scores), our interpretation of the high weightings in TMT scores (
Figure 1) is that female subjects performed better on both TMT A and B (i.e., faster responses). We conclude that set-shifting abilities assessed by TMT A and B were less impaired in female MS subjects than in males. Finally, the high correlation between the combination of demographic characteristics and combination of timed cognitive scores illustrated that these linear combinations were significant and robust enough to explain the data.
Figure 2 demonstrates that gender had strong weightings in the combination of demographic variables and untimed scores. Among all the untimed scores, male subjects obtained high weightings on WCST perseverative errors and errors, illustrating poor performance on the WCST. In contrast, females tended to perform better on the WCST because they had higher scores on WCST categories completed. However, they also had higher scores on WCST perseverative responses and failure to maintain set, which possibly indicates an inability to use feedback to modify their response behavior. It is notable that perseverative behavior, whether errors or responses, was found in both genders. This is more indicative of the neuropathology (i.e., white matter damage reduces neuronal communication) seen in MS. In addition, females obtained higher scores on the WAIS-IV digit span and letter number sequencing tests, indicating that, compared with male subjects, their basic ability to sustain attention was less affected.
Finally, the untimed group also demonstrated high correlations between the combination of cognitive scores and combination of demographic variables, and again, gender and a subset of WCST scores were the strong factors that explained cognitive patterns.
Separating Timed/Untimed Tests is More Sensitive to Detect Gender Differences in MS
Gender differences on cognitive tasks have been long established.
20 Separating the timed from nontimed tasks is an important, growing trend in the analysis of neuropsychological test data in MS.
21 We also analyzed timed and untimed scores together, but the model was not significant. The current results indicate that analyzing timed and untimed scores separately is more effective in distinguishing cognitive profiles between male and female MS subjects.
Limitations
There were only 13 male subjects in our study, and only one of them had a high EDSS score. This raises the issue of sample size and the possibility that the tested population was not sufficiently representative of the male MS population. Although we obtained a substantial amount of data for each individual, the total number of subjects was not as large as that of other studies that focused on behavioral data. This was because the original design of the study was to investigate the association between behavioral data (clinical indices and neuropsychological scores) and neuroimaging findings such as functional connectivity and structural integrity. This cohort was relatively large for imaging studies but small for analyses of neuropsychological variables. In our cohort, females had more education than males (approximately 2 years), but the range of years of education was very similar for both males and females. It is quite possible that this education advantage contributes to cognitive reserve and better performance on psychometric tests. This will need to be examined in future studies. In addition, this study only examined the relations between demographic variables and multiple measures of executive function. The same approach should be implemented to study how demographics are related to other neuropsychological functions. Finally, since our aim was to investigate the “relationships” and “combinations” between variables in MS, we did not include healthy subjects in the analyses.
Executive Functioning and Localization
As mentioned previously, our original investigation examined executive functioning and connectivity in patients with relapsing-remitting MS. The test battery for this study was based on tasks widely used to assess executive functioning.
In neuropsychology, our understanding of brain functioning and localization has evolved over time and has been challenged with the development of imaging techniques such as fMRI. More importantly, skilled clinicians have become astutely aware that tasks such as the WCST may not be localized specifically to certain brain regions as originally proposed, for example, to the dorsolateral prefrontal cortex or to the left hemisphere.
22 However, many variables contribute to the diversity of findings, including which WCST administration technique was used and whether the norms were based on normal controls, neurological patients (e.g., patients with epilepsy), or psychiatric populations. Again, a skilled clinician is aware of these findings but can still use the task to comment on impairment of executive skills such as concept formation, set switching, maintaining set, and the ability to use feedback to change behavior, as well as to provide evidence for perseveration.
Conclusions
Taken together, we conclude that gender is one of the influential factors in cognitive performance in subjects with MS. This study utilized cognitive tasks specifically involving frontal connectivity. Moreover, there are specific cognitive patterns in MS subjects. First, the transformed TMT A and B scores had strong weightings that could be distinguished from zero, meaning that female subjects performed better on the TMT than males. Second, in untimed tests, male subjects made more errors and female subjects performed better on the WCST. Finally, our results imply that with this particular test battery, female subjects with MS were less cognitively impaired than male subjects. Our study is unique in that we found gender differences on selected executive tasks and report a trend that gender differences may also differ between timed and untimed tasks, which has become an important area of focus in the cognitive assessment of individuals with relapsing-remitting MS. These findings need to be replicated, but they underline the importance of selecting tests and test batteries that are sensitive to these issues.
To our knowledge, no prior research has examined the relation between cognition and demographic variables using CCA in MS. With this approach, we were able to reveal gender-specific cognitive patterns in MS. These findings have implications for not only the assessment of cognitive functioning in MS, but also our understanding of sex differences in cognition in the presence of chronic illnesses.