Among U.S. adults, pain is a highly prevalent condition (
1) that is often treated with prescription opioids. Between the 1990s and 2011, the number of U.S. opioid prescriptions nearly tripled (
2). While the number has since declined, it remains very high (>142 million in 2020 [
3]), making adverse consequences of prescribed opioids a serious public health concern. Fatal prescription opioid overdoses, the most serious adverse consequence, soared, and while they have since declined alongside declining prescription rates, more than 14,000 individuals died from prescription opioid overdoses in 2019 (
4). An additional serious consequence of prescription opioid use is addiction to prescription opioids, or prescription opioid use disorder (POUD) (
5,
6). However, among patients receiving prescribed opioids for chronic pain, determining the prevalence of prescription opioid addiction/POUD and trends in this prevalence is challenging due to conceptualization and measurement issues.
The terms “opioid use disorder,” “addiction,” and “dependence” are often used inconsistently and interchangeably (
5,
7,
8), leading to conceptual confusion and measurement inconsistencies (
9). DSM-5 did not include the term “addiction” (page 485) because of its uncertain meaning and potentially pejorative connotations (
10–
12), but many in the field continue to use the term. In addition, provider confusion between the term “addiction” and physiological dependence (
11–
14) may lead some to diagnose POUD erroneously. These inconsistencies are likely to have contributed to widely varying POUD prevalences (
9,
15) among patients who have prescriptions for opioids for pain (
16–
24), leaving the actual prevalence of POUD among such patients unclear.
Diagnosing opioid use disorder in patients receiving prescribed opioids for chronic pain is inherently complex and difficult. DSM-5 defines substance use disorder, including POUD, with 11 criteria (
10), including tolerance and withdrawal (physiological dependence [
11]) and nine behavioral or subjective criteria (e.g., craving, considerable time using). DSM-5 also provides definitions of three severity levels: mild, moderate, and severe (2–3, 4–5, and 6–11 criteria, respectively) (
10,
25). The DSM-5 criteria for opioid use disorder (OUD) overlap almost entirely with the DSM-IV criteria (
25) and have near-perfect agreement in case identification (
26). These definitions of OUD have major importance to the field because they determine eligibility for participation in most OUD treatment trials (e.g.,
27,
28), including POUD (
29).
OUD criteria were initially developed in studies in heroin users at addiction clinics in the 1980s (
30–
32), when virtually all opioid use was illicit. Recognizing the increases in use of licit opioids prescribed for pain, DSM-5 recommended not diagnosing POUD in individuals using opioids as prescribed when the only criteria met were tolerance and/or withdrawal (
10,
25), the expected physiological adaptations to treatment with opioids (
5,
6,
9,
11). Some have disagreed with this approach (
33–
35), while others have proposed an entirely different syndrome for this situation, namely, “complex persistent opioid dependence” (
34–
36), which is based on clinical observation and in-depth expert neurobiological reasoning. Complex persistent opioid dependence is not yet operationalized in a research measure, although the DSM-5 approach to tolerance and withdrawal can be readily operationalized to study its reliability and validity. Another important concern about DSM-5 as it pertains to prescribed opioids is that in the behavioral and subjective POUD criteria, the role of pain relief is not addressed.
Empirically, substance use disorders cluster within the broader domain of externalizing disorders and traits (
37–
41) (e.g., antisocial personality disorder, sensation-seeking, impulsivity [
38–
40]) that share early onset and etiology (
42–
45). From this perspective, behavioral OUD criteria can be seen as phenotypic manifestations of externalizing psychopathology. However, patients receiving legal, prescribed opioids for the therapeutic (indicated) purpose of pain relief are not typically characterized by externalizing traits (
23,
46,
47). Thus, while behavioral and subjective POUD criteria may appear phenomenologically similar in patients treated for pain and substance users treated for addiction, the criteria may represent different underlying phenomena if, among patients treated for pain, the observed behavioral symptoms have therapeutic intent (i.e., pain relief) rather than motives such as externalizing reasons, such as to get high (
10,
48), or motives such as coping with internalizing conditions (e.g., depression, anxiety) that are common among those with pain and are associated with addiction (
49–
52). DSM-5 clarifies that among individuals with a prescription for analgesic opioids for pain relief who are using more opioids than prescribed, OUD can be diagnosed if the signs and symptoms are present, but “not only because of persistent pain” (p. 542). Therefore, ignoring pain relief motives when assessing POUD criteria could lead to misdiagnosed POUD/addiction. Until now, patient-oriented assessment of pain relief (therapeutic intent) has not been incorporated into the measurement of DSM-5 POUD criteria.
Given the many issues related to prescription opioids, the U.S. Food and Drug Administration (FDA) held a series of meetings for pain, opioid, and addiction experts, providers, and stakeholders and issued a set of postmarketing requirements (PMRs) to study prescription opioids, including PMR #3033-5 to develop and validate a standardized measure of prescription opioid substance use disorder/addiction that incorporated patient-oriented assessment of pain relief/therapeutic intent (
15,
53). For this, the clinician-administered, computer-assisted Psychiatric Research Interview for Substance and Mental Disorders, DSM-5 opioid version (PRISM-5-OP) was designed (
15). The PRISM-5-OP was adapted from the PRISM (DSM-IV) (
54–
56) and the PRISM-5 (DSM-5) (
57,
58), which were used previously in reliability and validity studies (
57–
59) and in a study of POUD in primary care (
60). The PRISM-5-OP assesses POUD in three ways (
15): 1) completely unadjusted: all DSM-5 criteria are considered positive if they occurred, including tolerance and withdrawal, regardless of whether opioids were used only as prescribed or not; 2) DSM-5: withdrawal and tolerance are not considered positive if patients used prescribed opioids only as prescribed; and 3) pain-adjusted: POUD behavioral/subjective criteria are not considered positive if pain relief/therapeutic intent was the
only motive for the behavior described by the criterion. The validation strategy consisted of comparing the strength of the associations of these three PRISM-5-OP POUD measures with a set of 10 convergent and discriminant validators. The primary comparisons of interest were between the pain-adjusted and DSM-5 POUD measures, indicating whether the pain adjustments improved validity over the DSM-5 measure. Comparing DSM-5 and completely unadjusted POUD explored the validity of the DSM-5 adjustment for tolerance and withdrawal compared to no adjustment.
Methods
Settings and Procedures
Urban and suburban patients were recruited from two types of settings: inpatient substance treatment (two sites) and outpatient pain clinics (four sites) that utilized procedures to avoid prescribing opioids to patients for nontherapeutic reasons. These two types of settings were selected to determine whether distributions of POUD measures differed between patients with established substance use disorders and medical pain patients (an informal validity check), and to provide sufficient variance to conduct the analyses. After learning about the study from flyers or clinic staff, patients contacted study coordinators, who explained the study, screened for eligibility (≥18 years old, ever received a prescription for a ≥30-day supply of opioids to treat chronic pain, English-speaking), and obtained informed consent (see recruitment and enrollment details in Figure S1 in the
online supplement). The final sample included 606 patients, of whom 348 were from pain clinics and 258 from substance treatment. Institutional review boards at New York State Psychiatric Institute, Cornell Hospital (Cornell University), Holy Name Medical Center (Rutgers University), Bellevue Hospital (New York University), and South Oaks Hospital (Hofstra University) approved the study.
Interviews, self-administered questionnaires, and electronic medical records (EMRs) provided the study data. Interviews were conducted between February 29, 2016, and April 24, 2017, by telephone (49.8%) or in person. For efficiency, all substance treatment patients were assessed in person; most pain clinic patients were assessed by telephone, based on patient or clinic preferences. In-person interviewees completed self-administered questionnaires on a tablet; telephone interviewees used a web link. Trained coordinators extracted EMR variables, entering them into a form on a tablet. Of the 606 patients, 92.4% agreed to retest reliability interviews, and by design, we reinterviewed approximately one-third of the sample (N=206; substance treatment patients, N=97; pain clinic patients, N=109). Test and retest interviews were conducted by different interviewers, with retest interviewers blinded to initial interview results. Interviews were randomly assigned to available interviewers; 60.6% of retests were completed by telephone. The mean test-retest interval was 7.2 days (SD=6.34, range=0–19). Because the purpose of the retest was to determine replicability of the measure when administered by two independent raters, a short interval was selected to reduce the impact of true change in clinical condition on agreement between the first and second interviewer.
PRISM-5-OP interviewers (N=26) had master’s degrees in social work or clinical psychology and had clinical experience (mean=3.9 years, SD=4.6, range=1–24). Interviewers received 2 days of PRISM-5-OP training (didactic, role-playing) from experienced supervisors, practiced the interview, and submitted recordings, which were reviewed by supervisors. Interviewers were certified after supervisors rated two interviews as acceptable. To maintain consistency during the study, supervisors rated recordings of randomly selected interviews (10%), using structured forms, and supervised interviewers as necessary.
Measures
PRISM-5-OP.
The PRISM-5-OP assesses DSM-5 criteria for substance use disorders and psychiatric disorders, with an expanded POUD module near the beginning of the interview. This module includes additional opioid-related items (e.g., used only as prescribed, more/other than as prescribed), and for POUD behavioral/subjective criteria, reasons the criterion occurred, classified as pain relief (therapeutic) and nontherapeutic (e.g., to feel high, for affect, for sleep, in reaction to conditioned cues).
Three POUD criteria sets: completely unadjusted, DSM-5, pain-adjusted.
Each DSM-5 POUD criterion is evaluated for occurrence within the past 12 months. In the completely unadjusted criteria set, all criteria that occurred are considered positive and are counted toward a POUD diagnosis. In the DSM-5 set, tolerance and withdrawal are coded positive only if they occurred among those using prescription opioids more than or other than as prescribed. In the pain-adjusted criteria set, tolerance and withdrawal are rated as in the DSM-5 set, while the behavioral/subjective criteria are considered positive only if they occurred for a nontherapeutic reason (i.e., other than pain relief).
Dimensional POUD measures.
By summing the criteria, dimensional measures were created for each POUD criteria set. The completely unadjusted measure was a count of all positive unadjusted criteria. The DSM-5 measure was a count of all criteria that occurred except tolerance and withdrawal, which were considered positive if they occurred only in patients using opioids in nonprescribed ways. The pain-adjusted measure included the DSM-5 adjustment and, in addition, considered as positive only the criteria reflecting behaviors that occurred for nontherapeutic reasons (i.e., for reasons other than pain relief). The completely unadjusted, DSM-5, and pain-adjusted measures had excellent internal consistency (Cronbach’s alpha, 0.92, 0.93, and 0.95, respectively). In exploratory factor analysis, all three criteria sets fit a one-factor model—that is, one eigenvalue >1.0 and the rest well below 1.0, standard fit indices within recommended values (comparative fit index ≥0.95, root mean square error of approximation <0.06), and factor loadings of 0.79–0.96 for completely unadjusted, 0.82–0.99 for DSM-5, and 0.79–0.99 for pain-adjusted criteria (see Table S1 in the
online supplement).
POUD diagnosis.
Binary diagnoses were created from the dimensional measures using the DSM-5 thresholds of ≥2 criteria to indicate any DSM-5 diagnosis, and ≥4 criteria to indicate moderate to severe disorders. (Exploring other thresholds did not indicate improved validity of any novel thresholds; see the
online supplement for supporting text, Tables S2 and S3, and Figures S2 and S3.)
Validators.
Ten validators were used, of which eight were convergent validators: 1) substance treatment: recruitment from substance settings, endorsing treatment in the PRISM-5-OP interview; any positive=yes; 2) family history of drug use disorders: PRISM-5-OP using validated items from general population surveys (
61,
62); any parent or sibling with drug problems=yes; 3) DSM-5 other substance use disorder (alcohol, nicotine, cannabis, illicit drugs): PRISM-5-OP or EMR; any positive=yes; 4) antisocial personality disorder: PRISM-5-OP; 5) internalizing DSM-5 disorders: major or persistent depression, generalized anxiety, posttraumatic stress, PRISM-5-OP or EMR; any positive=yes; 6) tampering with prescribed opioid medication: crushing, dissolving, etc. (self-administered questionnaire); any positive=yes; 7) and 8) impulsivity, sensation-seeking (traits associated with illicit drug use), brief validated scales (
63,
64) added to the self-administered questionnaire midway through the study; completed by 314 patients. The other two validators were discriminant validators: 9) worst pain, past week: Brief Pain Inventory (
65–
67), a validated, widely used pain scale (self-administered questionnaire); and 10) prescription for legitimate reason: whether patients always obtained prescription opioids from a physician “for actual pain or a legitimate medical reason” (self-administered questionnaire).
Sociodemographic characteristics.
Sociodemographic characteristics included age categories (18–29, 30–39, 40–49, and ≥50 years), sex, race/ethnicity (non-Hispanic White, Black, and Other; Hispanic ethnicity), marital status (married, living together, neither), educational attainment (any college, yes/no), employed (yes/no), and health insurance (public, private, none).
Statistical Analysis
Reliability.
Cohen’s kappa and intraclass correlation coefficients (ICCs) with 95% confidence intervals indicated chance-corrected agreement between test and retest interviews (
68) for binary and dimensional variables, respectively. Values of 0.21–0.40 indicate fair agreement, 0.41–0.60 moderate agreement, 0.61–0.80 substantial agreement, and 0.81–1.00 excellent agreement (
69,
70).
Validity.
Validity was evaluated by testing differences between the pain-adjusted and DSM-5 POUD measures in the strength of their associations with the validators. Bivariate correlated-outcome regressions (
71,
72) simultaneously modeled the two POUD measures as outcomes (either two dimensional or two dichotomous), taking into account their within-participant correlation using generalized estimating equations (implemented with Proc GENMOD in SAS). Separate models were fitted for each validator (predictor) to determine its association with both of the POUD measures, adjusting for covariates and testing whether one POUD measure was more strongly associated with the validator. The POUD dimensional measures were modeled using a negative binomial distribution with log link based on analyses that identified the best-fitting distribution; the dichotomous POUD measures were modeled using a logistic link. Results were expressed in mean ratios for dimensional POUD measures and odds ratios for dichotomous ones. Mean ratios and odds ratios ≥1.0 show greater severity or odds, respectively, of the POUD measure for participants with the validator present rather than absent, or at higher values of the validator, supporting validity for convergent validators; values <1.0 show greater severity or odds when the validator is absent, supporting validity for discriminant validators. Ratios of the mean ratios or odds ratios for pain-adjusted and DSM-5 POUD measures indicated whether one version showed stronger association with the validators than the other. Analyses controlled for age, sex, race, education, marital status, employment, and health insurance. Using the same models, to explore the validity of the DSM-5 adjustments for withdrawal and tolerance, we also compared the validity of the completely unadjusted and DSM-5 POUD measures.
Results
Sample Characteristics
The characteristics of the full sample and the test-retest reliability subset were similar (see Table S4 in the
online supplement). About half the participants were ≥50 years of age, female, and/or unmarried; about 80% were White; about two-thirds had some college education; and about 22% were employed. Compared to patients in substance treatment, patients from pain clinics were more often older, female, college-educated, and married, and they had higher pain scores, lower sensation-seeking and impulsivity scores, and fewer substance use or psychiatric disorders.
The prevalences of completely unadjusted, DSM-5, and pain-adjusted POUD diagnoses with ≥2 and ≥4 thresholds ranged from 25.3% to 51.0% in the full sample, 53.9% to 62.0% among patients in substance treatment, and 4.0% to 42.8% among pain clinic patients (
Table 1). Pain adjustments made few significant differences in POUD prevalence among patients in substance treatment, who rarely engaged in POUD criteria only for pain-relief (therapeutic) reasons (
Table 1). However, pain adjustments substantially reduced prevalence among pain clinic patients. Similar patterns were observed for the individual criteria, with a significantly greater adjustment effect among pain clinic than substance treatment patients for all but one criterion (see Table S5 in the
online supplement). Means of the dimensional POUD measures (
Table 1) showed similar patterns.
Test-Retest Reliability
The completely unadjusted, DSM-5, and pain-adjusted PRISM-5-OP POUD dimensional measures (
Table 2) had excellent reliability (ICCs, 0.79–0.82), which was somewhat better among pain patients (ICCs, 0.87–0.93) than among patients in substance treatment (all ICCs=0.72). For diagnostic POUD measures, reliability was moderate for completely unadjusted and DSM-5 diagnoses (kappa values, 0.53–0.59) and moderate to substantial for pain-adjusted diagnoses (kappa values, 0.59–0.65), with reliability higher among pain patients, especially for pain-adjusted POUD (kappa values, 0.71–0.80). Two PRISM-5-OP items—taking more opioids than prescribed and using without a prescription—were used in the DSM-5 algorithm for withdrawal and tolerance, so their reliability was examined. Taking more than prescribed had substantial reliability overall (kappa=0.80, 95% CI=0.70, 0.90) and among patients in substance treatment (kappa=0.78, 95% CI=0.63, 0.94), and excellent reliability among pain patients (kappa=0.82, 95% CI=0.69, 0.95). Taking opioids without a prescription had substantial reliability overall (kappa=0.77, 95% CI=0.66, 0.88) and among patients in substance treatment (kappa=0.66, 95% CI=0.51, 0.81) and was excellent among pain patients (kappa=1.00).
Validity: Pain-Adjusted Versus DSM-5 POUD Measures
Dimensional POUD measures.
All convergent validators (
Table 3) were positively associated with the pain-adjusted (mean ratios, 1.60–5.30) and DSM-5 POUD (mean ratios, 1.26–2.30) measures. Associations were significantly stronger with the pain-adjusted measures than with the DSM-5 measures (ratios of mean ratios, 1.22–2.31). Pain-adjusted POUD was negatively associated with both discriminant validators (mean ratios, 0.96 and 0.46). DSM-5 POUD was unrelated to worst pain, but negatively associated with legitimate opioid use (mean ratio, 0.73). Pain-adjusted dimensional POUD had significantly stronger associations with both discriminant validators than DSM-5 dimensional POUD.
POUD binary diagnostic measures.
At the ≥2 threshold, a pain-adjusted POUD diagnosis was associated with all 10 validators, and at the ≥4 threshold, it was associated with nine of the 10 validators (
Table 4). The DSM-5 POUD measures at the ≥2 and ≥4 thresholds were associated with five and seven validators, respectively. At both thresholds, odds ratios were significantly stronger for pain-adjusted diagnoses than DSM-5 diagnoses with all validators but two (internalizing disorders, impulsivity).
Validity: Completely Unadjusted Versus DSM-5 POUD Measures
For dimensional measures (see Table S6 in the
online supplement), completely unadjusted POUD was associated with all validators except worst pain; DSM-5 dimensional POUD had significantly stronger associations than completely unadjusted POUD with all validators. For binary measures (see Table S7 in the
online supplement), completely unadjusted POUD at the ≥2 and ≥4 thresholds was associated with six and seven validators, respectively. DSM-5 POUD at the ≥2 threshold had significantly stronger associations than completely unadjusted POUD with eight validators, and at the ≥4 threshold, with three validators.
Discussion
DSM-5 criteria for substance use disorders have been extensively validated for illicit drugs, including heroin (
25), but not for opioids prescribed for pain. The FDA required a study to develop and validate a standardized measure of POUD that incorporated patient-oriented information on the patient’s pain relief/therapeutic intent in evaluating POUD criteria (
15). In the instrument developed for this purpose (
15), the PRISM-5-OP, pain-adjusted POUD had significantly stronger associations than DSM-5 POUD with 10 convergent and discriminant validators, suggesting greater validity of the pain-adjusted POUD measures that differentiated pain relief/therapeutic intent from other motives, including externalizing motives (e.g., to get high) and internalizing motives (e.g., to cope with negative or anxious affect).
Previous studies of POUD/addiction have produced widely varying prevalence estimates (
9,
16,
17,
24), attributed to methodological inconsistencies (
9). In our study, different POUD operationalizations substantially changed prevalences (range, 4%–42.8% among pain clinic patients), supporting previous concerns that inconsistent concepts or nonstandardized measures artifactually influenced rates (
15). Our prevalence of pain-adjusted POUD was lower than the DSM-5 approach among pain patients; nevertheless, this prevalence was greater than the prevalence of addiction found in one-third of the studies in a widely cited review (
17). Thus, while confusion between physiological dependence and addiction/POUD (
11) may lead to expectations of much higher POUD prevalence than what we found, our prevalence of pain-adjusted POUD is within the range reported in previous research. Additionally, the pain-adjusted POUD measures had the strongest associations with the validators, supporting the validity of estimates in the 4%–10% range. Although rates of U.S. opioid prescriptions have declined, they remain high (>142 million in 2020 [
3]). Thus, even low-end POUD prevalence estimates represent a substantial burden in the overall population (
73). In addition, although high-dose opioid use, regardless of the motivation, is a major risk factor for opioid overdose, especially among pain patients who escalate their dosage, quantification of actual increases in risk due to POUD among pain patients who are opioid users remains surprisingly unclear; studies are needed to clarify this relationship.
For the DSM-5 behavioral/subjective POUD criteria, the PRISM-5-OP differentiation between pain relief and nonindicated motives could help guide clinician approaches when discussing treatment options. Patients using prescribed opioids for a high or to ameliorate depression or anxiety might benefit from a different presentation than patients whose behavioral/subjective POUD criteria are entirely for pain-relief reasons, even if the proffered treatment is the same in either case—for example, nonpharmacologic treatments, or buprenorphine, which may provide safety and analgesia benefits for those transitioning off opioids (
74) and assist in treating internalizing psychiatric comorbidity (
75–
77). In individuals with pain conditions, mood/anxiety and pain are highly related, so it can be unclear, even to the patient, what is driving medication-seeking, since chronic opioid use, regardless of the reasons, has an impact on mood, anxiety, and the functioning of the endogenous opioid system, which is involved in the regulation and functioning of multiple systems (
78). Therefore, while acknowledging patients’ perspectives may help engage them in the discussion about treatment options, education about the fact that taking opioid medications to relieve emotional pain or dysphoria is a clear pathway to the development of OUD is also important. We note that others have also called for measures of prescription opioid problems that differentiate pain relief from other motives (
20,
79). Research is needed to investigate the clinical utility of this differentiation.
Our findings suggest an appreciable burden of POUD in pain patients, even after incorporating the pain adjustments that lowered prevalence. A larger ongoing PMR study using the PRISM-5-OP will provide more precise estimates of prevalence and incidence among patients using opioids for chronic pain (
15). In addition, our study could not address the reliability or validity of the alternative syndrome described as “complex persistent opioid dependence” (
34–
36). Such studies should be conducted when a measure of complex persistent opioid dependence is developed.
Several limitations of this study should be noted. First, stigma or social desirability bias could affect responses about POUD criteria. However, among substance treatment patients, similar patterns of pain-adjusted and DSM-5 measures (see Table S5 in the
online supplement) show that most of these patients reported using prescribed opioids for nontherapeutic reasons, while among pain clinic patients, responses indicated considerable therapeutic intent/pain relief. The consistent evidence from 10 external validators suggests that pain patient responses were truthful, since a complex set of fabrications would be needed to produce consistent results across all 10 validators. Second, patients were recruited from only two types of settings. Studies in primary care and mental health care settings (where opioid prescriptions are common [
80,
81]) would be useful. Third, the PRISM-5-OP did not include pain adjustments for cannabis, which is increasingly legal for treating pain (
82). Such adjustments could prove useful and should be studied in the future. Fourth, patients with dysphoria could attribute their dysphoria to pain, possibly leading to overadjustment. Investigation of this issue was outside the scope of the present study, but should be done in the future. Finally, the study design did not include prospective validation, but future studies should undertake this next logical step. Identifying POUD in chronic pain patients using opioids is an inherently difficult problem, so multiple studies are needed.
Chronic opioid use is not indicated for most pain conditions, so clinicians should always seek alternatives to opioids for chronic pain. However, given the continuing high numbers of opioid prescriptions, measures enabling study of risk factors for and consequences of POUD remain important. We found that the PRISM-5-OP measures of pain-adjusted POUD measures were reliable and more valid than standard DSM-5 concepts of substance use disorder. However, many external validators were associated with DSM-5 as well as pain-adjusted POUD measures. Studies should report both sets of results and further investigate differences between pain-adjusted and DSM-5 POUD and their consequences.