Validity and Utility of Four Pain Intensity Measures for Use in International Research

Correspondence: Pramote Euasobhon Department of Anesthesiology, Faculty of Medicine, Siriraj Hospital, Mahidol University, 2 Wanglang Road, Bangkoknoi, Bangkok, 10700, Thailand, Phone: Tel +66 2419 7995, Fax: Fax +66 2411 3256, Email pramoteo@hotmail.com

Received 2021 Jan 22; Accepted 2021 Mar 15. Copyright © 2021 Atisook et al.

This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution – Non Commercial (unported, v3.0) License (http://creativecommons.org/licenses/by-nc/3.0/). By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms (https://www.dovepress.com/terms.php).

Abstract

Background

The majority of previous research that has examined the validity of pain intensity rating scales has been conducted in western and developed countries. Research to evaluate the generalizability of previous findings in non-developed countries is necessary for identifying the scales that are most appropriate for use in international research.

Purpose

The aims of the current study were to (1) evaluate the validity and utility of four commonly used measures of pain intensity in a sample of patients with chronic pain from Thailand and (2) compare findings in the current sample with published findings from research conducted in other countries, in order to identify the measure or measures which might be most appropriate for cross-country research.

Methods

Three hundred and sixty patients with chronic pain seen in a hospital in Bangkok, Thailand, were asked to rate their current pain and average, worst, and least pain intensity in the past week using the Visual Analogue Scale (VAS), 6-point Verbal Rating Scale (VRS-6), 0–10 Numerical Rating Scale (NRS-11), and Faces Pain Scale-Revised (FPS-R). We evaluated the utility and validity of each measure by examining the (1) rates of correct responding and (2) association of each measure with a factor score representing the variance shared across measures, respectively. We also evaluated the associations between incorrect response rates and both age and education level, and then compared the findings from this sample with the findings from research conducted in other countries.

Results

The results indicated support for the validity of all measures among participants who were able to use these measures. However, there was variability in the incorrect response rates, with the VAS having the highest (45%) and the NRS-11 having the lowest (15%) incorrect response rates. The VAS was also the least preferred (9%) and the NRS-11 the most preferred (52%) scale. Education and age were significantly associated with incorrect response rates, and education level with scale preference.

Conclusion

The findings indicate that the NRS-11 has the most utility in our sample of Thai individuals with chronic pain. However, when considered in light of the findings from other countries, the results of this study suggest that the FPS-R may have the most utility for use in cross-cultural and international research. Research in additional samples in developing countries is needed to evaluate the generalizability of the current findings.

Keywords: cross-country comparison, Face Pain Scale-Revised, Numerical Rating Scale, pain assessment, pain scale preference, Verbal Rating Scale, Visual Analogue Scale

Plain Language Summary

In order to increase our understanding of pain and its treatment, we must be able to measure it. Although a great deal is known about the strengths and weaknesses of the most commonly used measures of pain intensity, most of our understanding comes from research conducted in Western and developed countries. In order to understand which pain intensity measures are most useful among people everywhere, research on pain measures must be conducted in people with pain from non-Western and developing countries. This study found that 0–10 Numerical Rating Scale has the most utility in our sample of Thai individuals with chronic pain. However, when considered in light of the findings from other countries, the results of this study suggest that the FPS-R may have the most utility for use in cross-cultural and international research.

Introduction

Pain relief and treatment for pain-related conditions are arguably the most common reasons people have for seeking health care. 1 , 2 Effective pain management remains a critical health concern worldwide. 3 To be able to evaluate the efficacy of pain treatments in research – including in multisite research conducted in different countries – pain must be assessed using measures with established reliability and validity.

Pain intensity is the pain domain most often assessed in clinical and research settings. 4 , 5 Knowledge regarding the psychometric properties of different pain intensity measures across different patient populations is therefore critical for advancing the treatment and scientific understanding of pain. The most commonly used measures of pain intensity are the Visual Analogue Scale (VAS), Verbal Rating Scale (VRS), Numerical Rating Scale (NRS), and Faces Pain Scale-Revised (FPS-R). 6–8 Each of these scales has evidence supporting their test-retest reliability, and ratings of these scales tend to correlate strongly with each other, supporting their validity. 9 , 10

However, each of these scales also has demonstrated strengths and weaknesses in different populations. 6 , 11 Although the VAS has the potential to be more sensitive to small changes in pain, 4 , 7 , 12 this scale has also been found to be more challenging to comprehend and use by elderly individuals. 7 , 12 The NRS and VRS have been found to be easier to administer than the VAS, and to have lower incorrect response rates. 13 However, a recent study in a sample of patients with chronic pain from Nepal reported higher incorrect response rates for the NRS than the VRS, VAS, or FPS-R. This finding was attributed to limited education of the study participants, as well as the fact that numbers are rarely used to describe magnitude in Nepalese culture. 8 Thus, preliminary findings suggest that both demographic and cultural factors appear to impact the validity and utility of pain measures; no single measure has been found to be universally superior across patient populations.

A significant limitation of the research that has been published to date on this topic is that most of it has been conducted in western populations. 6 , 14–18 Based on the findings from this research, consensus groups have identified the NRS – specifically, the 11-point 0–10 NRS (also referred to as the NRS-11) – as the best measure of pain intensity in most situations and with most populations. 4 However, a systematic review by Hjermstad and colleagues concluded that the VRS was preferred over the NRS by those with less education and the elderly, while people with more education and younger individuals preferred the NRS. 15 In addition, although studies conducted in China and Nepal have demonstrated good reliability and validity for all four scales among those who are able to use these scales without errors, the most preferred measure of pain intensity in individuals from these countries has been shown to be the FPS-R. 7 , 8

Thus, in large part because of the lack of research on pain measures in non-Western countries, it is not yet possible to draw strong conclusions regarding which measure or measures should be used when conducting cross-country or cross-cultural research. Additional research from a variety of countries is needed to help us understand which measure(s) are most universally valid and useful, especially when the goal is to compare research findings across different countries. One group of patients that has not yet been studied in this context is patients from Thailand.

Given these considerations, the aims of the current study were to (1) evaluate the validity and utility of four commonly used measures of pain intensity in a sample of patients with chronic pain from Thailand and (2) compare findings in the current sample with those from other countries, to help identify the measure or measures which might be most appropriate for cross-country research. Based on the available evidence, cited above, we hypothesized that the findings would support the validity of all four rating scales as measures of pain intensity among those who are able to use all measures in the study sample. With respect to scale utility, and given previous research findings in other samples of individuals with pain, we hypothesized that older participants and those with less education would evidence higher rates of incorrect responding than younger participants and those with more education in our sample. We also hypothesized greater preferences for the categorical measures (eg, FPS-R and VRS) than the VAS.

Materials and Methods

Subjects and Procedures

From August to November 2018, 360 patients with chronic pain seen at the outpatient pain clinic at the Siriraj hospital in Bangkok, Thailand, were enrolled in this cross-sectional study. Potential participants were notified about the study via a written pamphlet that was posted on the notice board in the pain clinic. In order to participate in the study, the potential participants had to be aged 18 years or older, report having experienced pain in at least one part of their body for more than three months, and be able to understand and speak Thai. Patients with pain due to cancer and those with a history of cognitive impairment, visual impairment, or significant psychiatric illness per chart review were excluded.

Once participants were enrolled, they were administered a questionnaire asking them to provide demographic data (ie, gender, age, marital status, employment status, place of residence [ie, metropolitan or provincial areas] and educational level). The participants were then asked to rate four domains of pain intensity (worst pain, average pain, and least pain in the past week, and current pain) using four different pain intensity rating scales; the VAS, VRS-6, NRS-11, and FPS-R. As noted in the Introduction section, all four of these scales have been shown to be reliable and valid in previous research, and are commonly used in clinical and research settings. 5 , 6 , 10 , 12 , 13 , 15 The participants were provided verbal instructions on how to use each scale, and the instructions were repeated up to three times if requested by the participants. To control for possible order effects, the four scales were administered randomly using a Latin square design, such that each type of scale was presented in each possible order (first, second, third, or fourth) on a separate sheet. Participants were asked to rate their worst pain, average pain, and least pain in the past week, and current pain, using each scale before moving on to the next.

The participants were then asked to select the scale that they would prefer to use when asked to rate their pain intensity in the future. While the participants rated their pain intensity using each scale, the research staff did not provide any specific guidance or corrections, even if the participant was observed to make an error (eg, rating least pain as being more intense than worst pain). However, incorrect responses were later coded into one of seven categories: (1) providing two or more responses on the same rating scale; (2) providing a range of responses rather than a single response; (3) being unable to provide a response, even after three explanations; (4) marking between two descriptive terms on the VRS-6 or between two faces on the FPS-R; (5) rating worst pain as lower in intensity than average pain; (6) rating worst pain as lower in intensity than the least pain; or (7) rating the least pain as greater than average pain. This study was approved by the Siriraj Institutional Review Board, Faculty of medicine Siriraj hospital, Mahidol University, Bangkok, Thailand (SI429/2018). All participants signed informed consent forms before any data were collected. This study was conducted in accordance with the Declaration of Helsinki.

Measures

Visual Analogue Scale (VAS)

The VAS is a continuous scale, usually presented as a 10-centimeter horizontal line, anchored by 2 verbal descriptors reflecting no pain and extreme pain. In this study, the extreme descriptor used was “Worst pain.” 9 , 19 This endpoint (which was also used as the endpoint for the NRS-11 in this study) has a literal translation of “most” or “worst” or “maximum” pain, and was translated from that used for the pain intensity ratings from the Brief Pain Inventory (ie, “Pain as bad as it could be”). It has also been deemed to be the cultural equivalent to the Brief Pain Inventory endpoint. 20 With the VAS, respondents are asked to make a mark on the line that represents their pain intensity. The distance, in centimeters (ie, 0.0 to 10.0) from the “No pain” end represents the respondent’s pain intensity score.

6-Point Verbal Rating Scale (VRS-6)

The 6-level VRS is a widely used measure of pain intensity that has been validated in 15 languages. 21 , 22 The six English descriptors used to represent each level of pain intensity are “None,” “Very mild,” “Mild,” “Moderate,” “Severe,” and “Very severe.” For the purpose of this study, we translated these descriptors into Thai. We then linked a specific number (ie, 0, 1, 2, 3, 4, and 5) to each descriptor. The number associated with the descriptor chosen by the respondent was their VRS-6 score.

Numerical Rating Scale-11 (NRS-11)

The 0–10 NRS (also referred to as the NRS-11) is one of the most widely used measures of pain intensity in research and clinical practice. 23 With the NRS-11, respondents are asked to choose the single number that best represents their pain intensity, on a 0 (“No Pain”) to 10 (“Worst pain”) scale.

Faces Pain Scale-Revised (FPS-R)

The original Faces Pain Scale (FPS), developed by Bieri and colleagues, 24 was developed to assess pain intensity in children, who were thought to have difficulty using the most commonly used pain intensity measures such as the VAS or NRSs. In 2001, the FPS was revised to make it more suitable for use with the metric scoring 0–10 scale (FPS-R), using visual depictions of faces to represent increasing levels of pain intensity along a 6-face continuum. With the FPS-R, respondents are asked to select the face that best represents their pain intensity. Each face is associated with a number (ie, 0, 2, 4, 6, 8 and 10), 25 and the respondent’s pain intensity score is the number associated with the face that was chosen.

Statistical Analysis

We first computed descriptive statistics for the demographic variables, pain history variables, and pain ratings (means and standard deviations for continuous variables, and number and percent for categorical variables) to describe the sample. Next, to evaluate the convergent validity of the scales (ie, the association between each scale and a composite score of all measures), we conducted a principal components analysis. 6 , 8 , 12 , 26 We determined the number of factors that emerged from the factor analysis using a scree test, and examined the loadings of the factor(s) that emerged as an indication of scale validity. We then computed the incorrect responses and preference rates for each measure, and evaluated the associations between each of these and age (dichotomized as younger [≤60 years old] versus older [>60 years old]) and education level (dichotomized as lower education [≤12 years of education or junior high school] and higher education [>12 years of education]), using chi-square analyses. Data analyses were performed using PASW Statistics version 16 (SPSS, Inc., Chicago, IL, USA).

Results

Demographic Data and Pain Characteristics

A summary of the demographic information of the sample is presented in Table 1 . Sixty-six percent (N = 293) of the sample were female. Mean age was 53 years (range, 36 to 69 years). A plurality of the participants had neuropathic pain (43%, N=156). The second most common pain problem was muscle/tendon pain (27%, N= 92). Nearly half of participants (49%, N=175) had more pain with movement and 62% (N=223) reported that nothing relieved them from pain. The means and standard deviations of the pain rating scores for each domain of pain intensity (worst, least, average and current pain) are presented in Table 2 .

Table 1

Descriptive Information About the Study Participants (N = 360)

VariableMean±SDN (%)
Gender: Female 239 (66%)
Age (years)53.35±16.13
Marital status
Married 208 (58%)
Unmarried 96 (27%)
Separated/divorced/widowed 54 (15%)
Did not respond to question about marital status 2 (0.5%)
Employed (full or part time) 236 (66%)
Place of residence
Metropolitan area 183 (51%)
Provincial areas 177 (49%)
Highest level of education
No formal education 12 (3%)
Primary School 92 (26%)
Junior High School 35 (10%)
Senior High School 47 (13%)
Vocational Certificate 25 (7%)
Bachelor degree or above 148 (41%)
Did not provide an answer 1 (0.3%)

Table 2

Means and Standard Deviations of the Pain Ratings (N=360)

VariableMean±SD
Worst pain intensity in the past week
VAS5.52±2.65
VRS-63.15±1.09
NRS-115.81±2.45
FPS-R5.62±2.47
Least pain intensity in the past week
VAS2.81±2.30
VRS-61.99±1.15
NRS-112.97±2.19
FPS-R2.66±2.08
Average pain intensity in the past week
VAS4.37±2.40
VRS-62.75±1.03
NRS-114.57±2.25
FPS-R4.30±2.28
Current pain
VAS3.60±2.72
VRS-62.29±1.30
NRS-113.78±2.61
FPS-R3.58±2.58

Note: FPS-R, Face Pain Scale-Revised (0–10); NRS-11, 11-point Numerical Rating Scale (0–10); VAS, Visual Analogue Scale (possible range, 0–10); VRS-6, 6-point Verbal Rating Scale (0–5).

Scale Inter-Correlations and Scale Validity

Very strong correlations were found among the four scales for each pain intensity domain (r’s = 0.72 to 0.91, see Table 3 ). The results of the principal components analyses of the four pain scales provided very strong support that they assessed the same underlying domain, given the very high first eigenvalue (greater than 3.00 in every case) and very low second eigenvalue (see Table 4 ). In addition, all of the scales evidenced a very strong loading (>0.9) on the single component that emerged. The NRS-11 had the highest loading on the components representing worst (0.94), average (0.94), and current pain (0.97). The VAS had the second highest loading.

Table 3

Inter-Scale Correlation Coefficients

Scale CorrelationPearson Correlation Coefficient (r)
Worst PainLeast PainAverage PainCurrent Pain
VASVRS-60.80**0.72**0.80**0.82**
FPS-R0.80**0.76**0.80**0.84**
NRS-110.86**0.83**0.88**0.91**
VRS-6FPS-R0.77**0.74**0.75**0.80**
NRS-110.82**0.78**0.80**0.86**
NRS-11FPS-R0.82**0.80**0.78**0.87**

Abbreviations: FPS-R, Face Pain Scale-Revised; NRS, Numerical Rating Scale; VAS, Visual Analogue Scale; VRS, Verbal Rating Scale.

Table 4

Component Loadings from the Principal Components Analyses of the Four Rating Scales

ScaleWorst PainLeast PainAverage PainCurrent Pain
VAS0.940.920.940.95
VRS-60.910.890.900.92
NRS-110.940.930.940.97
FPS-R0.910.910.900.93
First two eigenvalues3.43, 0.243.32, 0.283.40, 0.263.55, 0.21

Abbreviations: FPS-R, Face Pain Scale-Revised; NRS, Numerical Rating Scale; VAS, Visual Analogue Scale; VRS, Verbal Rating Scale.

Incorrect Responses

The rates of incorrect responding are presented in Table 5 by type of incorrect response. Two categories of incorrect responses (ie, providing a response that was between two response categories and providing a range of responses instead of a single response) were not made by any participant. As can be seen, the VAS had the highest rate of incorrect responding (N=75; 45%). The total number of incorrect responses to the NRS-11, FPS-R and VRS-6 were all less than 40, with the NRS-11 evidencing the lowest rate of incorrect responding (N=25; 15%). The most common error across all of the scales was rating average pain as higher than worst pain (50%), and the second most common error was rating least pain as being higher than average pain (25%).

Table 5

Rates of Incorrect Responding for the Rating Scales

ScaleTypes of Incorrect Response
More Than 1 Answer N (%)Average > Max N (%)Min > Max N (%)Min > Average N (%)No Answer N (%)Total N (%)
VAS2 (1%)45 (27%)11 (7%)17 (10%)0 (0%)75 (45%)
VRS-65 (3%)14 (8%)7 (4%)11 (7%)0 (0%)37 (22%)
NRS-119 (5%)11 (7%)2 (1%)3 (2%)0 (0%)25 (15%)
FPS-R4 (2%)13 (8%)1 (1%)11 (7%)1 (1%)30 (18%)
Total (all scales)20 (12%)83 (50%)21 (13%)42 (25%)1 (1%)167 (100%)

Abbreviations: FPS-R, Face Pain Scale-Revised; NRS, Numerical Rating Scale; VAS, Visual Analogue Scale; VRS, Verbal Rating Scale.

Factors Associated with Incorrect Responding

Older respondents and those with lower education levels had the highest rates of incorrect responding (see Table 6 ); the effect of age was statistically significant for the VRS-6 (P = 0.044), and the effect of education was statistically significant for the VAS (P = 0.001).

Table 6

Associations Between Age and Educational Group and Incorrect Response Rates

VariableNumber of Participants with Incorrect Responses (%)
VASVRS-6NRS-11FPS-R
Age (N = 360)
≤60 years old (n=232)17 (7%)16 (7%)16 (7%)15 (7%)
>60 years old (n=128)16 (13%)17 (13%)9 (7%)14 (11%)
P-value0.1040.0440.9620.136
Education level (N = 359)*
Lower education (n=139)22 (16%)17 (12%)13 (9%)13 (9%)
Higher education (n=220)11 (5%)16 (7%)12 (6%)16 (7%)
P-value0.0010.1130.1580.481

Notes: *Lower education, ≤12 years of education or junior high school; higher education, >12 years of education.

Abbreviations: FPS-R, Face Pain Scale-Revised; NRS, Numerical Rating Scale; VAS, Visual Analogue Scale; VRS, Verbal Rating Scale.

Scale Preferences

Of the 360 participants, 186 (52%) preferred NRS-11 followed by FPS-R (20%), VRS-6 (19%) and VAS (9%), respectively (see Figure 1 and Table 7 ). The results regarding the associations between age and educational level and scale preference are presented in Table 7 . As can be seen, significant effects were observed for education level; although those with higher and lower education both preferred the NRS-11 over the other scales, this preference rate was greater for those with higher education (56%) than lower education (44%). This difference may have been due to the marked difference in preference rates between these groups for the FPS-R (ie, 14% and 30% for those with higher versus lower education, respectively). This difference in pain scale preference as a function of education level was statistically significant (P = 0.003). No statistically significant differences in preference rates were noted in terms of age (P = 0.523).

Table 7

Associations Between Age and Educational Group and Scale Preference

FactorMeasure
VAS N (%)VRS-6 N (%)NRS-11 N (%)FPS-R N (%)P-value
Total preference (n=360)34 (9%)68 (19%)186 (52%)72 (20%)
Age (n=360) 0.523
≤60 years old (N=232)25 (11%)42 (18%)122 (53%)43 (18%)
>60 years old (N=128)9 (7%)26 (20%)64 (50%)29 (23%)
Education level (n=359)* 0.003
Lower education (N=139)10 (7%)27 (19%)61 (44%)41 (30%)
Higher education (N=220)24 (11%)41 (19%)124 (56%)31 (14%)

Notes: *Lower education, ≤12 years of education or junior high school; higher education, >12 years of education.

Abbreviations: FPS-R, Face Pain Scale-Revised; NRS, Numerical Rating Scale; VAS, Visual Analogue Scale; VRS, Verbal Rating Scale.

An external file that holds a picture, illustration, etc. Object name is JPR-14-1129-g0001.jpg

Number of participants who prefer each pain scale (N=360).

Abbreviations: FPS-R, Face Pain Scale-Revised; NRS, Numerical Rating Scale; VAS, Visual Analogue Scale; VRS, Verbal Rating Scale.

Discussion

The key findings from this study were that (1) although all four rating scales are valid measures of pain intensity among those individuals who are able to use them, (2) significant differences in incorrect response rates and preferences emerged for the different rating scales, (3) differences in rates of incorrect responding differed as a function of both age and education level, and (4) differences in preference rates differed as a function of education level. These findings have important implications for determining which measure(s) might be most useful when measuring pain intensity both locally (ie, in this case, in Thailand) and internationally.

Scale Validity

The findings provide strong support for the validity of all four rating scales, at least among the individuals who are able to use each measure. This finding is consistent with those from some other studies that have evaluated the validity of these scales in different populations living in different countries (eg, the USA, China, Nepal, Nigeria, Portugal and Spain). 7 , 8 , 14–18 , 27 Thus, if clinicians and researchers have the ability to ensure that their patients or research study participants are able to use the measure chosen, each one of the measures could potentially provide valid data regarding the magnitude of felt pain. However, not every clinician or investigator has the resources to evaluate a patient’s ability to use pain scales. In such cases, it would make sense to choose the rating scales that tend to have the lowest incorrect response rates and are most preferred by the population being treated or studied.

Incorrect Response Rates

In a sample of individuals with chronic pain from Nepal, the NRS-11 evidenced the highest rate of incorrect responding, followed by the VAS. 8 A similar finding emerged in studies conducted in China and Canada. 7 , 28 Although Thailand is a developing non-Western country with relatively low literacy rates, especially among individuals living in rural areas, the findings from the current study showing the largest incorrect response rates for the VAS and least incorrect response rates for the NRS-11 are similar to those from more developed countries. 12 , 26 , 28 , 29 It is possible that the relatively high education level in the current sample, which is higher than that of the Thai population in general, might explain the similarity of findings to those from developed countries. 12 , 26 , 28–30

Previous research indicates that when significant effects are found, older individuals, individuals with cognitive impairments, and individuals with lower education levels have higher rates of incorrect responding to pain measures. 8 , 12 , 29 This finding appears to more consistent for the VAS than for categorical (ie, FPS-R and VRS) and numerical scales. 28 , 31 These findings were generally replicated in our sample, with older participants and participants with lower education levels evidencing higher rates of incorrect responding to the VAS, although these differences were only statistically significant for education level. Also, in our sample, a significant effect for education level on the incorrect response rates to the VRS-6 emerged. Overall, these findings support the choice of the NRS-11 or the FPS-R for use in Thai samples who are treated or recruited for research from a large metropolitan city (ie, Bangkok).

Scale Preferences

In this study, NRS-11 was the most preferred scale, followed by the FPS-R. Also, only educational level was significantly associated with scale preference. Our finding of a significant association between education level and preference for the NRS-11 (with those having higher education preferring the NRS-11 more than other scales) has also been found by researchers in other (but mostly developed) countries. 12 , 26 , 28 , 31–34 However, in general, research among individuals from developing countries or those with populations with lower education levels show that these individuals tend to prefer categorical scales such as the FPS-R, 7 , 8 , 33–37 even though this scale was originally developed for use in children. One generally consistent finding across all populations is the low preference rate for the VAS. 8 , 12 , 29

No significant differences were found in scale preference in relation to age in our sample. Although previous studies found that the older patients (65 years) and the oldest patients age groups (more than 75 years) prefer VRS-6 to all other scales, 12 , 34 , 38 older participants in this study preferred NRS-11 over the other scales across age groups. The reason for this discrepancy in findings is not entirely clear. It is possible that it might be related to one or more differences in the samples used in different studies (eg, effects of country of origin, effects of education levels). Additional research using samples of individuals with pain from Thailand and other countries is needed to determine which factors are most consistently associated with scale preferences.

Implications for Between-Country/International Research

As researchers continue to seek to identify the factors that influence pain and its impact, an important and as yet understudied area is the extent to which the findings from one country and culture do (or do not) generalize to other countries and cultures. 39 , 40 Much of the pain research that has been and is being conducted has come from studies performed in western countries. Determining the extent to which research findings are “universal” and generalize to individuals from other countries and cultures is very important. Findings regarding the psychometric properties of pain measures that replicate across countries support the generalization of these findings across languages and cultures, and therefore provide evidence that can be used for selecting measures when cross-cultural or cross-country comparisons are a research goal. On the other hand, findings that are unique to specific countries might be less useful for selecting measures in cross-cultural research. Critically, in order to evaluate the cross-country relevance of findings, reliable and valid measures of key pain-related constructs that can be used in populations from different countries are needed. Arguably, measures of pain intensity are among the most important of these.

Based on the current findings, and in light of previous research, one preliminary conclusion that might be drawn is that the FPS-R appears to be the most viable choice for use in cross-country research. Like all of the other measures evaluated in this study, the FPS-R has been shown to be valid in every study that has evaluated this measurement property. 6 , 25 , 41 , 42 In addition, the incorrect response rates of the FPS-R are consistently low across all populations in which these rates have been evaluated. 7 , 37 Critically, this cannot be said of every measure, including the NRS-11, 8 perhaps, in part, because of the negative impact that pain can have on an individual’s “number sense.” 43–45 Although the FPS-R is not the measure that tends to be preferred over the others, 15 , 26 , 29 the lack of preference for the FPS-R has not translated to a lack of validity or a lack of ability to use the measure. Arguably, validity and ability to use a measure without errors are more important than preference when selecting a measure to use in clinical or research settings. Finally, the FPS-R has already been translated into over sixty-nine languages. 46 Given these considerations, our current recommendation is for researchers is to consider either (1) using the FPS-R as their primary measure of pain intensity if that measure is deemed appropriate for helping them address their study question and hypotheses, or (2) adding the FPS-R as a secondary outcome measure in their research, in order to facilitate a standardization that would allow for between-country and between-study comparisons of study findings.

Limitations

This study has several limitations that should be considered when interpreting the results. First, it should be kept in mind that although the majority of the Thai population has finished only junior high school (12 years of education), 30 most of the participants in the current study had completed a bachelor's degree or had an even higher education level. Thus, it is possible that the findings from this study may not generalize to some subgroups of the Thai population, perhaps especially those living in rural areas where education is more limited. Replication of the study findings in samples of individuals who live in rural areas of Thailand would be helpful to understand the generalizability of the study findings. Second, we did not assess cognitive function in the sample, including in the group that was older. It is possible that a subgroup of the study sample may have an undiagnosed cognitive impairment. The role that this factor plays in the study findings could therefore not be determined. Future researchers should include measures of cognitive function, if possible, to determine the extent to which factor might explain the age effects so often observed on incorrect response and preference rates. Third, we did not assess the participants’ previous experience with and knowledge about the rating scales evaluated in this study. Li and colleagues, 7 for example, found that the reliability of pain rating scales improved with more exposure and practice. It is possible, for example, that one reason the findings from the current study largely replicated those from developed countries is that the NRS-11 is used so frequently in the hospital where the participants in the current study were recruited. Researchers in this area would do well to measure rating scale familiarity as a potential predictor of scale validity and utility.

Summary and Conclusion

The findings from this study support the validity of four common rating scales of pain intensity for use in Thai samples. We found support for the validity of the NRS-11 in individuals with pain from Thailand. Considerations of the current findings, in light of the findings from studies from other countries, suggest the possibility that the FPS-R might be the best measure for use in international studies in order to facilitate comparisons of findings between countries. Additional research is needed to evaluate the generalizability of the current findings with respect to individuals from rural areas of Thailand.

Acknowledgments

The authors gratefully acknowledge the patients who generously agreed to participate in this study, Ms. Julaporn Pooliam for assistance with the statistical analysis, and Ms. Nattaya Bunwatsana for general research assistance.

Funding Statement

This research project was supported by Siriraj research fund, Faculty of Medicine Siriraj Hospital, Mahidol University, Grant number (IO) R016131053.

Abbreviations

FPS-R, Face Pain Scale-Revised; NRS, Numerical Rating Scale; VAS, Visual Analogue Scale; VRS, Verbal Rating Scale; SPSS, Statistical Package for the Social Sciences.

Disclosure

Dr Mark P Jensen reports grants from Zogenix, Inc., outside the submitted work. All authors declare no other personal or professional conflicts of interest, and no other financial support from the companies that produce and/or distribute the drugs, devices, or materials described in this report.

References

1. St Sauver JL, Warner DO, Yawn BP, et al. Why patients visit their doctors: assessing the most prevalent conditions in a defined American population . Mayo Clin Proc . 2013; 88 ( 1 ):56–67. doi: 10.1016/j.mayocp.2012.08.020 [PMC free article] [PubMed] [CrossRef] [Google Scholar]

2. Mantyselka P, Kumpusalo E, Ahonen R, et al. Pain as a reason to visit the doctor: a study in Finnish primary health care . Pain . 2001; 89 ( 2–3 ):175–180. doi: 10.1016/S0304-3959(00)00361-4 [PubMed] [CrossRef] [Google Scholar]

3. Dahl JL. Effective pain management in terminal care . Clin Geriatr Med . 1996; 12 ( 2 ):279–300. doi: 10.1016/S0749-0690(18)30227-1 [PubMed] [CrossRef] [Google Scholar]

4. Dworkin RH, Turk DC, Farrar JT, et al. Core outcome measures for chronic pain clinical trials: IMMPACT recommendations . Pain . 2005; 113 ( 1–2 ):9–19. doi: 10.1016/j.pain.2004.09.012 [PubMed] [CrossRef] [Google Scholar]

5. Strong J, van Griensven H. Pain assessment and measurement. In: van Griensven H, Strong J, Unruh A, editors. Pain: A Textbook for Health Professionals . 2nd ed. London: Churchill Livingstone; 2014:91-113. [Google Scholar]

6. Ferreira-Valente MA, Pais-Ribeiro JL, Jensen MP. Validity of four pain intensity rating scales . Pain . 2011; 152 ( 10 ):2399–2404. doi: 10.1016/j.pain.2011.07.005 [PubMed] [CrossRef] [Google Scholar]

7. Li L, Liu X, Herr K. Postoperative pain intensity assessment: a comparison of four scales in Chinese adults . Pain Med . 2007; 8 ( 3 ):223–234. doi: 10.1111/j.1526-4637.2007.00296.x [PubMed] [CrossRef] [Google Scholar]

8. Pathak A, Sharma S, Jensen M. The utility and validity of pain intensity rating scales for use in developing countries . Pain Rep . 2018; 3 ( 5 ). doi: 10.1097/PR9.0000000000000672 [PMC free article] [PubMed] [CrossRef] [Google Scholar]

9. Hawker GA, Mian S, Kendzerska T, French M. Measures of adult pain: Visual Analog Scale for Pain (VAS Pain), Numeric Rating Scale for Pain (NRS Pain), McGill Pain Questionnaire (MPQ), Short-Form McGill Pain Questionnaire (SF-MPQ), Chronic Pain Grade Scale (CPGS), Short Form-36 Bodily Pain Scale (SF-36 BPS), and Measure of Intermittent and Constant Osteoarthritis Pain (ICOAP) . Arthritis Care Res (Hoboken) . 2011; 63 ( 11 ):240–252. doi: 10.1002/acr.20345 [PubMed] [CrossRef] [Google Scholar]

10. Sendlbeck M, Araujo EG, Schett G, Englbrecht M. Psychometric properties of three single-item pain scales in patients with rheumatoid arthritis seen during routine clinical care: a comparative perspective on construct validity, reproducibility and internal responsiveness . RMD Open . 2015; 1 ( 1 ):e000140. doi: 10.1136/rmdopen-2015-000140 [PMC free article] [PubMed] [CrossRef] [Google Scholar]

11. Scher C, Meador L, Van Cleave JH, Reid MC. Moving beyond pain as the fifth vital sign and patient satisfaction scores to improve pain care in the 21st century . Pain Manag Nurse . 2018; 19 ( 2 ):125–129. doi: 10.1016/j.pmn.2017.10.010 [PMC free article] [PubMed] [CrossRef] [Google Scholar]

12. Peters ML, Patijn J, Lamé I. Pain assessment in younger and older pain patients: psychometric properties and patient preference of five commonly used measures of pain intensity . Pain Med . 2007; 8 ( 7 ):601–610. doi: 10.1111/j.1526-4637.2007.00311.x [PubMed] [CrossRef] [Google Scholar]

13. Jensen MP, Karoly P, Braver S. The measurement of clinical pain intensity: a comparison of six methods . Pain . 1986; 27 ( 1 ):117–126. doi: 10.1016/0304-3959(86)90228-9 [PubMed] [CrossRef] [Google Scholar]

14. Farrar JT, Young JP Jr, LaMoreaux L, Werth JL, Poole RM. Clinical importance of changes in chronic pain intensity measured on an 11-point numerical pain rating scale . Pain . 2001; 94 ( 2 ):149–158. doi: 10.1016/S0304-3959(01)00349-9 [PubMed] [CrossRef] [Google Scholar]

15. Hjermstad MJ, Fayers PM, Haugen DF, et al. Studies comparing Numerical Rating Scales, Verbal Rating Scales, and Visual Analogue Scales for assessment of pain intensity in adults: a systematic literature review . J Pain Symptom Manage . 2011; 41 ( 6 ):1073–1093. doi: 10.1016/j.jpainsymman.2010.08.016 [PubMed] [CrossRef] [Google Scholar]

16. Dijkers M. Comparing quantification of pain severity by verbal rating and numeric rating scales . J Spinal Cord Med . 2010; 33 ( 3 ):232–242. doi: 10.1080/10790268.2010.11689700 [PMC free article] [PubMed] [CrossRef] [Google Scholar]

17. Jensen MP, Tomé-Pires C, Solé E, et al. Assessment of pain intensity in clinical trials: individual ratings vs composite scores . Pain Med . 2015; 16 ( 1 ):141–148. doi: 10.1111/pme.12588 [PubMed] [CrossRef] [Google Scholar]

18. Beaton DE, Bombardier C, Guillemin F, Ferraz MB. Guidelines for the process of cross-cultural adaptation of self-report measures . Spine . 2000; 25 ( 24 ):3186–3191. doi: 10.1097/00007632-200012150-00014 [PubMed] [CrossRef] [Google Scholar]

19. Jensen MP, Chen C, Brugger AM. Interpretation of visual analog scale ratings and change scores: a reanalysis of two clinical trials of postoperative pain . J Pain . 2003; 4 ( 7 ):407–414. doi: 10.1016/S1526-5900(03)00716-8 [PubMed] [CrossRef] [Google Scholar]

20. Chaudakshetrin P. Validation of the Thai version of Brief Pain Inventory (BPI-T) in cancer patients . J Med Assoc Thai . 2009; 92 ( 1 ):34–40. [PubMed] [Google Scholar]

21. Caraceni A, Cherny N, Fainsinger R, Kaasa S, Poulain P, Radbruch LDCF. Pain measurement tools and methods in clinical research in palliative care: recommendations of an Expert Working Group of the European Association of Palliative Care . J Pain Symptom Manage . 2002; 23 ( 3 ):239–255. doi: 10.1016/S0885-3924(01)00409-2 [PubMed] [CrossRef] [Google Scholar]

22. Jensen MP, Turner JA, Romano JM. What is the maximum number of levels needed in pain intensity measurement? Pain . 1994; 58 ( 3 ):387–392. doi: 10.1016/0304-3959(94)90133-3 [PubMed] [CrossRef] [Google Scholar]

23. Chien CW, Bagraith KS, Khan A, Deen M, Strong J. Comparative responsiveness of verbal and numerical rating scales to measure pain intensity in patients with chronic pain . J Pain . 2013; 14 ( 12 ):1653–1662. doi: 10.1016/j.jpain.2013.08.006 [PubMed] [CrossRef] [Google Scholar]

24. Bieri D, Reeve RA, Champion GD, Addicoat L, Ziegler JB. The Faces Pain Scale for the self-assessment of the severity of pain experienced by children: development, initial validation, and preliminary investigation for ratio scale properties . Pain . 1990; 41 ( 2 ):139–150. doi: 10.1016/0304-3959(90)90018-9 [PubMed] [CrossRef] [Google Scholar]

25. Hicks CL, von Baeyer CL, Spafford PA, van Korlaar I, Goodenough B. The Faces Pain Scale-Revised: toward a common metric in pediatric pain measurement . Pain . 2001; 93 ( 2 ):173–183. doi: 10.1016/S0304-3959(01)00314-1 [PubMed] [CrossRef] [Google Scholar]

26. Jensen MP, Castarlenas E, Roy R, et al. The utility and construct validity of four measures of pain intensity: results from a university-based study in Spain . Pain Med . 2019; 20 ( 12 ):2411–2420. doi: 10.1093/pm/pny319 [PubMed] [CrossRef] [Google Scholar]

27. Ibrahim AA, Akindele MO, Bello B, Kaka B. Translation, cross-cultural adaptation, and psychometric properties of the Hausa versions of the Numerical Pain Rating Scale and Global Rating of Change Scale in a low-literate population with chronic low back pain . Spine . 2020; 45 ( 8 ):E439–E447. doi: 10.1097/BRS.0000000000003306 [PubMed] [CrossRef] [Google Scholar]

28. Gagliese L, Weizblit N, Ellis W, Chan VW. The measurement of postoperative pain: a comparison of intensity scales in younger and older surgical patients . Pain . 2005; 117 ( 3 ):412–420. doi: 10.1016/j.pain.2005.07.004 [PubMed] [CrossRef] [Google Scholar]

29. Herr KA, Spratt K, Mobily PR, Richardson G. Pain intensity assessment in older adults: use of experimental pain to compare psychometric properties and usability of selected pain scales with younger adults . Clin J Pain . 2004; 20 ( 4 ):207–219. doi: 10.1097/00002508-200407000-00002 [PubMed] [CrossRef] [Google Scholar]

30. OECD, UNESCO. Education in Thailand: An OECD-UNESCO Perspective . Paris: OECD Publishing; 2016. [Google Scholar]

31. Bergh I, Sjöström B, Odén A, Steen B. An application of pain rating scales in geriatric patients . Aging (Milan, Italy) . 2000; 12 ( 5 ):380–387. doi: 10.1007/BF03339864 [PubMed] [CrossRef] [Google Scholar]

32. Göransson KE, Heilborn U, Selberg J, von Scheele S, Djärv T. Pain rating in the ED-a comparison between 2 scales in a Swedish hospital . Am J Emerg Med . 2015; 33 ( 3 ):419–422. doi: 10.1016/j.ajem.2014.12.069 [PubMed] [CrossRef] [Google Scholar]

33. Zalmay P, Williams ACDC. How do medical students use and understand pain rating scales? Scand J Pain . 2017; 15 ( 1 ):68–72. doi: 10.1016/j.sjpain.2016.12.007 [PubMed] [CrossRef] [Google Scholar]

34. Herr KA, Mobily PR, Kohout FJ, Wagenaar D. Evaluation of the Faces Pain Scale for use with the elderly . Clin J Pain . 1998; 14 ( 1 ):29–38. doi: 10.1097/00002508-199803000-00005 [PubMed] [CrossRef] [Google Scholar]

35. Stuppy DJ. The Faces Pain Scale: reliability and validity with mature adults . Appl Nurs Res . 1998; 11 ( 2 ):84–89. doi: 10.1016/S0897-1897(98)80229-2 [PubMed] [CrossRef] [Google Scholar]

36. Taylor LJ, Herr K. Pain intensity assessment: a comparison of selected pain intensity scales for use in cognitively intact and cognitively impaired African American older adults . Pain Manag Nurs . 2003; 4 ( 2 ):87–95. doi: 10.1016/S1524-9042(02)54210-7 [PubMed] [CrossRef] [Google Scholar]

37. Yazici Sayin Y, Akyolcu N. Comparison of pain scale preferences and pain intensity according to pain scales among Turkish patients: a descriptive study . Pain Manag Nurs . 2014; 15 ( 1 ):156–164. doi: 10.1016/j.pmn.2012.08.005 [PubMed] [CrossRef] [Google Scholar]

38. Closs SJ, Barr B, Briggs M, Cash K, Seers K. A comparison of five pain assessment scales for nursing home residents with varying degrees of cognitive impairment . J Pain Symptom Manage . 2004; 27 ( 3 ):196–205. doi: 10.1016/j.jpainsymman.2003.12.010 [PubMed] [CrossRef] [Google Scholar]

39. Orhan C, Van Looveren E, Cagnie B, Mukhtar NB, Lenoir D, Meeus M. Are pain beliefs, cognitions, and behaviors influenced by race, ethnicity, and culture in patients with chronic musculoskeletal pain: a systematic review . Pain Physician . 2018; 21 ( 6 ):541–558. [PubMed] [Google Scholar]

40. Sharma S, Ferreira-Valente A, de CWAC, Abbott JH, Pais-Ribeiro J, Jensen MP. Group differences between countries and between languages in pain-related beliefs, coping, and catastrophizing in chronic pain: a systematic review . Pain Med . 2020; 21 ( 9 ):1847–1862. doi: 10.1093/pm/pnz373 [PMC free article] [PubMed] [CrossRef] [Google Scholar]

41. Thong ISK, Jensen MP, Miro J, Tan G. The validity of pain intensity measures: what do the NRS, VAS, VRS, and FPS-R measure? Scand J Pain . 2018; 18 ( 1 ):99–107. doi: 10.1515/sjpain-2018-0012 [PubMed] [CrossRef] [Google Scholar]

42. Miró J, Huguet A. Evaluation of reliability, validity, and preference for a pediatric pain intensity scale: the Catalan version of the faces pain scale–revised . Pain . 2004; 111 ( 1–2 ):59–64. doi: 10.1016/j.pain.2004.05.023 [PubMed] [CrossRef] [Google Scholar]

43. Majedi H, Mohammadi M, Tafakhori A, Khazaeipour Z. The influence of chronic pain on number sense and Numeric Rating Scale: a prospective cohort study . Anesth Pain Med . 2020; 10 ( 2 ):e103532. doi: 10.5812/aapm.103532 [PMC free article] [PubMed] [CrossRef] [Google Scholar]

44. Wolrich J, Poots AJ, Kuehler BM, Rice AS, Rahman A, Bantel C. Is number sense impaired in chronic pain patients? Br J Anaesth . 2014; 113 ( 6 ):1024–1031. doi: 10.1093/bja/aeu255 [PMC free article] [PubMed] [CrossRef] [Google Scholar]

45. Spindler M, Koch K, Borisov E, et al. The influence of chronic pain and cognitive function on spatial-numerical processing . Front Behav Neurosci . 2018; 12 :165. doi: 10.3389/fnbeh.2018.00165 [PMC free article] [PubMed] [CrossRef] [Google Scholar]