Estimating Quality Weights for EQ-5D Health States with the Time Trade-Off Method in South Korea
ABSTRACT
Objective: To estimate quality weights of EQ-5D health states with the time trade-off (TTO) method in the general population of South Korea.
Methods: A total of 500 respondents valued 42 hypothetical EQ-5D health states using the TTO and visual analog scale. The quality weights for all EQ-5D health states were estimated by a random effects model and compared with those from studies in other countries.
Results: Overall estimated quality weights for all EQ-5D health states from this study were highly correlated with those from previous studies, but quality weights of individual states were substantially different from those of their corresponding states in other studies.
Conclusions: The Korean value set differed from value sets from other countries. Special caution is needed when a value set from one country is applied to another with a different culture.
Introduction
Interest in economic evaluation has increased along with the rapid growth of costs of public health. Among the several approaches used for economic evaluation, cost-utility analysis is useful for making comparisons across a broad set of interventions because it uses quality adjusted life years (QALYs). Calculation of QALYs requires quality weights for each health state.
The EQ-5D is a good preference measure that uses the health state classification system consisting of five dimensions (Mobility, Self-Care, Usual Activities, Pain/Discomfort, and Anxiety/Depression) with three levels of “description,” corresponding to “no problems” (level 1), “some problems” (level 2), and “extreme problems” (level 3), thus generating 243 possible health states [1]. The quality weight for each EQ-5D state can be found from a value set.
Although several studies in South Korea have used QALYs, they have used quality weights from other countries. We found, however, that there are differences between Korea and other countries in observed quality weights for some EQ-5D health states [2]. Many researchers have suggested that each country should develop its own social value set [3–5]. We therefore sought to develop a Korean valuation set for EQ-5D health states using time trade-off (TTO) and a statistical model based on quality weights for some EQ-5D health states in the general population of South Korea. We also compared the estimated values with those in other countries.
Methods
Individuals aged 20 years and over living in two adjacent regions, Seoul and Gyeonggi-do, were eligible for this study. A multistage proportionate quota sample was chosen with a consideration of age and sex distribution of the population. We selected 51 small areas and chose homes in each area with a systematic sampling. We finally interviewed individuals by visiting their homes. The survey was performed from March 24, 2005 to April 21, 2005 by 19 interviewers trained in valuing methods such as visual analog scale (VAS) and TTO.
The valuing processes were similar to those of UK Measurement and Valuation of Health (MVH) study [6]. At first, all respondents evaluated their own health states using EQ-5D. Then, they valued 15 hypothetical EQ-5D states which consisted of 11 EQ-5D health states randomly selected from a subset of 41 states by severity and “11111,”“33333,”“unconscious,” and “dead” states given to all respondents using VAS from 0 to 100. Lastly, 13 hypothetical health states except for “11111” and “dead” states were valued with TTO after dividing those states into “better than death” states and “worse than death” states. For “better than death” states, interviewers tried to find a respondent's point of indifference between a length of time (t) in the perfect health state (11111) and 10 years in the target health state. For “worse than death” states, respondents were asked to choose between dying immediately and living a length of time (10-t) in the target health state followed by t years in the perfect health state. We used health state description cards with visual aids to facilitate a valuation task. After their valuation process, respondents were given an opportunity for revision.
Because the meanings of t in “better than death” and “worse than death” states are different, adjustment is needed. Quality weights for states “better than death” were adjusted using the formula t/10. Quality weights for states “worse than death” were calculated using the formula –t/10, so that all values lie in the range [−1, 1]. The data of respondents that had four or more logical inconsistencies were excluded from modeling. The logical inconsistency was defined as the case of the value being larger than that of a health state that is dominant to that state, according to logical ordering of the health state [3].
The initial model included 5 dimensions of EQ-5D with 10 dummy variables indicating the presence of a level 2 or 3 in each dimension. The additional variables considered as independent variables for model development were N3, D1, I2, I2s, I3, and I3s [6,7]. “N3” represents “whether any of the dimensions is at level 3”[6] and “D1” means “the number of movements away from perfect health”[5]. “I*” and “I*s” represent the number of dimensions at level * beyond the first and the square of I*[5]. During model development, we considered four kinds of dependent variables as alternatives: 1 minus the quality weight for a given health state (q1), TTO value of “33333” state minus a given health state (q2), the logarithm of q1 and q2.
For determining the final model, we compared those models using residual diagnostic plots, Wald-type test and mean absolute error (MAE: the average of the absolute differences between observed and predicted values). The residual plots showed that the variability of standardized residuals decreased as predicted values increased. To overcome this heteroscedasticity, we considered a log linear model for the variance as well as a random effects model for mean response [7]. MAE was considered as an important model selection indicator because the objective of this modeling was to predict the quality weights of EQ-5D health states. In conditions with similar MAE values, the simplest model was selected because of parsimony.
To compare estimated quality weights for EQ-5D health states between this study and other foreign studies, the rank correlation coefficients and mean absolute differences (MAD) between estimated quality weights from the final model and those from the UK [6], USA [5], and Japan [4] were calculated.
Results
Respondents composed of 251 men and 249 women. The mean age was 41.3. The proportion of the subjects with elementary school or below, middle and high school, and university or above was 3.8%, 64.6%, and 31.6%, respectively. The sociodemographic characteristics of respondents are detailed elsewhere [2]. Most of the respondents did not differ from the Korean general population. Observed TTO values for the 42 EQ-5D health states and the “unconscious” state are shown in Table 1.
EQ-5D | N | Mean Observed value | SD | Mean Predicted value | EQ-5D | N | Mean Observed value | SD | Mean Predicted value |
---|---|---|---|---|---|---|---|---|---|
11112 | 297 | 0.922 | 0.061 | 0.906 | 12222 | 203 | 0.665 | 0.122 | 0.700 |
11121 | 273 | 0.910 | 0.061 | 0.906 | 21323 | 94 | 0.545 | 0.259 | 0.564 |
11211 | 158 | 0.906 | 0.067 | 0.900 | 32211 | 91 | 0.598 | 0.180 | 0.722 |
12111 | 162 | 0.908 | 0.074 | 0.890 | 12223 | 115 | 0.581 | 0.140 | 0.599 |
21111 | 110 | 0.902 | 0.077 | 0.910 | 22331 | 82 | 0.478 | 0.354 | 0.547 |
11122 | 217 | 0.812 | 0.089 | 0.871 | 21232 | 99 | 0.584 | 0.225 | 0.710 |
11131 | 156 | 0.778 | 0.106 | 0.888 | 32313 | 79 | 0.329 | 0.497 | 0.323 |
11113 | 196 | 0.800 | 0.091 | 0.875 | 22222 | 138 | 0.587 | 0.245 | 0.608 |
21133 | 97 | 0.701 | 0.230 | 0.733 | 33232 | 134 | 0.203 | 0.500 | 0.124 |
21222 | 103 | 0.759 | 0.113 | 0.755 | 23232 | 175 | 0.340 | 0.409 | 0.342 |
21312 | 104 | 0.680 | 0.216 | 0.763 | 23321 | 140 | 0.295 | 0.443 | 0.459 |
12211 | 134 | 0.797 | 0.111 | 0.840 | 13332 | 137 | 0.226 | 0.504 | 0.328 |
11133 | 83 | 0.731 | 0.132 | 0.796 | 22233 | 174 | 0.358 | 0.425 | 0.379 |
22121 | 113 | 0.781 | 0.105 | 0.803 | 22323 | 146 | 0.252 | 0.488 | 0.302 |
12121 | 124 | 0.798 | 0.110 | 0.849 | 32223 | 139 | 0.135 | 0.544 | 0.302 |
22112 | 96 | 0.751 | 0.133 | 0.804 | 32232 | 117 | 0.100 | 0.535 | 0.381 |
11312 | 77 | 0.705 | 0.268 | 0.818 | 33321 | 71 | 0.118 | 0.484 | 0.279 |
13212 | 137 | 0.625 | 0.166 | 0.691 | 33323 | 70 | −0.161 | 0.578 | −0.318 |
32331 | 112 | 0.394 | 0.438 | 0.396 | 23313 | 87 | 0.107 | 0.520 | 0.280 |
13311 | 164 | 0.576 | 0.264 | 0.699 | 33212 | 110 | 0.263 | 0.480 | 0.462 |
22122 | 186 | 0.651 | 0.164 | 0.730 | 33333 | 500 | −0.708 | 0.399 | −0.562 |
The number of respondents who had no logical inconsistency was 333 (66.6%) of 500 respondents. The number of respondents who violated over 3 inconsistencies was 12 (2.4%). Therefore, 488 respondent values were included in modeling data set.
Most of the five dimension variables were statistically significant in all alternative models. Predictive performance was improved by logarithmic transformation of q1 model. The q1 log model with basic variables was selected as the final model because all variables were statistically significant, MAE was relatively lower (0.074), and it was simple (Table 2).
Model | y | 1-S | 1-S | 1-S | LN(1-S)* | LN(1-S) | |||||
---|---|---|---|---|---|---|---|---|---|---|---|
xi | 5D | 5D+N3 | 5D+D1 | 5D | 5D+N3 | ||||||
β | p | β | p | β | p | β | p | β | p | ||
C | 0.021 | <0.001 | 0.019 | 0.1095 | — | — | −2.68 | <0.001 | −2.778 | <0.001 | |
Mo2 | 0.060 | <0.001 | 0.056 | <0.001 | 0.001 | 0.996 | 0.267 | <0.001 | 0.242 | <0.001 | |
Mo3 | 0.310 | <0.001 | 0.404 | <0.001 | 0.349 | <0.001 | 0.554 | <0.001 | 0.536 | <0.001 | |
Sc2 | 0.088 | <0.001 | 0.081 | <0.001 | 0.063 | <0.001 | 0.471 | <0.001 | 0.499 | <0.001 | |
Sc3 | 0.229 | <0.001 | 0.399 | <0.001 | 0.334 | <0.001 | 0.819 | <0.001 | 0.800 | <0.001 | |
UA2 | 0.072 | <0.001 | 0.084 | <0.001 | −0.033 | 0.029 | 0.374 | <0.001 | 0.244 | <0.001 | |
UA3 | 0.177 | <0.001 | 0.301 | <0.001 | 0.152 | <0.001 | 0.662 | <0.001 | 0.514 | <0.001 | |
PD2 | 0.077 | <0.001 | 0.076 | <0.001 | 0.071 | <0.001 | 0.318 | <0.001 | 0.369 | <0.001 | |
PD3 | 0.162 | <0.001 | 0.297 | <0.001 | 0.200 | <0.001 | 0.488 | <0.001 | 0.450 | <0.001 | |
AD2 | 0.059 | <0.001 | 0.077 | <0.001 | 0.045 | <0.001 | 0.313 | <0.001 | 0.338 | <0.001 | |
AD3 | 0.159 | <0.001 | 0.391 | <0.001 | 0.270 | <0.001 | 0.603 | <0.001 | 0.537 | <0.001 | |
N3 | −0.242 | <0.001 | 0.313 | <0.001 | |||||||
D1 | 0.042 | <0.001 | |||||||||
MAE | 0.094 | 0.082 | 0.094 | 0.074 | 0.071 |
- Quality weight calculation from the basic log model.
- Quality weight for 12123 state (q).
- Constant term: −2.68.
- Mobility: level 1—0, Self-care: level 2—0.471, Usual activity: level 1—0.
- Pain/discomfort: level 2—0.318, Anxiety/depression: level 3—0.603.
- LN(1 − q) = −2.68 + 0 + 0.471 + 0 + 0.318 + 0.603 = −1.288, (1 − q) = exp(−1.288) = 0.276.
- Thus the quality weight of 12123 state (q) is 0.774.
- * The basic log model.
The rank correlation coefficient of estimated values between this study and foreign studies showed a strong positive correlation (the UK: 0.759, the USA: 0.747, Japan: 0.721). However, the magnitude of MAD was substantial, 0.455, 0.237, and 0.198, respectively (Fig. 1).

Comparison of quality weight estimates for 242 EQ-5D health states with those in other studies.
Discussion
TTO values for 42 EQ-5D health states were elicited from 500 respondents in the general population of South Korea. The q1 log model was chosen as the final model for estimating quality weight for all EQ-5D health states. Although most models including five-dimension variables were significant, the q1 log model that included only basic variables, a random effects model, was selected as the prediction model because of goodness of fit using MAE. The magnitude of MAE (0.074) was higher than that of studies in the UK, the USA, and Japan. This may be because of, in part, the different patterns of quality weights for EQ-5D health states among countries. Most EQ-5D health states in this study had higher quality weights than those in other countries. The quality weight of “33333” state in this study was lower than in other studies. These differences may have contributed to the relatively high MAE in this study.
All regression coefficients in the final model were statistically significant by the Wald-type test. At level 3, the dimension receiving the highest coefficient was “self-care” followed by “usual activity,”“anxiety/depression,”“mobility,” and “pain/discomfort.” The order of these coefficients also differed from those in the UK, the USA, and Japan [4–6].
Although the overall value set showed a similar pattern among countries, estimated quality weights of EQ-5D health states in the general population of South Korea differed substantially from those in the UK, USA, and Japan. MAD between the Korean and Japanese value sets was lower than that between the Korean and UK and USA sets. This may be because of, in part, the magnitude of cultural similarities among countries. Even the lowest MAD, 0.198, may change the results or conclusions of studies on burden of disease, economic evaluation, and clinical trials when considering that the maximum possible MAD is 2. These MADs provide a rationale for the development of a Korean EQ-5D value set. TTO values may differ from country to country more than VAS values, as shown among European countries [3], between Japan and the UK [4], and between the UK and Spain [8].
Conclusion
This study successfully determined quality weights for all EQ-5D health states using TTO in a general population of South Korea. Substantial differences in the EQ-5D value set among countries were found. Special caution is needed when a value set from one country is applied to another with a different culture.