1 INTRODUCTION
The change produced by selection that chiefly interests us is the change of the population mean. This is called the response to selection and is the difference in mean phenotypic value between the offspring of the selected parents and the whole of the parental generation before selection (Falconer, 1981). However, if the environment differs between generations, it is necessary to correct for this effect. Therefore, in such cases, the breeding value (BV) estimated by the BLUP method is generally used to predict the response to selection.
Prediction of response to selection is important in choosing breeding programs and for estimating the expected economic benefits (Dekkers,
1992; Verrier et al.,
1991). The classical formula for predicting response to truncated phenotypic selection (Falconer,
1981) is
where
is the intensity of selection,
is the additive genetic standard deviation in the candidate population for selection, and
is the correlation coefficient between BVs and phenotypic values in the candidate population for selection (accuracy of selection). If the selection is based on BLUP (
) of the breeding values instead of the phenotypic value, then
is
.
Theoretical methods have been proposed to approximate the responses to selection based on selection index and BLUP methods (e.g., Dekkers, 1992; Wray & Hill, 1989). These methods predict the response to selection by giving population size, selection rate, genetic parameters, and so on. On the other hand, when actual pedigree information and phenotypic records are available, the predicted response to selection based on the BLUP method is the deviation of the mean of estimated breeding values (EBVs) in the selected animals from the mean of EBVs in the candidate population for selection. There are also several possible methods of predicting response to selection based on the above-quoted Falconer's equation.
In this study, the prediction accuracies of three different methods for predicting responses to truncated selection based on the BLUP method were compared using data generated by Monte Carlo computer simulation. The estimation accuracies of fixed effects were also examined.
2 METHODS
Consider the following equations for a mixed model:
where
y,
b,
g, and
e are vectors of phenotypic records, fixed effects, breeding values, and random errors;
X and
Z are the incidence matrices that relate
y to
b and
y to
g;
is the random environmental variance; and
A and
I are the additive relationship and incidence matrices, respectively. Let
and
be vectors of estimated values of
b and predicted values of
g. These are obtained by solving the following mixed model equation (MME) (Henderson,
1973).
(1)
The inverse matrix on the left-hand side of Equation (
1) is
(2)
From Henderson (
1975),
(3) and
(4)
2.1 Root-mean-square error (RMSE) of fixed effect
2.1.1 RMSE calculated from estimates of fixed effect
Let t,
, and
be the number of levels of the fixed effect, the true value at the
jth level, and its estimated value in the solution of Equation (
1). The root-mean-square error (
) at each level of the fixed effect is
2.1.2 RMSE calculated from the inverse matrix of the left-hand side of MME
Let
be the
jth diagonal element of
in Equations (
2) and (
3). The root-mean-square-error (
) at each level of the fixed effect is
2.2 Prediction of response to selection
Let us assume that the candidate population for selection consists of an equal number of males and females (N each) and the total average BV in the population is
. Let
and
be the average BV of
males and
females selected from the population. The response to selection (
) is
This was assumed to be the true response to selection.
2.2.1 Prediction of response to selection from the mean of EBV
Let
be the average EBV in a candidate population for selection and
and
, respectively, be the average EBV of selected males and females. The response to selection (
) is
2.2.2 Prediction of response to selection from the variance of EBV
The variance of EBV in a candidate population for selection (
) is estimated from
where
is the EBV in the
ith animal. From Equation (
4),
is equal to covariance (
) between BV and EBV. Therefore, the response to selection (
) is
where
is the average inbreeding coefficient in the population.
2.2.3 Prediction of response to selection from the inverse of the left-hand side of MME
Let
be the
ith diagonal element of
,
where
,
, and
are BV, EBV, and the inbreeding coefficient of the
ith animal.
and
in a candidate population for selection from Equations (
2) and (
4)
where
is the mean of the diagonal elements of
corresponding to each animal in the population. Therefore, the response to selection (
) is
2.3 Comparison of response to selection using computer simulation
To compare the response to selection using different prediction methods (
,
,
), a Monte Carlo computer simulation was used to generate pedigree and phenotypic records. The base population (G0) consisted of
males and
females that were all assumed to be unrelated, unselected, and non-inbred. Single trait of six separate generations (G0 to G5) were simulated. One male was mated at random to five females, a mated female produced two males and two females, and
males and
females were randomly selected in each generation. An infinitesimal additive genetic model (Bulmer,
1980) was assumed. The additive genetic value (
) of the
ith animal was generated using the following equation:
where
and
are the additive genetic values for the sire and dam of the
ith animal, respectively;
and
are the inbreeding coefficients of the sire and dam, respectively; and
is the standard normal deviate of the
ith animal.
The phenotypic record (
) of the
ith animal was simulated using the following equation:
where
is the random environmental effect for the
ith animal with a mean of zero and variance of
.
The phenotypic variance of a trait with heritability of 0.2 or 0.5 was set at 100. Fixed effect was assumed to be mean (one level), sex (two levels), or generation (five levels), and the magnitude of every level was set at zero. The population sizes () were assumed to be , , , , or . Thus, the selection intensities in each population are 1.2640, 1.2700, 1.2765, 1.2765, and 1.2765 (Falconer, 1981). Animals in G0 had no records and G1 through G5 had one record for all animals. One thousand replicates were simulated for each combination () of heritability, fixed effect, and population size.
and
were obtained based on the data generated using the simulation. When
was taken as the true value, the mean errors and RMSEs of
to
were calculated as follows:
where
is
at
kth iteration and
rpt (= 1000) is the number of iterations. The mean errors and RMSEs were used as criteria of the prediction accuracy in the response to selection.
3 RESULTS
Table 1 shows the mean RMSE and its standard error (SE) of fixed effect over 1000 replicates. No obvious differences or trends were found between and . The SE for was smaller than that for . The RMSE tended to rise with increasing number of levels of fixed effect. The larger the population size, the smaller the RMSE. The RMSE was smaller for a trait with low heritability than with high heritability.
TABLE 1.
Mean RMSE and its standard error (SE) of the fixed effect over 1000 replicates when one of the three kinds of fixed effects is included in the mixed model.
Population size |
Fixed effect |
h2 = 0.2 |
h2 = 0.5 |
RMSEFxd1 |
RMSEFxd2 |
RMSEFxd1 |
RMSEFxd2 |
10 sires: 50 dams |
Mean |
0.917 (1.26E-03) |
0.892 (2.96E-06) |
2.172 (3.28E-03) |
1.817 (7.67E-07) |
Sex |
0.945 (1.19E-03) |
0.982 (2.88E-06) |
1.986 (2.58E-03) |
1.891 (7.33E-07) |
Genertion |
1.787 (1.96E-03) |
1.763 (1.99E-05) |
3.369 (3.94E-03) |
3.093 (1.37E-05) |
20 sires: 100 dams |
Mean |
0.463 (6.81E-04) |
0.446 (1.07E-06) |
0.935 (1.43E-03) |
0.908 (2.86E-07) |
Sex |
0.488 (6.17E-04) |
0.491 (1.03E-06) |
0.896 (1.14E-03) |
0.946 (2.72E-07) |
Generation |
0.881 (9.38E-04) |
0.884 (7.09E-06) |
1.596 (1.83E-03) |
1.549 (5.10E-06) |
30 sires: 150 dams |
Mean |
0.288 (4.41E-04) |
0.297 (5.71E-07) |
0.608 (8.04E-04) |
0.606 (1.54E-07) |
Sex |
0.345 (4.56E-04) |
0.327 (5.71E-07) |
0.654 (9.28E-04) |
0.631 (1.50E-07) |
Generation |
0.613 (6.16E-04) |
0.590 (3.91E-06) |
1.039 (1.16E-03) |
1.033 (2.76E-06) |
40 sires: 200 dams |
Mean |
0.222 (3.14E-04) |
0.223 (3.55E-07) |
0.440 (6.52E-04) |
0.454 (9.76E-08) |
Sex |
0.239 (3.12E-04) |
0.246 (3.82E-07) |
0.443 (6.18E-04) |
0.473 (9.86E-08) |
Generation |
0.404 (4.13E-04) |
0.443 (2.63E-06) |
0.757 (9.12E-04) |
0.775 (1.82E-06) |
50 sires: 250 dams |
Mean |
0.180 (2.41E-04) |
0.179 (2.67E-07) |
0.390 (5.27E-04) |
0.363 (7.13E-08) |
Sex |
0.196 (2.55E-04) |
0.196 (2.81E-07) |
0.390 (5.25E-04) |
0.378 (7.14E-08) |
Generation |
0.351 (3.62E-04) |
0.354 (1.88E-06) |
0.655 (7.61E-04) |
0.620 (1.33E-06) |
-
Note: Parentheses denote SE of RMSE.
-
Abbreviations: RMSEFxd1, RMSE calculated from solutions of MME; RMSEFxd2, RMSE calculated from the inverse of the left-hand side of MME.
Table 2 shows the mean of response to selection and its SE over 1000 replicates for each prediction method. Since no difference among three fixed effects was observed, the results are presented here with the fixed effect as mean only. The SE was, in increasing order , followed by , and was largest for . The response to selection increased as the population size increased.
TABLE 2.
Mean true response to selection (Δg0) and predicted responses to selection (Δg1-Δg3) and their standard errors (SE) over 1000 replicates under conditions of different heritabilities and population sizes.
Heritability |
Population size |
Δg0 |
Δg1 |
Δg2 |
Δg3 |
Mean |
(se) |
Mean |
(se) |
Mean |
(se) |
Mean |
(se) |
0.2 |
10 sires: 50 dams |
3.265 |
(8.9E-04) |
3.249 |
(5.4E-04) |
3.088 |
(4.9E-04) |
3.474 |
(1.9E-05) |
20 sires: 100 dams |
3.434 |
(6.7E-04) |
3.444 |
(4.3E-04) |
3.355 |
(3.8E-04) |
3.529 |
(1.0E-05) |
30 sires: 150 dams |
3.473 |
(5.3E-04) |
3.452 |
(3.4E-04) |
3.410 |
(3.1E-04) |
3.558 |
(6.8E-06) |
40 sires: 200 dams |
3.497 |
(4.5E-04) |
3.485 |
(3.0E-04) |
3.451 |
(2.8E-04) |
3.564 |
(5.0E-06) |
50 sires: 250 dams |
3.510 |
(3.8E-04) |
3.510 |
(2.7E-04) |
3.483 |
(2.5E-04) |
3.567 |
(4.1E-06) |
0.5 |
10 sires: 50 dams |
6.538 |
(1.1E-03) |
6.562 |
(8.7E-04) |
6.223 |
(7.5E-04) |
6.805 |
(1.6E-05) |
20 sires: 100 dams |
6.861 |
(8.4E-04) |
6.857 |
(6.7E-04) |
6.674 |
(6.0E-04) |
6.953 |
(8.4E-06) |
30 sires: 150 dams |
6.938 |
(6.9E-04) |
6.951 |
(5.4E-04) |
6.848 |
(4.9E-04) |
7.028 |
(5.6E-06) |
40 sires: 200 dams |
6.982 |
(6.2E-04) |
6.972 |
(4.9E-04) |
6.899 |
(4.3E-04) |
7.047 |
(4.4E-06) |
50 sires: 250 dams |
6.996 |
(5.4E-04) |
6.995 |
(4.3E-04) |
6.934 |
(3.9E-04) |
7.059 |
(3.6E-06) |
-
Note: Parentheses denote SE of RMSE.
-
Abbreviations: Δg0, true response to selection; Δg1, predicted response to selection calculated from the mean of EBV; Δg2, predicted response calculated from the variance of EBV; Δg3, predicted response calculated from the inverse of the left-hand side of MME.
Table 3 shows the average mean error and RMSE over 1000 replicates. The absolute value of the mean error for was the smallest and the absolute values for and were, respectively, all negative and all positive. The absolute values of the mean errors tended to become larger as the population size decreased. The RMSE was smallest for , followed by and . For all prediction methods, the RMSE tended to become larger as the population size decreased. In increasing order, RMSE was smallest for , followed by and .
TABLE 3.
Average mean error and RMSE over 1000 replicates of deviations of predicted responses to selection (Δg1 to Δg3) from true response to selection (Δg0).
Heritability |
Population size |
Mean error |
RMSE |
Δg1 |
Δg2 |
Δg3 |
Δg1 |
Δg2 |
Δg3 |
0.2 |
10 sires: 50 dams |
−0.015 |
−0.176 |
0.209 |
0.338 |
0.268 |
0.494 |
20 sires: 100 dams |
0.011 |
−0.078 |
0.095 |
0.306 |
0.261 |
0.401 |
30 sires: 150 dams |
−0.020 |
−0.063 |
0.085 |
0.247 |
0.226 |
0.357 |
40 sires: 200 dams |
−0.011 |
−0.046 |
0.067 |
0.233 |
0.215 |
0.323 |
50 sires: 250 dams |
0.013 |
−0.090 |
0.090 |
0.281 |
0.224 |
0.387 |
0.5 |
10 sires: 50 dams |
0.024 |
−0.316 |
0.267 |
0.361 |
0.232 |
0.566 |
20 sires: 100 dams |
−0.004 |
−0.187 |
0.092 |
0.291 |
0.219 |
0.431 |
30 sires: 150 dams |
−0.001 |
−0.028 |
0.057 |
0.220 |
0.205 |
0.295 |
40 sires: 200 dams |
−0.010 |
−0.083 |
0.065 |
0.246 |
0.219 |
0.364 |
50 sires: 250 dams |
−0.001 |
−0.062 |
0.063 |
0.238 |
0.209 |
0.345 |
-
Abbreviations: Δg0, true response to selection; Δg1, predicted response to selection from the mean of EBV; Δg2, predicted response to selection from the variance of EBV; Δg3, predicted response to selection from the inverse of the left-hand side of MME.
4 DISCUSSION
One method of estimating genetic trends is to use the BLUP of BVs (Henderson, 1973), which is a development of the method for separating genetic effects from environmental effects (Henderson et al., 1959). Estimation of genetic trends is important to provide breeders with information that will enable them to develop more efficient selection programs in the future (Béjar et al., 1993). This method is widely used from laboratory animals to large livestock (e.g., Abdallah & McDaniel, 2000; Saad et al., 2020; Satoh et al., 1997; Tomiyama et al., 2011). When estimating responses to selection using this method, animals for which phenotypic values have been obtained are targeted. On the other hand, it is possible to predict the response to selection even if the phenotypic values of animals are not available. In this study, when the phenotypic values for progenies of selected animals are not available, three methods of predicting the response to selection were compared in terms of the accuracy of estimating fixed effects and the accuracy of predicting responses to truncated selection based on EBV.
In the estimation of fixed effect, there was no obvious difference between and , and the SE was smaller for than for . However, since the SE was very small compared with the RMSE, it was considered to have no obvious impact on the accuracy of the estimation of the fixed effect.
The absolute value of the mean error was the smallest for and the RMSE was the smallest for . Since predicts the response to selection using the diagonal elements of the inverse of the left-hand side of MME, it requires more computing time than and . In addition, the mean error was not the smallest and the RMSE was the largest for . Since and predict the response to selection based on the accuracy of selection, it is likely that underestimates and overestimates the variance of the EBV. In particular, the low prediction accuracy of may be due to the use of the diagonal elements of the inverse matrix of the mixed model equations. Essentially, the diagonal elements are the prediction error variances of BVs in each individual. The use of these averages in this study may be one of the reasons for the low prediction accuracy. We therefore consider it desirable to use or in a randomly selected population if the phenotypic records for progenies of selected animals are not available. It should be noted, however, that was slightly biased downward from its true value. Although the RMSE increased with decreasing population size, the response to selection based on EBV can be predicted, even in small populations.
The formula for replaces the BV in the formula for the true value () with the estimated value EBV and can be used even when generations overlap or when the distribution is not normal. For and , if the BVs or EBVs are not normally distributed, the prediction accuracy of selection response may decrease, and it may be difficult to calculate the selection intensity in a population with overlapping generations. Thus, in the population where selection is ongoing, the prediction accuracy of selection response is likely to be affected by the distortion of the distribution and the Bulmer effect (Bijma, 2012; Gorjanc et al., 2015). The is, therefore, considered to be the most practical for such a population.
ACKNOWLEDGMENTS
This work was supported by JSPS KAKENHI Grant Numbers JP22H02491.
CONFLICT OF INTEREST STATEMENT
The author declares that he has no conflicts of interest.
REFERENCES
- Abdallah, J. M., & McDaniel, B. T. (2000). Genetic parameters and trends of milk, fat, days open, and body weight after calving in North Carolina experimental herds. Journal of Dairy Science, 83, 1364–1370. https://doi.org/10.3168/jds.S0022-0302(00)75004-1
- Béjar, F., Rodríguez, M. C., & Toro, M. A. (1993). Estimation of genetic trends for weaning weight and teat number in Iberian pigs using mixed model methodology. Livestock Production Science, 33, 239–251. https://doi.org/10.1016/0301-6226(93)90005-3
- Bijma, P. (2012). Accuracies of estimated breeding values from ordinary genetic evaluations do not reflect the correlation between true and estimated breeding values in selected populations. Journal of Animal Breeding and Genetics, 129, 345–358. https://doi.org/10.1111/j.1439-0388.2012.00991.x
- Bulmer, M. G. (1980). The mathematical theory of quantitative genetics. Clarendon Press.
- Dekkers, J. C. M. (1992). Asymptotic response to selection on best linear unbiased predictors of breeding values. Animal Production, 54, 351–360.
- Falconer, D. S. (1981). Introduction to quantitative genetics ( 2nd ed.). Longman.
- Gorjanc, G., Bijma, P., & Hickey, J. M. (2015). Reliability of pedigree-based and genomic evaluations in selected populations. Genetics, Selection, Evolution, 47, 65. https://doi.org/10.1186/s12711-015-0145-1
- Henderson, C. R. (1973). Sire evaluation and genetic trends. In Proceedings of the Animal Breeding and Genetics Symposium in Honor of Dr. J.L. Lush. A.S.A.S. and A.D.S.A. 1041.
- Henderson, C. R. (1975). Best linear unbiased estimation and prediction under a selection model. Biometrics, 31, 423–447. https://doi.org/10.2307/2529430
- Henderson, C. R., Kempthorne, O., Searle, S. R., & von Krosigk, C. M. (1959). The estimation of environmental and genetic trends from records subject to culling. Biometrics, 13, 192–218. https://doi.org/10.2307/2527669
- Saad, H. M., Thomas, M. G., Speidel, S. E., Peel, R. K., Frasier, W. M., & Enns, R. M. (2020). Differential response from selection for high calving ease vs. low birth weight in American Simmental beef cattle. Journal of Animal Science, 98, skaa162. https://doi.org/10.1093/jas/skaa162
- Satoh, M., Nishida, A., van Arendonk, J. A. M., & van der Lende, T. (1997). Benefit of multiple trait selection to increase reproductive traits: Experimental evidence from golden hamsters. Journal of Animal Science, 75, 3103–3113. https://doi.org/10.2527/1997.75123103x
- Tomiyama, M., Kubo, S., Takagi, T., & Suzuki, K. (2011). Evaluation of genetic trends and determination of the optimal number of cumulative records of parity required in reproductive traits in a large white pig population. Animal Science Journal, 82, 621–626. https://doi.org/10.1111/j.1740-0929.2011.00889.x
- Verrier, E., Colleau, J. J., & Foulley, J. L. (1991). Methods for predicting response to selection in small populations under additive genetic models: A review. Livestock Production Science, 29, 93–114. https://doi.org/10.1016/0301-6226(91)90060-4
- Wray, N. R., & Hill, W. G. (1989). Asymptotic rates of response from index selection. Animal Production, 49, 217–227.