Evaluation of the effectiveness of bootstrap aggregating sampling technique in the accuracy of genomic best linear unbiased prediction method

Document Type : Research Paper

Authors

1 Ph.D. Candidate, Department of Animal Science, Khuzestan Agricultural Sciences and Natural Resources University, Iran

2 Associate Professor, Department of Animal Science, Khuzestan Agricultural Sciences and Natural Resources University, Iran

3 Professor, Department of Animal Science, Khuzestan Agricultural Sciences and Natural Resources University, Iran

4 Assistant Professor, Department of Animal and Poultry Science, Aburaihan Campus, University of Tehran, Iran

Abstract

In order to increase the accuracy of genomic best linear unbiased prediction method (GBLUP), bootstrap aggregating sampling (bagging) technique was applied. In this order a genome consisted of 10,000 bi-allelic single nucleotide polymorphism (SNP) over ten chromosomes, with 100 cM length each, was simulated. To generate linkage disequilibrium (LD) between SNPs and quantitative trait loci (QTL), random mating was simulated for 100 generations between 100 individuals (50 males and 50 females). Then in generation 101 (reference population) number of individuals increased to 1000 or 2000 and their phenotypes were also simulated. Then the marker effects were estimated in this population using GBLUP method or combined this method with bagging technique (BGBLUP). By using these regression coefficients and according to the genotype markers for juvenile individuals in generations 102 to 105, called validation population which had no phenotype, genomic breeding values were predicted. According to the finding of this research, the accuracies of genomic breeding values of GBLUP method were higher than those for BGBLUP (p > 0.05) and about the first testing set (102 generation) and regardless of QTL effects with a population of 1000 (or 2000) observations in the reference set, the range of GBLUP accuracy was 0.339±0.049 (0.412±0.042) for a trait with 0.05 heritability to 0.728±0.015 (0.783±0.015) for a trait with 0.65 heritability, whereas the accuracy of BGBLUP method were varied between 0.338±0.047 (0.411±0.042) to 0.725±0.016 (0.780±0.015).

Keywords


  1. Abdollahi‐Arpanahi, R., Morota, G., Valente, B.D., Kranis, A., Rosa, G. J. M. & Gianola, D. (2015). Assessment of bagging GBLUP for whole‐genome prediction of broiler chicken traits. Journal of Animal Breeding and Genetics, 132, 218-228.
  2. Breiman, L. (1996). Bagging predictors. Machine learning, 24, 123-140.
  3. Efron, B. & Tibshirani, R. J. (1993). An introduction to the bootstrap, New York: Chapman & Hall.
  4. Gianola, D., Weigel, K. A., Krämer, N., Stella, A. & Schön, C. C. (2014). Enhancing genome-enabled prediction by bagging genomic BLUP. PLoS One, 9, e91693.
  5. Goddard, M. (2009). Genomic selection: prediction of accuracy and maximisation of long term response. Genetics, 136, 245-257.
  6. Goddard, M. E. & Hayes, B. J. (2007). Genomic selection. Journal of Animal Breeding and Genetics, 124, 323-330.
  7. Goddard, M. E. & Hayes, B. J. (2009). Mapping genes for complex traits in domestic animals and their use in breeding programmes. Nature Reviews Genetics, 10, 381-391.
  8. García-Ruiz, A., Cole, J. B., VanRaden, P. M., Wiggans, G. R., Ruiz-López, F. J. & Van Tassell, C. P. (2016). Changes in genetic selection differentials and generation intervals in US Holstein dairy cattle as a result of genomic selection. Proceedings of the National Academy of Sciences, 113, E3995-E4004.
  9. Habier, D., Fernando, R. & Dekkers, J. (2007). The impact of genetic relationship information on genome-assisted breeding values. Genetics, 177, 2389-2397.
  10. Habier, D., Tetens, J., Seefried, F.R., Lichtner, P. & Thaller, G. (2010). The impact of genetic relationship information on genomic breeding values in German Holstein cattle. Genetics Selection Evolution, 42, 5.
  11. Hayes, B. J., Bowman, P. J., Chamberlain, A. J. & Goddard, M. E. (2009). Invited review: Genomic selection in dairy cattle: Progress and challenges. Journal of Dairy Science, 92, 433-443.
  12. Henderson, C. R. (1975). Best linear unbiased estimation and prediction under a selection model. Biometrics, 423-447.
  13. Hutchison, J., Cole, J. & Bickhart, D. (2014). Use of young bulls in the United States. Journal of Dairy Science, 97, 3213-3220.
  14. Lee, S. H., van der Werf, J. H., Hayes, B. J., Goddard, M. E. & Visscher, P. M. (2008). Predicting unobserved phenotypes for complex traits from whole-genome SNP data. PLoS Genetics, 4, e1000231.
  15. Lund, M. S., Sahana, G., de Koning, D. J., Su, G. & Carlborg, Ö. (2009). Comparison of analyses of the QTLMAS XII common dataset. I: Genomic selection. BMC proceedings, BioMed Central.
  16. Meuwissen, T., Hayes, B. & Goddard, M. (2001). Prediction of total genetic value using genome-wide dense marker maps. Genetics, 157, 1819-1829.
  17. Mikshowsky, A. A., Gianola, G. & Weigel, K. A. (2016). Improving reliability of genomic predictions for Jersey sires using bootstrap aggregation sampling. Journal of Dairy Science, 99, 3632-3645.
  18. Mikshowsky, A. A., Gianola, G. & Weigel, K. A. (2017). Assessing genomic prediction accuracy for Holstein sires using bootstrap aggregation sampling and leave-one-out cross validation. Journal of Dairy Science, 100, 453-464.
  19. Schaeffer, L. (2006). Strategy for applying genome‐wide selection in dairy cattle. Journal of Animal Breeding and Genetics, 123, 218-223.
  20. Technow, F. (2013). R Package hypred: Simulation of genomic data in applied Genetics. University of Hohenheim.
  21. VanRaden, P., Van Tassell, V., Wiggans, G., Sonstegard, T., Schnabel, R., Taylor, J. & Schenkel, F. (2009). Invited review: Reliability of genomic predictions for North American Holstein bulls. Journal of Dairy Science, 92, 16-24.