Science Cast

INCREASING PHENOMIC PREDICTION EFFICIENCY USING A PRINCIPAL COMPONENT ANALYSIS BASED PRE-PROCESSING OF NEAR INFRARED SPECTRA

librarian, May 13, 2026 9:41pm

This paper is a preprint and has not been certified by peer review.

bioRxiv, May 13, 2026 12:00am

Authors

Bienvenu, C.; Roger, J.-M.; Sene, M.; Castro Pacheco, S. A.; Singer, M.; Felaniaina, B. L.; Terrier, N.; De Bellis, F.; Pot, D.; de Verdal, H.; Segura, V.

Abstract

Phenomic prediction (PP) is a breeding value prediction method using near infrared spectroscopy (NIRS). Spectra pre-processing is a key step in the PP analysis pipeline and generally involves chemometric methods. However, the genetics community still has little understanding of what pre-processing does and why it improves performance. Consequently, the choice of pre-processing is made either arbitrarily or through a search for the optimal set of methods and associated parameters. In this study, we propose a PCA-based pre-processing method in which genetic values of spectra are estimated on a set of principal components instead of individual wavelengths. This way, estimations are based on a few informative and orthogonal features of spectra instead of many correlated, uninformative wavelengths. We tested this new pre-processing method on five data sets representing four plant species (maize, rice, sorghum and grapevine). Results show that it performs as well as, or better than, the best classical chemometric pre-processing methods in almost all cases. Combining PCA-based and classical chemometric pre-processing methods maximizes predictive ability. Moreover, this pre-processing method opens up possibilities for better understanding and selecting the parts of the spectral information that are relevant for the prediction of breeding values. Indeed, components that together represent about 1% of spectral variability were found to account for most of the predictive ability of PP.
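As a rough illustration of the idea in the abstract, the sketch below projects spectra onto a few orthogonal principal components and then summarizes each genotype on those components rather than on individual wavelengths. It is not the authors' pipeline: the data are synthetic, the function names `pca_scores` and `genotype_means` are hypothetical, and the per-genotype average is only a crude stand-in for the genetic value estimation described in the abstract, whose statistical model is not given here.

```python
import numpy as np

def pca_scores(spectra, n_components):
    """Project mean-centered spectra onto their first principal components."""
    centered = spectra - spectra.mean(axis=0)
    # SVD of the centered matrix: rows of vt are orthonormal loadings.
    u, s, vt = np.linalg.svd(centered, full_matrices=False)
    scores = u[:, :n_components] * s[:n_components]
    explained = (s ** 2) / np.sum(s ** 2)
    return scores, vt[:n_components], explained[:n_components]

def genotype_means(scores, genotype_ids):
    """Average component scores per genotype.

    Assumption: a plain mean replaces the genetic value estimation
    used in the study, which this sketch does not reproduce.
    """
    labels, inverse = np.unique(genotype_ids, return_inverse=True)
    sums = np.zeros((labels.size, scores.shape[1]))
    np.add.at(sums, inverse, scores)
    counts = np.bincount(inverse, minlength=labels.size)
    return labels, sums / counts[:, None]

# Synthetic data (hypothetical sizes): 20 genotypes, 3 replicate spectra each,
# 200 wavelengths; cumulative sums make neighboring wavelengths correlated.
rng = np.random.default_rng(0)
n_genotypes, n_reps, n_wavelengths = 20, 3, 200
genotype_ids = np.repeat(np.arange(n_genotypes), n_reps)
spectra = rng.normal(size=(n_genotypes * n_reps, n_wavelengths)).cumsum(axis=1)

scores, loadings, explained = pca_scores(spectra, n_components=5)
labels, genotype_scores = genotype_means(scores, genotype_ids)
```

The resulting `genotype_scores` matrix (one row per genotype, one column per component) would then take the place of the wavelength matrix as the predictor set in whatever prediction model is used downstream.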
