林业科学2026,Vol.62Issue(2):111-125,15.DOI:10.11707/j.1001-7488.LYKX20240779
基于CARS-PLSR的油茶种仁油脂含量和脂肪酸成分的近红外光谱预测模型构建
Construction of Near Infrared Spectroscopy Prediction Models Based on CARS-PLSR for Determining Oil Content and Fatty Acid Composition of Camellia oleifera Kernel
摘要
Abstract
[Objective]This study aims to develop a low-cost,non-destructive,accurate,and batch method for detecting the oil content and fatty acid composition of Camellia oleifera kernels,to improve the evaluation efficiency of the oil traits.[Method]The oil content in kernels of 220 C.oleifera clones was determined by Soxhlet extraction,and the fatty acid composition was determined by gas chromatography,respectively.The near infrared spectra of the kernels in the wavelength range of 1 000-2 500 nm were collected.After preprocessing the spectral data using 9 methods,the samples were divided into calibration and prediction sets at a ratio of 4:1 by random sampling(RS)and sample set partitioning based on joint X-Y distance(SPXY),respectively.The competitive adaptive reweighted sampling(CARS)was used to select the key wavelengths that were significantly correlated with R2p R2p R2p the oil traits of C.oleifera from the spectral data,and the partial least squares regression(PLSR)prediction models were established for determining the oil content and fatty acid composition of C.oleifera kernels.[Result]The variation ranges of oil content and the content of seven fatty acids(C16:0,C16:1,C18:0,C18:1,C18:2,C18:3,C20:1)were in accordance with or close to normal distribution.The established models for predicting oil content had good accuracy and stability.With the RS samples dividing method,the pretreatment method of standard normal variate(SNV)was optimal.With 14 key wavelengths selected,a prediction model of oil content was established with the relative percent deviation(RPD)of 5.205 5,prediction set determination coefficient(R2p)and root mean square error(RMSEp)of 0.965 1 and 1.854 8 g·(100 g)-1,respectively.With the SPXY samples dividing method,the optimal SNV+first derivative(FD)pretreatment,and 25 key wavelengths selected,another prediction model of oil content was established with a RPD of 3.417 0,prediction set R2p and RMSEp of 0.916 8 and 2.622 4 g·(100 g)-1,respectively.The models for C18:1,C18:2 and C18:3 contents were optimal under the RS method using second derivative(SD),SNV and continuum removal(CR)pretreatment methods,respectively,with RPD values of 1.939 4,2.116 4 and 2.338 1,R2p values of 0.738 5,0.775 4 and 0.831 6,and RMSEp values of 1.707 1%,1.370 2%and 0.049 2%,respectively.[Conclusion]The prediction model for oil content of C.oleifera kernels has been constructed based on near-infrared spectroscopy in this study.This model has high accuracy and good stability,and can be used for rapid,batch and non-destructive detection of oil content of C.oleifera kernels.The prediction models for C18:1,C18:2 and C18:3 contents can be used for preliminary prediction of unsaturated fatty acid.This study can provide scientific basis for rapid detection of oil content,fatty acid composition and other traits of C.oleifera by near-infrared spectroscopy technology.关键词
油茶种仁/油脂含量/脂肪酸/近红外光谱/预测模型Key words
Camellia oleifera kernel/oil content/fatty acid/near infrared spectroscopy/prediction model分类
农业科技引用本文复制引用
钟慧奇,柴静瑜,王开良,滕建华,毕文玉,王安妮,林萍..基于CARS-PLSR的油茶种仁油脂含量和脂肪酸成分的近红外光谱预测模型构建[J].林业科学,2026,62(2):111-125,15.基金项目
"十四五"国家重点研发计划课题(2022YFD2200401) (2022YFD2200401)
浙江省林木新品种选育重大科技专项课题(2021C02070-2). (2021C02070-2)