化学学报 ›› 2013, Vol. 71 ›› Issue (9): 1281-1286.DOI: 10.6023/A13040403 上一篇    下一篇

研究论文

基于平行因子分析法提高近红外无创血糖校正模型稳健性的研究

张婉洁, 刘蓉, 徐可欣   

  1. 天津大学 精密测试技术及仪器国家重点实验室 天津 300072
  • 投稿日期:2013-04-12 发布日期:2013-05-16
  • 通讯作者: 刘蓉,E-mail:rongliu@tju.edu.cn E-mail:rongliu@tju.edu.cn
  • 基金资助:

    项目受国家自然科学基金(Nos. 60938002, 30900275)、国家863计划(No. 2012AA022602)资助

Enhanced Robustness of Calibration Models Using Parallel Factor (PARAFAC) Analysis with NIR Spectral Data for Non-invasive Blood Glucose Monitoring

Zhang Wanjie, Liu Rong, Xu Kexin   

  1. State Key Laboratory of Precision Measuring Technology and Instruments, Tianjin University, Tianjin 300072
  • Received:2013-04-12 Published:2013-05-16
  • Supported by:

    Project supported by the National Natural Science Foundation of China (Nos. 60938002, 30900275) and the National High Technology Research and Development Program of China (‘863’ Program, No. 2012AA022602).

采用近红外光谱进行无创血糖检测时, 样品背景变动造成的预测集样本与校正集样本量测体系不一致的问题是导致预测精度低的原因之一. 提出一种将母体背景作为变量引入回归建模中, 结合各个母体背景下的样本光谱信息构建三维光谱矩阵以提高校正模型稳健性的分析方法. 将平行因子分析(PARAFAC)与多元线性回归(MLR)相结合, 对人体三层皮肤模型的蒙特卡罗模拟实验和葡萄糖水溶液及其混合物的离体实验进行了验证. 实验结果表明, 与传统的单一母体背景所建立的偏最小二乘模型相比, 将母体背景作为建模元素采用PARAFAC-MLR法所建立的校正模型具有更好的预测能力和稳健性.

关键词: 近红外, 平行因子分析, 多元线性回归, 蒙特卡罗, 葡萄糖, 背景

The main cause of the low prediction accuracy in non-invasive blood glucose monitoring with near-infrared (NIR) spectroscopy is that the variations induced by the changes of the measuring system in the prediction data set are inconsistent with those in the calibration data set. In this paper, a method to improve the robustness of the calibration models is proposed, in which the information of the matrix background is introduced as a variable into the calibration data and the three-way tensor is used to build the regression model. The idea of constructing regression models based on the hybrid algorithms consists of two steps. The first is to build a parallel factor (PARAFAC) model with its second-order advantage and calculate the scores and loadings. Then a multivariate linear regression (MLR) calibration model is built from the PARAFAC sample scores combined with the reference concentration values for quantification purposes. For the validation and prediction, the PARAFAC loadings are used to calculate the predicted scores with the validation and prediction data sets, and then the predicted concentration values can be deduced from the MLR model. The proposed method has been successfully applied to two NIR spectroscopy experiments. One is a Monte-Carlo simulation experiment of skin. The changes of the absorption coefficients and scattering coefficients of dermis are considered as the variations of the matrix background. The other is an in vitro experiment including glucose, haemoglobin and albumin solutions and the mixed composition solutions. The determination coefficients and root mean square error of prediction (RMSEP) values obtained from the PARAFAC-MLR models are compared with those obtained from traditional chemometrics tools such as partial least squares (PLS). The results show that the PLS model cannot handle uncalibrated variations whereas the way of introducing the matrix background to generate tensor data and the regression method based on the combination of PARAFAC and MLR perform better in model robustness and prediction precision.

Key words: near-infrared spectroscopy, parallel factor (PARAFAC), multivariate linear regression (MLR), Monte-Carlo, glucose, background