Acta Chimica Sinica ›› 2007, Vol. 65 ›› Issue (2): 152-158. Previous Articles     Next Articles

Original Articles

正交信号校正在正常成人血清1H NMR谱的代谢组分析中的滤噪作用评价

冒海蕾1,3, 徐旻2, 王斌3, 王惠民3, 邓小明*,1, 林东海*,2   

  1. (1第二军医大学长海医院 上海 200433)
    (2中国科学院上海药物研究所 上海 201203)
    (3南通大学附属医院 南通 226001)
  • 收稿日期:2006-04-03 修回日期:2006-06-19 出版日期:2007-01-28 发布日期:2007-01-28
  • 通讯作者: 林东海

Evaluation of Filtering Effects of Orthogonal Signal Correction on Metabonomic Analysis of Healthy Human Serum 1H NMR Spectra

MAO Hai-Lei1,3; XU Min2; WANG Bin3; WANG Hui-Min3; DENG Xiao-Ming*,1; LIN Dong-Hai*,2   

  1. (1 Changhai Hospital, Secondary Military Medical University, Shanghai 200433)
    (2 Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203)
    (3 Affiliated Hospital, Nantong University, Nantong 226001)
  • Received:2006-04-03 Revised:2006-06-19 Online:2007-01-28 Published:2007-01-28
  • Contact: LIN Dong-Hai

Three different pattern recognition methods before and after orthogonal signal correction (OSC) were employed to perform the metabonomic analysis of 1H NMR spectra recorded on healthy human sera, in order to explore the potential of applying 1H NMR-based metabonomics to clinical research. At first, 78 healthy human sera were collected after a routine fasting for 8 h, and the corresponding 1D 1H NMR spectra were recorded on a Varian Unity INOVA-600 spectrometer, and then three pattern recognition analyses, PCA (principal component analysis), PLS-DA (partial least squares-discriminant analysis), and SIMCA (soft independent modeling of class analogy), were performed, respectively. In spite of no specific sample-collecting restriction on foods, life styles, and physiological cycles, the PLS-DA method after OSC is able to distinguish the NMR metabonomic profiles of male sera from those of female sera, more perfectly than both the PCA and SIMCA. Furthermore, the major NMR integral regions relevant to gender classification from PLS-DA after OSC were identical with those from PLS-DA without OSC filter in the literature. In the figure of displaying the variation of PLS-DA model before OSC and after removing different OSC latent variables (LVs), the eigenvalues of the first and second OSC-removed LVs were much greater than others. After removing two LVs by OSC, the remaining sum of square (RSS) in the X block was 20.82%, that is, 79.18% information unrelated to Y was removed from the PLS-DA model. Meanwhile, the LV number of PLS-DA model attained to one; while the LV number was two for the model with the first LV being removed by OSC, and three for the model without OSC. R2X, R2Y, and Q2 (cum) are usually used to evaluate the quality of PLS-DA model. R2X and R2Y are the fraction of the sum of square of the entire X’s and Y’s explained by the current LV of PLS-DA, and represent the variance of X and Y variables, respectively; while Q2 is cross validated R2. Q2 (cum) reflects the cumulative cross-validated percent of the total variation of the X’s and Y’s that can be predicted by the current LV of PLS-DA model. In our study, after OSC filtering the first two LVs, R2X reached the minimum, suggesting that the least systematic variance should be present in the current PLS-DA model. Meanwhile, both R2Y and Q2 (cum) were always higher than 80%, indicative of the good quality of the PLS-DA model. Obviously, OSC is capable of eliminating the influence of dietary and environmental factors, and decreasing the heterogeneity of samples, which is fairly useful and important for clinical investigations. Additionally, the appropriate number of OSC-removed LVs should be determined on the basis of RSS in the X block, eigenvalue of OSC-removed latent variables, LV number and the qualitative indicators of the PLS-DA model.

Key words: NMR, metabonomics, pattern recognition, orthogonal signal correction, sera