期刊文献+

基于遗传-支持向量机和遗传-径向基神经网络的有机物正辛醇-水分配系数QSPR研究 被引量:14

Research on QSPR for n-Octanol-Water Partition Coefficients of Organic Compounds Based on Genetic Algorithms-Support Vector Machine and Genetic Algorithms-Radial Basis Function Neural Networks
在线阅读 下载PDF
导出
摘要 基于遗传算法(GA)的因子筛选和支持向量机(SVM)的非线性回归,提出了1种改进的有机物定量结构-性质相关(QSPR)建模方法——遗传-支持向量机(GA-SVM),并将其用于38种食品工业常用有机物正辛醇-水分配系数(Kow)的QSPR建模.结果显示,QSPR模型选取了分子量、Hansen极性、沸点、含氧率和含氢率5种参数;模型的预测值与实测值间的误差平方和(SSE)、均方差(RMSE)和决定系数(R2)分别为0.048、0.036和0.999,表明模型具有较强的预测能力;同时,交叉验证的结果(SSE=0.295,RMSE=0.089,R2=0.995)也表明,模型具有良好的稳健性,因此,GA-SVM算法适用于对有机物正辛醇-水分配系数的QSPR建模.此外,将基于GA-SVM的QSPR模型分别与基于遗传-径向基神经网络(GA-RBFNN)和基于线性算法的模型进行了比较,结果表明,应用GA-SVM建立的QSPR模型无论从稳健性还是预测能力上都优于应用其它2种算法建立的模型,因此,GA-SVM算法比GA-RBFNN和线性算法更适合于对有机物正辛醇-水分配系数进行QSPR建模. A modified method to develop quantitative structure-property relationship (QSPR) models of organic compounds was proposed based on genetic algorithm (GA) and support vector machine (SVM) (GA-SVM). GA was used to perform the variable selection, and SVM was used to construct QSPR models. GA-SVM was applied to develop the QSPR models for n-octanol-water partition coefficients ( Kow ) of 38 typical organic compounds in food industry. 5 descriptors (molecular weights, Hansen polarity, boiling point, percent oxygen and percent hydrogen) were selected in the QSPR model. The coefficient of multiple determination (R^2), the sum of squares due to error (SSE) and the root mean squared error (RMSE) values between the measured values and predicted values of the model developed by GA-SVM are 0.999, 0.048 and 0.036, respectively, indicating good predictive capability for lgKow values of these organic compounds. Based on leave-one-out cross validation, the QSPR model constructed by GA-SVM showed good robustness (SSE = 0.295, RMSE = 0.089, R^2 = 0.995). Moreover, the models developed by GA-SVM were compared with the models constructed by genetic algorithm-radial basis function neural network (GARBFNN) and linear method. The models constructed by GA-SVM show the optimal predictive capability and robustness in the comparison, which illustrates GA-SVM is the optimal method for developing QSPR models for lgKow values of these organic compounds.
出处 《环境科学》 EI CAS CSCD 北大核心 2008年第1期212-218,共7页 Environmental Science
基金 国家重点基础研究发展规划(973)项目(2003CB415204)
关键词 定量结构-性质相关(QSPR) 正辛醇-水分配系数(Kow) 遗传算法(GA) 支持向量机(SVM) quantitative structure-property relationship (QSPR) n-octanol-water partition coefficients ( Kow ) genetic algorithms (GA) support vector machine (SVM)
  • 相关文献

参考文献28

  • 1Tehrany E A, Fournier F, Desobry S. Simple method to calculate octanol-water partition coefficient of organic compounds [J]. J Food Eng, 2004, 64: 315-320.
  • 2Pocas M D F, Hogg T. Exposure assessment of chemicals from packaging materials in foods: a review[J]. Trends Food Sci Tech, 2007, 18(4): 219-230.
  • 3Triantafyllou V I, Akrida-Demertzi k, Demertzis p G. A study on the migration of organic pollutants from recycled paperboard packaging materials to solid food matrices[J]. Food Chem, 2007, 101(4) : 1759-1768.
  • 4Durjava M K, Ter Laak T L, Hermens J L, et al. Distribution of PAHs and PCBs to dissolved organic matter: High distribution coefficients with consequences for environmental fate modeling [ J ]. Chemosphere, 2007, 67(5): 990-997.
  • 5周霞,余刚,黄俊,张祖麟,胡洪营.北京东南郊化工区土壤和植物中氯苯类有机物的残留及分布特征[J].环境科学,2007,28(2):249-254. 被引量:20
  • 6Barbour J P, Smith J A, Chiou C T. Sorption of aromatic organic pollutants to grasses from water[ J]. Environ Sci Technol, 2005, 39 (21) : 8369-8373.
  • 7Turner A, Williamson I. On the relationship between Dow and Kow in natural waters[J]. Environ Sci Technol, 2005, 39 (22): 8719-8727.
  • 8隆兴兴,牛军峰,史姝琼.邻苯二甲酸酯类化合物正辛醇-水分配系数的QSPR研究[J].环境科学,2006,27(11):2318-2322. 被引量:20
  • 9王斌,赵劲松,郁亚娟,王晓栋,王连生.取代联苯的定量结构活性相关及联合毒性研究[J].环境科学,2004,25(3):89-93. 被引量:5
  • 10Gramatica P, Giani E, Papa E. Statistical external validation and consensus modeling: A QSPR case study for Kow prediction [ J ]. J Mol Graphics Modell, 2007, 25 (6) :755-766.

二级参考文献126

  • 1贡雪东,王剑,肖鹤鸣.硝酸酯几何构型、生成热和电子结构的PM3研究[J].高等学校化学学报,1994,15(12):1817-1820. 被引量:10
  • 2周霞,余刚,张祖麟,牛军峰.北京通惠河水和表层沉积物中氯苯类有机物污染现状[J].环境科学,2005,26(2):117-120. 被引量:29
  • 3蔡全英,莫测辉,李云辉,曾巧云,王伯光,肖凯恩,李海芹,徐国生.广州、深圳地区蔬菜生产基地土壤中邻苯二甲酸酯(PAEs)研究[J].生态学报,2005,25(2):283-288. 被引量:141
  • 4[1]Boedeker W, Altenburger R, Faust M, Grimme L H. Synopsis of Concepts and Models for the Quantitative Analysis of Combination Effects: from Biometrics to Ecotoxicology[J]. Arch. Complex Environ. Stud.,1992,4 (3):45~53.
  • 5[2]Backhaus T, Scholze M, Grimme L H. The Single Substance and Mixture Toxicity of Quinolines to the Bioluminescent Bacterium Vibrio fischeri[J]. Aquat. Toxicol.,2000,49:49~61.
  • 6[3]Faust M,Altenburger R,Backhaus T. Predicting the Joint Algal Toxicity of Multi- component s-triazine Mixtures at Low-effect Concentrations of Individual Toxicants[J]. Aquat. Toxicol.,2001,21:13~32.
  • 7[4]Altenburger R,Backhaus T,Boedeker W. Predictability of the Toxicity of Multiple Chemical Mixtures to Vibrio fischeri: Mixtures Composed of Similarly Acting Chemicals Environ[J]. Toxicol. Chem.,2000,19(9):2341~2347.
  • 8[5]Backhaus T,Altenburger R,Boedeker W. Predictability of the Toxicity of A Multiple Mixture of Dissimilarly Acting Chemicals to Vibrio fischeri[J]. Environ. Toxicol. Chem.,2000,19(9):2348~2356.
  • 9[6]Xu S, Nirmalakhandan N. Use of QSAR models in predicting joint effects in multi-component mixtures of mixtures of organic chemicals[J]. Wat. Res.,1998,32(8): 2391~2399.
  • 10[7]Altenburger R,Nendza M,Schüürmann G. Mixture Toxicity and its Modeling by Quantitative Structure-Activity Relationships[J]. Environ. Contam. Toxicol.,2003,22(8):1900~1915.

共引文献89

同被引文献204

引证文献14

二级引证文献62

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部