Acta Chimica Sinica ›› 2026, Vol. 84 ›› Issue (1): 8-19.DOI: 10.6023/A25080275 Previous Articles     Next Articles

Article

基于MOFid赋能下的AI大数据挖掘高性能化学战剂吸附材料

翁惠琼a,, 黄河a,, 王雯菲a, 李和国b, 李晓鹏b, 张守鑫b, 李树华a, 赵越b,*(), 吴玉芳a,*(), 乔智威a,*()   

  1. a 广州大学化学化工学院, 能源与催化研究所 广东 广州 510006
    b 核生化灾害防护化学全国重点实验室 北京 102205
  • 投稿日期:2025-08-08 发布日期:2025-10-23
  • 基金资助:
    国家自然科学基金项目(22478085); 国家自然科学基金项目(22308069); 国家自然科学基金项目(21978058); 广东省自然科学基金项目(2023A1515240076); 广东省自然科学基金项目(2022A1515011446); 新型反应器与绿色化学工艺湖北省重点实验室开放基金(NRG202407); 广州大学研究生创新能力培养项目(JCCX2024-064)

AI Big-Data Mining Empowered by MOFid for High-Performance Chemical Warfare Agent Adsorbents

Huiqiong Wenga, He Huanga, Wenfei Wanga, Heguo Lib, Xiaopeng Lib, Shouxin Zhangb, Shuhua Lia, Yue Zhaob,*(), Yufang Wua,*(), Zhiwei Qiaoa,*()   

  1. a Guangzhou Key Laboratory for New Energy and Green Catalysis, College of Chemistry and Chemical Engineering, Guangzhou University, Guangzhou 510006, Guangdong, China
    b State Key Laboratory of Chemistry for NBC Hazards Protection, Beijing 102205, China
  • Received:2025-08-08 Published:2025-10-23
  • Contact: * E-mail: zy291857261@126.com;yufang.wu@gzhu.edu.cn;zqiao@gzhu.edu.cn
  • About author:
    These authors contributed equally to this work.
  • Supported by:
    National Natural Science Foundation of China(22478085); National Natural Science Foundation of China(22308069); National Natural Science Foundation of China(21978058); Natural Science Foundation of Guangdong Province(2023A1515240076); Natural Science Foundation of Guangdong Province(2022A1515011446); Open Fund of the Hubei Key Laboratory of Novel Reactor and Green Chemical Technology(NRG202407); Graduate Student Innovation Ability Training Program of Guangzhou University(JCCX2024-064)

To efficiently capture low-concentration chemical warfare agents (CWAs), which pose severe threats to human health and the environment, this study employed an AI big-data-driven high-throughput computational screening strategy to systematically analyze and evaluate the capture and separation performance of tens of thousands of metal-organic framework (MOF) adsorbents for trace CWAs in air. Grand Canonical Monte Carlo simulations were employed to evaluate the CWAs uptake of 15333 Computation-ready experimental MOFs under a gas mixture containing N2, O2, and toxic gas at 298 K and 101.325 kPa. A trade-off score (TSN) combining selectivity and adsorption capacity was introduced as a comprehensive performance metric. Predictive models were constructed using six algorithms (Decision Tree, Random Forest, Gradient Boosting Regression Tree, Extreme Gradient Boosting (XGB), Backpropagation Neural Network, and Light Gradient Boosting Machine) based on seven key descriptors of MOFs—namely, Largest Cavity Diameter (LCD), density of MOF (ρMOF), Pore-Limiting Diameter (PLD), porosity (φ), Volumetric Surface Area (VSA), Henry Coefficient (K), and heat of adsorption (Q0st). And the XGB algorithm yielded the best predictive performance, with the R2 up to 0.923. ​​SHapley Additive exPlanations analysis revealed that K was the most critical descriptor for CWAs adsoprtion, with optimal φ and VSA ranges for maximal sarin and soman adsorption​​ being 0.7~0.8 and 2000~2500 m2•cm-3, respectively​​. Then, the structural commonalities of the top 1% high-performance MOFs were analyzed by integrating the XGB algorithm with the MOFid standardized identifier. The results reveal that the synergistic effect between open transition metal sites (​​particularly Cr, Nb, Al, In, Li​​) and rigid organic linkers of MOFs enhances the adsorption affinity towards CWAs. Meanwhile, the high occurrence probability of specific topological structures (​​such as sql and kgm​​) indicates that they are conducive to forming favorable pore structures, thus enhancing the adsorption effect. ​​Guided by these insights, 26 novel MOFs were rational designed for enhanced CWAs capture. This study, employing AI big-data mining driven by MOFid method, provides critical guidance for optimizing MOF adsorption performance and screening highly efficient materials, thereby facilitating the efficient capture of trace chemical warfare agents in air ​​and advancing the development of protective equipment​​.

Key words: AI big-data mining, MOFid, high-throughput screening, chemical warfare agents, machine learning, molecular simulation, metal-organic frameworks