Acta Chimica Sinica ›› 2022, Vol. 80 ›› Issue (5): 614-624.DOI: 10.6023/A22010031 Previous Articles     Next Articles

Article

机器学习与分子模拟协同的CH4/H2分离金属有机框架高通量计算筛选

王诗慧, 薛小雨, 程敏, 陈少臣, 刘冲, 周利, 毕可鑫, 吉旭*()   

  1. 四川大学化学工程学院 成都 610065
  • 投稿日期:2022-01-16 发布日期:2022-05-31
  • 通讯作者: 吉旭
  • 基金资助:
    国家自然科学基金青年基金(22108178)

High-Throughput Computational Screening of Metal-Organic Frameworks for CH4/H2 Separation by Synergizing Machine Learning and Molecular Simulation

Shihui Wang, Xiaoyu Xue, Min Cheng, Shaochen Chen, Chong Liu, Li Zhou, Kexin Bi, Xu Ji()   

  1. School of Chemical Engineering, Sichuan University, Chengdu 610065
  • Received:2022-01-16 Published:2022-05-31
  • Contact: Xu Ji
  • Supported by:
    Young Scientists Fund of the National Natural Science Foundation of China(22108178)

In this work, a hierarchical screening strategy by synergizing machine learning (ML) and molecular simulation was proposed to identify the optimal adsorbents for CH4/H2 separation from 134185 hypothetical metal-organic frameworks (MOFs). At the initial screening, MOF materials with inappropriate pore size and/or volumetric surface area were removed from the total database, resulting in a list of 62278 MOFs. Among them, 10% MOFs were randomly chosen and grand canonical Monte Carlo (GCMC) simulations were performed to calculate the adsorption behaviors of CH4/H2 mixture in these MOFs under vacuum swing adsorption (VSA) and pressure swing adsorption (PSA) conditions. Following this, structural/ chemical descriptors and corresponding adsorbent performance scores (APS) of the selected MOFs were employed to develop the random forest (RF) models for VSA and PSA processes. Compared with the accuracy of other ML algorithms, covering support vector machine, k-nearest neighbor, decision tree, and artificial neural network, the proposed model exhibits the optimum predictive power. Meanwhile, the hybrid of structural and chemical descriptors, as well as the application of the preliminary screening strategy improve the accuracy of the RF model. Thus, it was used to predict the APS values of the remaining 90% MOFs in the next stage of screening, and the top 1000 candidates were screened out according to the results. GCMC simulations were subsequently carried out on the top candidates to refine the predictions, and then ten MOFs with the best CH4/H2 separation performance were obtained under VSA and PSA conditions, respectively. The high performance of the optimal MOFs was verified by comparison with well-studied MOF materials in the literature. Finally, the feature importance of the descriptors was interpreted by the Shapley Additive Explanations. The result reveals the potential for the developed model to transfer between the two operating conditions due to the consistency of the dominant descriptors, which also provides an efficient pathway for rapid screening of promising MOF adsorbents in CH4/H2 separation suitable for different operation scenarios.

Key words: metal-organic frameworks, CH4/H2 separation, molecular simulation, machine learning, interpretability