机器学习技术在新污染物可疑和非靶向筛查分析中的应用

穆洪新; 张后虎; 陈玲; 吴兵; 卜元卿

doi:10.19741/j.issn.1673-4831.2025.0335

机器学习技术在新污染物可疑和非靶向筛查分析中的应用

Application of Machine Learning Techniques in Non-Targeted Screening Analysis of Emerging Contaminants

摘要

摘要: 新污染物结构多样且大部分缺乏分析标准品，传统的靶向分析方法无法检测标准品之外的物质，因此使用高分辨质谱(HRMS)进行可疑和非靶向筛查，对环境中新污染物的全面识别至关重要。传统分析方法难以处理HRMS获取的海量数据，复杂质谱数据解析与物质鉴定已成为环境分析化学的核心挑战。机器学习作为一种强大的数据处理和模式识别工具，在提升新污染物可疑和非靶向筛查效率与精度方面展现出巨大的应用潜力。该研究系统梳理了机器学习技术在可疑和非靶向筛查全流程分析中的创新应用与最新进展，聚焦质谱数据预处理、分子式智能分配、保留时间预测、浓度定量分析等关键环节，阐释了机器学习及深度学习算法在提升筛查效率与准确性方面的作用。未来应将机器学习技术纳入可疑和非靶向筛查全流程分析，以便更全面地研究新污染物的环境暴露特征。

Abstract: The structural diversity of emerging contaminants and the absence of analytical standards for certain compounds limit the capability of traditional targeted approaches to detect substances beyond predefined reference standards. Consequently, the application of high-resolution mass spectrometry (HRMS)-based suspect and non-targeted screening has become indispensable for comprehensive identification of emerging contaminants in environmental matrices. However, traditional analysis methods are difficult to process the massive data obtained by HRMS, and complex mass spectrometry data analysis and substance identification have become the core challenges of environmental analytical chemistry. As a powerful data processing and pattern recognition tool, machine learning provides great application potential in improving the efficiency and accuracy of suspect and non-targeted screening of emerging contaminants. In this paper, we systematically review the innovative applications and recent advances of machine learning techniques in the full process analysis of suspect and non-targeted screening, focusing on key aspects such as raw mass spectrometry data pre-processing, intelligent molecular formula assignment, retention time prediction and quantitative concentration analysis, and comprehensively illustrate the role of conventional machine learning and deep learning algorithms in improving the efficiency and accuracy of screening. Future efforts should prioritize the integration of machine learning into the entire suspect and non-targeted screening workflow to enable more holistic investigations into the environmental exposure characteristics of emerging contaminants.

HTML全文

参考文献(37)

施引文献

资源附件(0)