高级检索

      基于不同机器学习算法的水位异常值检测

      Detection of water level anomaly values based on different machine learning algorithms

      • 摘要: 为提高水文监测的准确性与可靠性,基于不同机器学习算法(孤立森林、随机森林和k近邻),对山西省东榆林、韩家楼、新绛、赵城水文站的水位数据进行异常值检测。综合考虑水位数据时间序列、季节性波动等特征,针对不同水文站数据的特性和异常值分布情况对各算法进行精细优化,并采用F1分数和ROC曲线对各算法性能进行评估。结果表明:3种算法在不同水文站的水位异常值检测中均具有一定的分类能力;其中随机森林算法凭借其集成学习特性,F1分数达到0.923~0.950,AUC值均达到1,整体检测性能最佳。研究成果可为水资源的科学管理和防洪减灾提供参考。

         

        Abstract: To enhance the accuracy and reliability of hydrological monitoring, this study employed different machine learning algorithms, Isolation Forest, Random Forest, and k-Nearest Neighbors, to detect anomalies of water level data from Dongyulin, Hanjialou, Xinjiang, and Zhaocheng Hydrological Stations in Shanxi Province. By comprehensively considering characteristics such as time series and seasonal fluctuations of water level data, the algorithms were finely optimized according to the data characteristics and anomaly distribution patterns of different hydrological stations. The performance of each algorithm was evaluated using the F1 score and ROC curve. The results indicated that all three algorithms demonstrated certain classification capabilities in detecting water level anomalies across different hydrological stations. Notably, the Random Forest algorithm, leveraging its ensemble learning properties, achieved F1 scores ranging from 0.923 to 0.950 and AUC values to 1.0 across all four stations, exhibiting the best overall detection performance. The research results can provide references for the scientific management of water resources and flood prevention and disaster reduction.

         

      /

      返回文章
      返回