基于动态指数移动平均的半监督矿工不安全行为识别方法

Semi-supervised miner unsafe behavior recognition method based on dynamic exponential moving average

  • 摘要: 矿工的不安全行为是影响煤矿井下安全生产的主要原因之一,对矿工不安全行为进行识别对于实现井下智能监控至关重要。目前基于深度学习的矿工不安全行为识别方法需要利用大量标注数据进行训练,数据标注消耗大量人力资源。基于半监督学习的识别方法可以有效减少矿工图像的标注成本,但主流的半监督学习方法大多采用指数移动平均(Exponential Moving Average,EMA)对教师模型进行保守更新,使得早期教师模型学习速率较低,导致生成的伪标签质量不高,影响训练效果。为此,设计了基于动态EMA的半监督矿工不安全行为识别方法,结合指数衰减的思想,将EMA中的权重参数设置为随训练批次动态可变,以适应不同阶段的训练。同时,矿井环境昏暗模糊,难以提取矿工信息并且会加剧识别模型分类任务与定位任务的不一致,影响识别精度。针对这一问题,将高效局部注意力(Efficient Local Attention,ELA)融入特征金字塔网络中,构建高效局部注意特征金字塔模块(Efficient Local Attention Feature Pyramid Network,ELA-FPN),提高矿工信息的显著度。为解决矿工不安全行为识别任务中分类与定位不一致的问题,设计特征对齐检测头(Feature Alignment Head,FA-Head )将定位特征映射到分类特征上,提高模型对矿工行为的识别效果。试验表明:在矿工不安全行为数据集使用10%有标签数据时,研究所提算法在不增加模型复杂度的基础上对于矿工不安全行为的识别精度达到71.008%,相较于主流的Unbiased teacher v1、Unbiased teacher v2、Consistent teacher、Dense teacher和ARSL算法分别提高了5.33%、1.76%、2.08%、1.24%和0.40%,且在不同的监督比率下均优于对比算法。可以得出所提算法在矿工不安全行为识别任务上优于目前主流的半监督学习方法,在有效降低标注成本的同时具有较好的识别效果。

     

    Abstract: The unsafe behavior of miners is a key factor influencing the safe production of underground coal mines. The recognition of such behaviors is crucial for achieving intelligent monitoring and enhancing safety in underground operations. Currently, deep learning-based methods for recognizing unsafe behaviors of miners require a large amount of labeled data to train, and data labeling is resource-intensive. Semi-supervised learning-based recognition methods can reduce the labeling cost of miners’ images effectively, but most mainstream semi-supervised learning methods employ Exponential Moving Average (EMA) to update the teacher model conservatively, which results in a slower learning rate for the teacher model in early stages. As a consequence, the quality of the generated pseudo-labels is low, which impacts the training performance considerably. To address this issue, a semi-supervised recognition algorithm based on dynamic EMA is designed to identify miners’ unsafe behavior. Combined with the idea of exponential decay, the weight parameters in EMA are set to vary with the training batch dynamically, which is adaptive to different stages of training. In addition, the mine environment is dim and fuzzy, it is difficult to extract miners’ information, and will aggravate the inconsistency between the classification task and the positioning task of the recognition model, affecting the recognition accuracy. To solve this problem, Efficient Local Attention (ELA) is integrated into the feature pyramid network, and Efficient Local Attention Feature Pyramid Network (ELA-FPN) is constructed to improve the salience of miners’ information. In order to further enhance the consistency between classification and location of miners’ unsafe behaviors, a Feature Alignment Detection Head (FA-Head) is designed to map the location features to the classification features for achieving alignment between classification features, location features and improving the recognition effect on miners’ behaviors. Experiments show that when 10% labeled data is utilized in the miners’ unsafe behavior dataset, the recognition accuracy of the proposed algorithm for the unsafe behaviors of miners reaches 71.008% without increasing the complexity of the model. Compared with the mainstream Unbiased teacher v1, Unbiased teacher v2, Consistent teacher, Dense teacher and ARSL, the results are improved by 5.33%, 1.76%, 2.08%, 1.24% and 0.40%, respectively, and outperformed other state of the arts on the former comparison algorithms under different supervision ratios. It can be concluded that the proposed algorithm is superior to the current mainstream semi-supervised learning method in the task of miners’ unsafe behavior recognition, and has a good recognition effect while the annotation cost is lessened effectively.

     

/

返回文章
返回