基于YOLO的矿井外因火灾早期识别轻量化算法

王凯; 卢锋; 王纪硕; 黄露露; 王炎林; 马艳飞

doi:10.13225/j.cnki.jccs.CQ25.0863

摘要: 煤矿井下外因火灾致灾因素众多且一旦发火致灾后果严重，火灾隐患的早期识别与处置成为风险防控的有效手段，但井下环境干扰因素繁多，且当前无公开的煤矿井下外因火灾数据集，使得现有目标检测算法对外因火灾早期识别的速度与检测精度面临诸多挑战。为解决矿井复杂环境下外因火灾早期难以精准识别的难题，基于You Only Look Once v8s(YOLOv8s)将DCNv3可变形卷积模块嵌入到特征提取网络中，引入RepGFPN重参数化特征金字塔，并添加P₂小目标检测层等措施构建了VI–YOLO模型，同时构建了6400张多场景火灾图像、矿井无火图像和矿井模拟火灾图像的可见光火灾探测数据集。通过开展VI–YOLO模型与其他主流单阶段目标检测算法对比实验，基于OpenVINO与TensorRT对VI–YOLO模型进行推理加速，验证了模型在CPU设备和GPU设备部署应用的可行性，并将模型部署于低算力平台Jetson Nano B01中，验证了边缘端部署的可行性。结果表明，VI–YOLO模型相比基线模型mAP@0.5提升了2.4%，召回率提升0.8%，多尺度火灾特征的有效检测能力更佳；VI–YOLO模型mAP@0.5达90.1%，大幅超越同类单阶段目标检测算法，VI–YOLO模检测速度为25 f/s，满足了井下视觉区域内火灾实时检测需求；在OpenVINO框架进行结构优化和模型精度量化后，半精度模型的推理时间为49.6 ms，减少了301.9 ms，检测速度达到19 f/s，速度提升约6.3倍；在TensorRT框架进行结构优化和模型精度量化，半精度模型的推理时间为4.6 ms，减少了32.4 ms，检测速度为118 f/s，速度提升4.7倍；在Jetson Nano B01部署及TensorRT框架下，单精度模型推理速度提升至12.3 f/s，mAP@0.5为97.6%，半精度模型推理速度提升至15.2 f/s，mAP@0.5为97.5%，为模型在CPU、GPU设备和低算力设备的部署应用提供了新思路。

Abstract: There are numerous factors that can cause external fires in underground coal mines, and once a fire breaks out, the consequences are severe. Early identification and handling of fire hazards have become effective means of risk control. However, the underground environment is full of interfering factors, and there is currently no publicly available dataset for external fires in underground coal mines. This poses many challenges to the speed and detection accuracy of existing target detection algorithms in the early identification of external fires. To solve the problem of the difficulty in accurately identifying external fires in complex mine environments at an early stage, the VI–YOLO model was constructed by embedding the DCNv3 deformable convolution module into the feature extraction network based on You Only Look Once v8s (YOLOv8s), introducing the RepGFPN reparameterized feature pyramid, and adding a P₂ small target detection layer. A visible light fire detection dataset was simultaneously constructed, which included 6 400 multi-scenario fire images, mine non-fire images, and simulated mine fire images. Through conducting a comparison experiment of the VI–YOLO model with other mainstream single-stage target detection algorithms, the VI–YOLO model was accelerated for inference using OpenVINO and TensorRT, and its feasibility in deployment and application on CPU devices and GPU devices was verified. The model was deployed on the low-performance platform Jetson Nano B01, verifying the feasibility of edge-end deployment. The results show that the VI–YOLO model has improved mAP@0.5 by 2.4% compared to the baseline model, with a recall rate increase of 0.8%, and has better detection capabilities for multi-scale fire features. The mAP@0.5 of the VI–YOLO model reaches 90.1%, significantly surpassing similar single-stage target detection algorithms. The detection speed of the VI–YOLO model is 25 f/s, meeting the real-time detection requirements for fires in the underground visual area. After structural optimization and model accuracy quantification in the OpenVINO framework, the inference time of the half-precision model is 49.6 ms, reducing 301.9 ms, and the detection speed reaches 19 f/s, with a speed increase of approximately 6.3 times. After structural optimization and model accuracy quantification in the TensorRT framework, the inference time of the half-precision model is 4.6 ms, reducing 32.4 ms, and the detection speed is 118 f/s, with a speed increase of 4.7 times. In the Jetson Nano B01 deployment and TensorRT framework, the inference speed of the single-precision model is increased to 12.3 f/s, with a mAP@0.5 of 97.6%, and the inference speed of the half-precision model is increased to 15.2 f/s, with a mAP@0.5 of 97.5%, providing new ideas for the deployment and application of the model on CPU, GPU devices, and low-performance devices.

基于YOLO的矿井外因火灾早期识别轻量化算法

Lightweight algorithm for early recognition of mine exogenous fire based on YOLO

相关链接