基于深度学习和模型压缩技术的轻量级煤矿人车检测模型以贵州地区煤矿为例

Lightweight coal miners and manned vehicles detection model based on deep learning and model compression techniques: A case study of coal mines in Guizhou region

  • 摘要: 煤矿工人和载人车辆(煤矿人车)的智能识别是视频监控系统的重要组成部分,也是煤矿智能化发展的关键任务。然而,煤矿人车检测场景较为复杂,大型人车检测模型部署在有限的计算设备上难以实现,如何在模型检测性能和检测效率之间取得平衡存在诸多挑战。以贵州地区煤矿视频监控数据集为例,提出了一种基于深度学习和模型压缩技术的轻量级煤矿人车检测模型,该模型精准实时的完成了煤矿人车检测任务,对网络进行瘦身的同时几乎没有损失检测性能。具体来说,在网络模型设计阶段,以YOLOv8s为基线提出了一种名为FCW-YOLO的煤矿人车轻量级检测模型,首先将Faster-Block和坐标注意力和开发到网络的特征提取模块中,设计了一种新颖的C2f-Faster-CA轻量级架构,旨在减少网络的冗余通道同时自适应捕捉全局关键信息;其次,采用了WIOU边界回归损失函数以增加模型对普通质量样本的关注,降低了训练样本不平衡带来的回归误差等问题。在模型压缩阶段,联动剪枝算法对提出的FCW-YOLO模型进行通道级别的稀疏,模型可自动识别不重要的通道并对其进行删减,实现了煤矿人车检测模型二次轻量化设计FCWP-YOLO。在自建的煤矿人车检测数据集上的结果表明,提出的模型参数量,计算量和模型大小分别为2.3 M,4.0 GFLOPs,6.0 MB,对比基线模型分别实现了4.9、4.7、4.4倍的压缩效果,平均检测精度为88.7%,提高了1.1%,每张图像的处理速度仅为5.6 ms。对比多种轻量级架构和先进的检测模型,该方法精度表现优异,计算成本更低,实时性能更好,为资源受限的煤矿场景提供了一种可行的煤矿人车检测方法,满足煤矿视频监控部署要求,可为煤矿人车智能巡检任务提供实时预警。

     

    Abstract: Intelligent recognition of coal mine workers and manned vehicles (coal mine pedestrian-vehicles) is an important component of video surveillance systems and a key task in the development of coal mine intelligence. However, the detection scene of coal mine pedestrian-vehicles is complex, and deploying large pedestrian-vehicle detection models on limited computing devices is challenging. Balancing between model detection performance and efficiency poses many challenges. This paper proposes a lightweight coal mine pedestrian detection model based on deep learning and model compression techniques. Taking the coal mine video surveillance dataset in Guizhou region as an example. The model accurately and in real-time completes the task of detecting coal mine pedestrian-vehicles, achieving a balance between model detection performance and efficiency. Specifically, in the network model design phase, a lightweight detection model named FCW-YOLO is proposed based on YOLOv8s as the baseline. Faster-Block and coordinate attention are integrated into the feature extraction module of the network, designing a novel C2f-Faster-CA lightweight architecture to reduce redundant channels of the network while adaptively capturing global key information. Furthermore, the WIOU boundary regression loss function is employed to increase the model's focus on common quality samples, addressing issues such as regression errors caused by imbalanced training samples. In the model compression phase, the proposed FCW-YOLO model undergoes channel-level sparsity through a collaborative pruning algorithm, automatically identifying unimportant channels and reducing them, resulting in the FCWP-YOLO model, achieving secondary lightweight design of the coal mine pedestrian-vehicle detection model. Results on a self-built coal mine pedestrian-vehicle detection dataset show that the proposed model has parameters, computational load, and model size of 2.3 M, 4.0 GFLOPs, and 6.0 MB, respectively, achieving compression ratios of 4.9 times, 4.7 times, and 4.4 times compared to the baseline model. The average detection accuracy is 88.7%, an improvement of 1.1%, with a processing speed of only 5.6ms per image. Compared to various lightweight architectures and advanced detection models, this method demonstrates excellent accuracy, lower computational costs, and better real-time performance, providing a feasible coal mine pedestrian-vehicle detection method for resource-constrained coal mine scenarios, meeting the deployment requirements of coal mine video surveillance and enabling real-time alerts for intelligent inspection of coal mine pedestrian-vehicles.

     

/

返回文章
返回