基于改进U−Net的煤矸图像分割模型与放煤控制技术

Coal gangue image recognition model based on improved U−Net and top coal caving control

  • 摘要: 煤矸识别技术是综放工作面实现智能化的关键技术之一,同时也是该领域面临的一个重要挑战。针对目前煤矸图像数据集整体质量差、数据规模小、煤矸图像分割模型检测速度慢、识别精度低等问题,参考实际综放工作面搭建了大尺寸等比例综放开采相似模拟平台,基于该平台建立了煤矸图像采集系统,采集构建了高清仿真综放工作面煤矸图像数据集,提出一种基于特征金字塔网络(FPN)和空洞空间金字塔池化(ASPP)的改进U−Net煤矸分割模型,提高了煤矸图像的分割精度。通过在U−Net模型的跳跃连接中添加FPN模块,同时在解码器部分引入ASPP模块,建立了FPN−ASPP−U−Net煤矸分割模型,消融试验验证了FPN模块和ASPP模块对U−Net模型性能的提升。结果表明:FPN−ASPP−U−Net模型分割效果最好,均准确率(MA)为97.29%,均F1得分(MF1)为97.44%,均交并比(MI)为95.65%,模型参数量(MP)为29.64 M,浮点运算量(F)为341.29 G,每秒帧数(f)为41.1 f/s,与U−Net模型相比,MIMF1MA分别提升了2.64%、1.06%和1.15%,模型参数量仅仅增加了0.33 M,改进后的模型在图像分割速度上有少量提升。设计了FPN−ASPP−U−Net模型与PSPNet、SegFormer、DeepLabV3+、PSANet语义分割模型的图像分割效果对比试验,结果表明:FPN−ASPP−U−Net模型对煤矸图像分割的性能最好,同时模型整体计算参数量最小,在分割精度和分割速度之间有着较好的平衡。对于粉尘影响下的不清晰图像,采用暗通道与高斯加权相结合的方法对图像数据集进行去雾增强,轻度粉尘、中度粉尘、重度粉尘去雾前后的模型对煤的分割精度提高了14.81%、17.79%、23.62%,对矸的分割精度提高了11.73%、14.50%、14.86%。基于研究结论提出了FPN−ASPP−U−Net模型的煤矸图像混矸率计算方法,开展了煤矸图像分割控制放煤试验,以混矸率20%作为放煤口关闭的阈值,单次放煤口开关期间真实混矸率与模型预测混矸率平均误差率为4.71%,验证了基于煤矸图像混矸率对放煤控制的可行性。最后,封装模型代码研发了煤矸图像智能识别软件,设计了煤矸分割现场应用方案,在榆树田煤矿110501综放工作面进行了图像控制放煤试验,验证了该方法能够对煤矸图像进行精准分割,对放煤口开关进行合理控制,提高了综放工作面的智能化水平,为推动煤矿进一步智能化建设提供了有效的技术手段与参考价值。

     

    Abstract: Coal gangue identification technology is a critical component for the automation of fully mechanized mining faces and represents a significant challenge within this domain. To address the challenges of suboptimal quality and limited scale of existing coal gangue image datasets, as well as the slow detection speeds and low recognition accuracy of coal gangue image segmentation models, a large-scale, proportionally accurate simulation platform for fully mechanized mining faces has been established, drawing from real-world mining scenarios. Utilizing this platform, a coal gangue image acquisition system has been developed to construct a high-resolution, simulated dataset of coal gangue images for fully mechanized top coal caving face. An advanced U−Net−based coal gangue segmentation model has been developed, incorporating Feature Pyramid Networks (FPN) and Atrous Spatial Pyramid Pooling (ASPP). This approach significantly improves the segmentation accuracy of coal gangue images. By incorporating the Feature Pyramid Networks (FPN) module into the skip connections of the U−Net architecture and integrating the Atrous Spatial Pyramid Pooling (ASPP) module within the decoder stage, a novel FPN-ASPP−U−Net coal gangue segmentation model has been developed. Ablation studies confirmed that the integration of the FPN and ASPP modules significantly enhances the performance of the U−Net model. The FPN−ASPP−U−Net model exhibits superior segmentation efficacy, achieving a mean accuracy (MA) of 97.29%, a mean F1-score (MF1) of 97.44%, and a mean Intersection over Union (MI) of 95.65%. The model's parameter count is 29.64 M, with FLOPs (F) 341.29 G and a frame rate (f) of 41.1 frames per second. Relative to the baseline U−Net model, the MI, MF1, and MA are improved by 2.64%, 1.06%, and 1.15%, respectively, with only a marginal increase of 0.33 M. This enhancement results in a modest improvement in image segmentation speed. A rigorous comparative analysis was conducted to evaluate the performance of the FPN−ASPP−U−Net model relative to PSPNet, SegFormer, DeepLabV3+, and PSANet for image segmentation tasks. The results substantiate that the FPN−ASPP−U−Net model delivers superior performance in coal gangue image segmentation, while also maintaining the lowest overall computational parameter count. This model demonstrates a well-balanced compromise between segmentation accuracy and computational efficiency, thereby optimizing both precision and processing speed in practical scenarios. In response to image degradation caused by dust, a hybrid dehazing approach leveraging dark channel prior combined with Gaussian weighting was implemented on the image dataset. This methodology significantly enhanced segmentation accuracy. Specifically, for coal, the segmentation accuracy improved by 14.81%, 17.79%, and 23.62% under light, moderate, and severe dust conditions, respectively. For gangue, segmentation accuracy saw improvements of 11.73%, 14.50%, and 14.86% under the same dust conditions. These enhancements demonstrate the effectiveness of the proposed dehazing strategy in mitigating dust-related artifacts and improving segmentation performance across varying levels of dust intensity. Building on the research conclusions, a method for calculating the gangue mixture rate in coal gangue images utilizing the FPN−ASPP−U−Net model was developed. A control experiment for the drawing opening was performed, with a threshold of 20% gangue mixture rate established for closing the drawing opening. During a single operational cycle of the drawing opening, the average discrepancy between the actual and model-predicted gangue mixture rates was 4.71%, thereby confirming the viability of employing gangue mixture rate measurements from coal gangue images for effective discharge control. Finally, the encapsulated model code facilitated the development of an intelligent software solution for coal gangue image recognition. An application framework for gangue segmentation in operational environments was designed, and an image-based discharge control experiment was conducted at the 110501 fully mechanized mining face of the Yushutian coal mine. The validation of this methodology demonstrated its capability to perform accurate segmentation of coal gangue images and to facilitate rational control of the drawing opening. This advancement has significantly enhanced the automation level of the fully mechanized mining face, providing a robust technical framework and valuable reference for advancing the intelligent development of coal mining operations.

     

/

返回文章
返回