一种U型煤岩显微组分组语义分割网络模型

王培珍; 周峰; 张克; 汪旋; 张代林

doi:10.13225/j.cnki.jccs.2024.1233

摘要: 煤岩显微组分组的语义分割是采用图像技术对其显微组分组含量进行自动测定的重要前提。为了提高煤岩显微组分组识别的准确性，根据煤岩显微图像的特点，基于U-Net网络模型的架构，提出一种用于煤岩显微组分组语义分割的U型深度学习模型。首先，采用基于组混合注意力机制搭建的Transformer主干网络构建U型网络的编码器，并通过多个尺度特征的聚合及自注意力机制实现特征信息更全面的聚合，使模型对复杂图像中的结构信息更敏感。其次，在编码器末端级联空洞空间卷积池化金字塔模块，以在不降低特征图分辨率的前提下扩大感受野。最后，构建邻层减法模块以聚合不同尺度的编码器和解码器之间的差异特征，通过改进层级间信息聚合的方式，更有效地整合高层次的语义信息与低层次的细节信息，提供更丰富的特征表达，使模型在具有复杂结构的煤岩显微图像和多尺度目标的分割上具有更好的精度和鲁棒性。针对现有方法多数针对单一煤种或单一目标，构建了包含不同煤化程度的煤岩显微组分组语义分割数据集，采用该数据集训练所提语义分割模型，实现了适应于多种煤阶的煤岩显微组分组的多目标同时分割与解释。结果表明：所提出的网络其mDice系数可达93.39%，平均交并比(mIoU)为88.19%，像素准确率(PA)为96.50%，比U-Net原型网络分别提升了4.48%、7.32%和2.82%。将由模型计算所得煤岩显微组分组的数据与人工测定的结果进行比较，极差小于4%的占比为95.3%，验证了所提出的语义分割模型在煤岩组分组语义分割中的适应性与稳定性。

Abstract: The semantic segmentation of coal maceral groups is an important prerequisite for automatically determining the content of each group using image technology. In order to improve the accuracy of coal maceral groups recognition, according to the characteristics of coal microscopic images, a U-type deep learning model based on U-Net network model architecture is proposed. Firstly, a Transformer backbone network based on group mixing attention mechanism is employed to construct the encoder of the U-type net. By multiple scale features aggregating and self-attention mechanism, more comprehensive features are aggregated, the model is more sensitive to structural information of complex image. Secondly, an atrous spatial pyramid pooling module is cascaded at the end of the encoder to expand the receptive field without reducing the resolution of the feature map. Finally, an adjacent layer subtraction module is constructed to aggregate the differential features between encoders and decoders of different scales, by modified the aggregation method of information between levels, high-level semantic information and low-level detail information are integrated more effectively, and richer feature expressions are provided. By integrating multi-scale information, the accuracy and robustness of the model in the segmentation of complex scenes and multi-scale targets of coal microscopic images are improved. In response to the fact that most existing methods focus on single coal type and single object a semantic segmentation dataset containing coal maceral groups with different coal ranks is constructed. After training the proposed semantic segmentation model with the dataset, the simultaneous segmentation and interpretation of multi-objective of maceral groups suitable for multiple coal ranks are realized. The results showed that the proposed model can achieve a mDice coefficient of 93.39%, an average intersection over union (mIoU) of 88.19%, and a pixel accuracy (PA) of 96.50%, which are 4.48%, 7.32%, and 2.82% higher than those with the original U-Net network. Comparing the content data of coal maceral groups calculated by the model with the results of manual identification, 95.3% of the data are with the range 4%, indicating the applicability and stability of the proposed model in the coal maceral groups semantic segmentation.

一种U型煤岩显微组分组语义分割网络模型

A U-type semantic segmentation network model for coal maceral grouping

相关链接