Transformer与卷积神经网络相结合的皮肤镜图像自动分割算法

韦春苗; 徐岩; 蒋新辉; 魏一铭

doi:10.3724/SP.J.1089.2022.19210

Transformer与卷积神经网络相结合的皮肤镜图像自动分割算法

Automatic Segmentation Algorithm of Dermoscopy Image Based on Transformer and Convolutional Neural Network

摘要

摘要: 皮肤镜图像的病灶区域与背景像素相似度高,且病灶存在形状多样,边缘模糊,人工或毛发遮挡等情况,为了获得更高精度的皮肤病变分割,提出了一种皮肤镜图像自动分割算法.首先,使用ResNet 34提取多种分辨率特征,在上下文部分使用Transformer模块对输入的特征进行全局建模;其次,通过混合池化模块聚合上下文特征的多尺度信息,在对应连接编解码器的跳跃连接间设计一个高效卷积模块以提高跳跃路径的边缘细化和抗干扰能力;最后,利用解码器恢复图像分辨率,并逐层融合其他浅层分辨率特征,利用Focal Loss函数改善难分割目标的精度.文中算法在ISIC2017,ISIC2018数据集上获得的Dice系数、准确率、Jaccard指数、灵敏度得分分别为88.83%,94.77%,81.43%, 88.49%和89.46%, 94.50%, 82.56%, 94.62%,与其他算法相比具有一定的优势,证明了该算法的有效性.

Abstract: The lesion area of the dermoscopic image is highly similar to the background pixels, and there are various shapes, blurred edges, artificial or hair occlusion, etc. In order to obtain higher-precision segmentation of skin lesions, a automatic segmentation algorithm for dermoscopic images is proposed in this paper.Firstly, ResNet 34 is used to extract multiple resolution features, and the Transformer module is used to model globally the input features in the context part. Secondly, the multi-scale information of context features is aggregated through the Mixed Pooling Module. In addition, an efficient convolution module is designed between jump links of the corresponding codec modules to improve the edge refinement and anti-interference ability of the jump path. Finally, the decoder is used to restore the image resolution and fuse other shallow resolution features information, and using Focal Loss function to improve the accuracy of segment targets. The Dice coefficient, accuracy, Jackard index and sensitivity scores obtained by our method on the ISIC 2017 and ISIC 2018 datasets are 88.83%, 94.77%, 81.43%, 88.49% and 89.46%, 94.50%,82.56% and 94.62% respectively. Compared with other algorithms, our method has certain advantages,proving its effectiveness.

HTML全文

参考文献(28)

施引文献

资源附件(0)