高级检索
姚鹏飞, 魏育坤, 路昊, 王素琴. 基于混合尺度非局部注意力的纹理图像超分辨率[J]. 计算机辅助设计与图形学学报, 2023, 35(10): 1479-1488. DOI: 10.3724/SP.J.1089.2023.19488
引用本文: 姚鹏飞, 魏育坤, 路昊, 王素琴. 基于混合尺度非局部注意力的纹理图像超分辨率[J]. 计算机辅助设计与图形学学报, 2023, 35(10): 1479-1488. DOI: 10.3724/SP.J.1089.2023.19488
Yao Pengfei, Wei Yukun, Lu Hao, Wang Suqin. Super-Resolution Reconstruction of Texture Image Based on Mixed-Scale Non-Local Attention[J]. Journal of Computer-Aided Design & Computer Graphics, 2023, 35(10): 1479-1488. DOI: 10.3724/SP.J.1089.2023.19488
Citation: Yao Pengfei, Wei Yukun, Lu Hao, Wang Suqin. Super-Resolution Reconstruction of Texture Image Based on Mixed-Scale Non-Local Attention[J]. Journal of Computer-Aided Design & Computer Graphics, 2023, 35(10): 1479-1488. DOI: 10.3724/SP.J.1089.2023.19488

基于混合尺度非局部注意力的纹理图像超分辨率

Super-Resolution Reconstruction of Texture Image Based on Mixed-Scale Non-Local Attention

  • 摘要: 相较于一般图像, 纹理图像细节特征尺度小、密度大, 导致低分辨率下会丢失更多高频信息, 影响超分辨率重建的效果. 基于此, 提出一种利用混合尺度非局部注意力的纹理图像超分辨率方法. 首先, 在跨尺度非局部注意力的基础上提出等尺度非局部注意力, 用于在整幅图像中挖掘等尺度相似特征块的高频信息, 为解决 2 种注意力并行部署带来的计算操作与参数量较多的问题, 设计参数共享的方法, 将 2 种注意力合并为混合尺度非局部注意力(MSNLA); 其次, 通过通道投影的方式将 MSNLA 生成的不同尺度的相似特征与输入特征图融合; 最后, 利用非局部特征融合重建的方法将 MSNLA 提取到的特征组合后进行超分辨率重建. 实验结果表明, 在 DTD 数据集上, 该方法相较于 CSNLN 算法的 PSNR 提高了 0.16 dB, 模型参数量减少了约 10.3%, 并且重建图像取得了更好的视觉效果.

     

    Abstract: Compared with ordinary images, the local detail of texture images has a small scale while high density, which may lose high-frequency details at low-resolution, thus affecting the effect of super-resolution image reconstruction. To solve this problem, we presented a super-resolution method for texture images using Mixed-Scale Non-Local Attention (MSNLA). Firstly, we proposed Equal-Scale Non-Local attention (ESNLA) based on Cross-Scale Non-Local Attention (CSNLA) to extract the high-frequency information of equal-scale similar feature blocks in the whole image. Besides, considering that deploying parallelized non-local attention modules will bring heavy computational burden and will increase the number of parameters, we proposed a parameter sharing method that combined CSNLA and ESNLA, namely MSNLA. Secondly, we fused the similar feature of different scales generated by MSNLA into the input feature map using channel projection. Finally, we combined the features extracted by MSNLA for super-resolution reconstruction using non-local feature fusion. Experimental results on Describable Texture Dataset (DTD) demonstrate that our proposed algorithm improve the PSNR by 0.16 dB while reducing the number of model parameters by about 10.3% with better visual effect.

     

/

返回文章
返回