高级检索
谭鑫, 齐福霖, 王楠, 张志忠, 谢源, 马利庄. 基于视觉失真的玻璃表面检测方法[J]. 计算机辅助设计与图形学学报. DOI: 10.3724/SP.J.1089.2023-00342
引用本文: 谭鑫, 齐福霖, 王楠, 张志忠, 谢源, 马利庄. 基于视觉失真的玻璃表面检测方法[J]. 计算机辅助设计与图形学学报. DOI: 10.3724/SP.J.1089.2023-00342
Xin TAN, Fulin Qi, Nan Wang, Zhizhong Zhang, Yuan Xie, Lizhuang Ma. Glass Surface Detection Method Based on Visual Distortion[J]. Journal of Computer-Aided Design & Computer Graphics. DOI: 10.3724/SP.J.1089.2023-00342
Citation: Xin TAN, Fulin Qi, Nan Wang, Zhizhong Zhang, Yuan Xie, Lizhuang Ma. Glass Surface Detection Method Based on Visual Distortion[J]. Journal of Computer-Aided Design & Computer Graphics. DOI: 10.3724/SP.J.1089.2023-00342

基于视觉失真的玻璃表面检测方法

Glass Surface Detection Method Based on Visual Distortion

  • 摘要: 随着科技水平的不断提高, 人们对于准确解析复杂现实场景的需求日益增加. 尽管前沿的语义分割方法在常规物体检测过程中取得了重要成果, 但仍在玻璃表面检测任务中受到挑战. 玻璃具有独特的物理和化学性质, 导致其不具备自身的特有视觉外观和明显的边界. 玻璃表面在现实场景中借用其后方的任意场景或物体的外观, 使得检测过程变得困难. 目前, 虽然有一些方法尝试利用额外的训练输入或参照物边界信息进行玻璃表面检测, 但这些方法仅在满足训练要求的情况下有效. 因此, 本文旨在利用玻璃本身的材质特性, 在无需额外训练输入和参照物信息的情况下实现玻璃表面的有效检测. 本文关注玻璃造成的图像失真现象, 据此设计了视觉失真感知模块. 该模块提取多尺度的视觉失真信息, 并进行有效融合, 以引导主干网络关注玻璃表面导致的图像失真区域, 从而定位玻璃表面. 此外, 本文还注意到玻璃表面的结构信息有助于预测玻璃掩膜的细化, 通过分类子任务的形式利用玻璃表面质心信息, 辅助进行预测玻璃掩膜细化. 相比与前沿方法, 本文在广泛使用的玻璃表面数据集上关于标准评估指标上均取得了较大的提升.

     

    Abstract:  As technology continues to advance, the demand for accurately parsing complex real-world scenes is increasing. Although state-of-the-art semantic segmentation methods have achieved remarkable success in normal object detection, they are often challenged in glass surface detection. Glass has unique physical and chemical properties that make it lack its own distinct visual appearance and clear boundaries. Glass surfaces borrow the appearance of any scene or object behind them in real-world scenes, making the detection process difficult. Although some methods attempt to use additional training inputs or reference boundary information for glass surface detection, these methods are only effective with certain training requirements. Therefore, this paper aims to achieve effective glass surface detection without the need for additional training inputs or reference object information by utilizing the intrinsic properties of glass itself. This paper focuses on the image distortion caused by glass and designs a visual distortion aware module based on this phenomenon. The module extracts multi-scale visual distortion information and effectively integrates it to guide the backbone to focus on the areas of image distortion caused by glass surfaces, thereby locating the glass surfaces. In addition, this paper also notes that the structural information of glass surfaces helps to refine the prediction of glass masks, and utilizes the centroid information of glass surfaces in the form of a classification subtask to assist in predicting glass masks. Compared with state-of-the-art methods, this paper achieved significant improvements in standard evaluation metrics for the widely-used glass surface datasets.

     

/

返回文章
返回