高级检索
吴东, 卢利琼, 熊建芳. 基于SSD框架的自然场景盲文识别方法研究[J]. 计算机辅助设计与图形学学报. DOI: 10.3724/SP.J.1089.2023-00630
引用本文: 吴东, 卢利琼, 熊建芳. 基于SSD框架的自然场景盲文识别方法研究[J]. 计算机辅助设计与图形学学报. DOI: 10.3724/SP.J.1089.2023-00630
Dong Wu, Liqiong Lu, Jianfang Xiong. Research on Natural Scene Braille Recognition Method based on SSD Framework[J]. Journal of Computer-Aided Design & Computer Graphics. DOI: 10.3724/SP.J.1089.2023-00630
Citation: Dong Wu, Liqiong Lu, Jianfang Xiong. Research on Natural Scene Braille Recognition Method based on SSD Framework[J]. Journal of Computer-Aided Design & Computer Graphics. DOI: 10.3724/SP.J.1089.2023-00630

基于SSD框架的自然场景盲文识别方法研究

Research on Natural Scene Braille Recognition Method based on SSD Framework

  • 摘要: 盲文是视障人士学习知识和技术的工具,但正常人却通常对盲文知之甚少,造成正常人和盲人之间的沟通障碍重重。在此背景下构建了自然场景盲文段图像数据集,并基于SSD的框架设计了一种自然场景盲文识别方法。该方法首先分析自然场景图像中的盲文通常尺寸较小排列紧密,且盲文字符的宽高比基本固定;随后结合盲文的特点设计了CNN结构和识别特征层选择策略、默认框大小、盲文字符标签和损失函数以及图像输入策略后提出了自然场景盲文识别方法。在以上盲文段图像数据集上,本文提出的盲文识别方法Hmean值达到了0.796,FPS为66.22;与目标识别领域经典算法SSD和Faster RCNN相比,盲文识别性能提升明显.

     

    Abstract: Braille is a tool for visually impaired individuals to learn knowledge and skills. But normal individuals often have little knowledge of Braille, resulting in numerous communication barriers between normal and visually impaired individuals. In this context, a natural scene Braille segment image dataset was constructed, and a natural scene Braille recognition method was designed based on the SSD framework. This method first analyzed that braille in natural scene images is usually small in size and tightly arranged, and the aspect ratio of braille characters is basically fixed; Subsequently, combining the characteristics of Braille, a CNN structure and recognition feature layer selection strategy, default box size, Braille character labels and loss functions, as well as an image input strategy were designed, and a natural scene braille recognition method was proposed. On the above Braille segment image dataset, the proposed Braille recognition method has gotten an Hmean value of 0.796 and an FPS value of 66.22; Compared with classic algorithms SSD and Faster RCNN in the field of object recognition, Braille recognition performance has been significantly improved.

     

/

返回文章
返回