高级检索

多任务网络融合多层信息的目标定位

Multi-Task Network Combing Multi-Level Information for Object Localization

  • 摘要: 目标定位是计算机视觉中的关键问题,针对目前区域卷积神经网络及其扩展算法的精度受限于输出卷积图尺寸、不能得到目标准确位置的问题,提出一种基于多任务卷积神经网络的目标定位算法.在特征提取阶段,不同的任务共享相同的特征提取层;然后分别针对不同层次信息训练对应的后续网络;在提取出高层和低层信息后,融合低层信息和高层信息得到准确的目标位置.在PASCAL VOC 2007数据库和交通场景数据库进行实验的结果表明,该算法可以有效地提高目标定位的精度.

     

    Abstract: Object localization is a very important challenge in computer vision. At present, region convolutional neural networks(RCNN) and extensions had accuracies limited by the size of the output convolution map, hence they could not get exact object locations. In this paper, a new algorithm based on multi-task convolutional neural network was proposed for object localization, firstly different tasks shared feature extraction layer, and then the sub-networks for different levels of information were trained. When the high-level and low-level information were extracted, the accurate target location was obtained by a fusion of these two level information. Finally, the performance of the algorithm was verified on the PASCAL VOC 2007 database and the traffic scene database, and experiments show that the algorithm can effectively improve the accuracy of object localization.

     

/

返回文章
返回