高级检索
唐乾坤, 胡瑜. 基于正负锚点框均衡及特征对齐的单阶段目标检测算法[J]. 计算机辅助设计与图形学学报, 2020, 32(11): 1773-1783. DOI: 10.3724/SP.J.1089.2020.18175
引用本文: 唐乾坤, 胡瑜. 基于正负锚点框均衡及特征对齐的单阶段目标检测算法[J]. 计算机辅助设计与图形学学报, 2020, 32(11): 1773-1783. DOI: 10.3724/SP.J.1089.2020.18175
Tang Qiankun, Hu Yu. PosNeg-Balanced Anchors with Aligned Features for Single-Shot Object Detection[J]. Journal of Computer-Aided Design & Computer Graphics, 2020, 32(11): 1773-1783. DOI: 10.3724/SP.J.1089.2020.18175
Citation: Tang Qiankun, Hu Yu. PosNeg-Balanced Anchors with Aligned Features for Single-Shot Object Detection[J]. Journal of Computer-Aided Design & Computer Graphics, 2020, 32(11): 1773-1783. DOI: 10.3724/SP.J.1089.2020.18175

基于正负锚点框均衡及特征对齐的单阶段目标检测算法

PosNeg-Balanced Anchors with Aligned Features for Single-Shot Object Detection

  • 摘要: 针对正负例锚点框不均衡将降低基于锚点框的单阶段目标检测算法的检测精度的问题,提出一种包含锚点框提升模块和特征对齐模块来均衡正负例锚点框的算法.首先在锚点框提升模块中预测各个锚点框为正例的可能性,并粗略调整初始锚点框的位置和尺寸;然后在特征对齐模块中为调整后的锚点框提取预测所需的对齐特征;最后检测网络借助锚点框提升模块输出信息,从调整后的锚点框中识别出简单负例锚点框,并在训练阶段忽略其梯度.将文中算法应用于以VGG-16和ResNet-101为特征提取网络的编解码架构中,在目标检测数据集MS COCO和PASCAL VOC上进行实验,结果表明,该算法能够显著改善不均衡问题,提高单阶段目标检测算法的检测精度(MS COCO和PASCAL VOC上的精度分别为42.8%和82.7%),并维持28.6帧/s的实时运行速度.

     

    Abstract: We introduce a novel single-shot object detector to ease the imbalance of foreground-background class by suppressing the easy negatives while increasing the positives.To achieve this,we propose an anchor promotion module(APM)which predicts the probability of each anchor as positive and adjusts their initial locations and shapes to promote both the quality and quantity of positive anchors.In addition,we design an efficient feature alignment module to extract aligned features for fitting the promoted anchors with the help of both location and shape transformation information from the APM.The probabilities from APM are helpful for the detection classifier to identify the easy negatives and to ignore their gradients during training.We assemble the proposed modules to the backbone of VGG-16 and ResNet-101 network with an encoder-decoder architecture.Extensive experiments on MS COCO and PASCAL VOC well demonstrate our model performs competitively with alternative methods(42.8%mAP on MS COCO and 82.7%mAP on PASCAL VOC)and can run at 28.6 FPS.

     

/

返回文章
返回