高级检索

基于特征增强与多项式插值的目标检测网络

Object Detection Network based on Feature Enhancement and Polynomial Interpolation

  • 摘要: 针对Deformable DETR仅采用ResNet进行简单特征提取限制了后续模块检测效果的问题, 提出一种基于特征增强与多项式差值的目标检测网络. 首先引入特征提取模块同时提取图像局部和全局信息, 帮助网络更精确地捕捉图像关键特征; 然后设计双注意力模块根据需求动态地调整特征通道和空间位置的权重, 使网络能够聚焦于对当前任务更为重要的图像区域; 最后提出一种多项式插值方法拟合目标点周围更多的特征向量, 计算生成更高质量的特征向量. 在COCO数据集上采用一致的实验条件进行实验, 相比Deformable DETR, 所提网络平均检测精度提升至44.8%, 大目标检测精度提升1.9个百分点, 各项检测精度均得到提升, 并且优于对比的其他系列网络.

     

    Abstract: Addressing the limitation of Deformable DETR, which solely relies on ResNet for basic feature extraction, thereby constraining the detection performance of subsequent modules, this paper proposes a target detection network based on feature enhancement and polynomial interpolation. Firstly, a feature extraction module is introduced to simultaneously capture both local and global information from images, aiding the network in more accurately identifying key image features. Secondly, a dual attention module is designed to dynamically adjust the weights of feature channels and spatial positions according to requirements, enabling the network to focus on image regions that are more critical to the current task. Lastly, a polynomial interpolation method is proposed to fit more feature vectors around the target points, thereby generating higher-quality feature vectors through computation. Experiments conducted on the COCO dataset under consistent conditions revealed that, compared to Deformable DETR, the proposed network achieves an average detection accuracy of 44.8%, with a 1.9 percentage point increase in large object detection accuracy. All detection accuracy metrics are improved, and the network outperforms other comparable networks in the series.

     

/

返回文章
返回