高级检索
赵国庆, 董天阳, 童程凯, 沈冰雁. 面向复杂光照的舞台演员检测[J]. 计算机辅助设计与图形学学报.
引用本文: 赵国庆, 董天阳, 童程凯, 沈冰雁. 面向复杂光照的舞台演员检测[J]. 计算机辅助设计与图形学学报.
Actor Detection Method for Complex Lighting Stage[J]. Journal of Computer-Aided Design & Computer Graphics.
Citation: Actor Detection Method for Complex Lighting Stage[J]. Journal of Computer-Aided Design & Computer Graphics.

面向复杂光照的舞台演员检测

Actor Detection Method for Complex Lighting Stage

  • 摘要: 复杂舞台场景存在多个光源产生的偏色和光照不均匀问题, 严重影响了演员检测的精度. 针对上述问题, 提出了一种基于伪多模态融合的演员检测方法. 首先, 随机选取一种光照处理方法构建增强图像, 与原图像构成伪多模态图像对; 其次, 在增强图像中以演员关键点建立候选集合, 从集合中随机选取部分关键点所在的区域构建增强补丁集合, 并将补丁替换到原始图像中进行训练; 然后, 在传统特征金字塔的基础上, 借鉴Transformer编码器的构建形式, 利用视觉注意力模块构建视觉注意力编码器, 强化多尺度特征的交互逻辑. 在自建的4 543张舞台演员数据集上, 与三个模型进行了组合, 均值平均精度分别提升了0.4%~2.9%, 结果表明所提方法能够较好地降低偏色和不均匀光照的影响.

     

    Abstract: In complex stage scenes, the accuracy of actor detection is severely affected by color bias and uneven illumination problems generated by multiple light sources. To solve these problems, an actor detection method based on pseudo-multimodal fusion is proposed. First, a random illumination method is used to construct an enhanced image to form a pseudo-multimodal image pair with the original image; second, an actor key point candidate set is created in the enhanced image, and some key points are randomly selected from the set to construct enhanced patches, and the patches are replaced into the original image for training. Then, based on the traditional feature pyramid, the visual attention module is used to construct visual attention encoders to enhance the interaction logic of multi-scale features. On a self-constructed dataset of 4 543 stage actors, the average accuracy is improved  0.4%~2.9% by combining the three models, and the results show that the proposed method can better reduce the effects of color bias and uneven illumination.

     

/

返回文章
返回