高级检索
杨国烨, 周文洋, 刘兰, 张松海. 基于包围盒回归的图像构图推荐[J]. 计算机辅助设计与图形学学报, 2021, 33(5): 746-754. DOI: 10.3724/SP.J.1089.2021.18560
引用本文: 杨国烨, 周文洋, 刘兰, 张松海. 基于包围盒回归的图像构图推荐[J]. 计算机辅助设计与图形学学报, 2021, 33(5): 746-754. DOI: 10.3724/SP.J.1089.2021.18560
Yang Guoye, Zhou Wenyang, Liu Lan, Zhang Songhai. Bounding Box Regression Based Image Composition Recommendation[J]. Journal of Computer-Aided Design & Computer Graphics, 2021, 33(5): 746-754. DOI: 10.3724/SP.J.1089.2021.18560
Citation: Yang Guoye, Zhou Wenyang, Liu Lan, Zhang Songhai. Bounding Box Regression Based Image Composition Recommendation[J]. Journal of Computer-Aided Design & Computer Graphics, 2021, 33(5): 746-754. DOI: 10.3724/SP.J.1089.2021.18560

基于包围盒回归的图像构图推荐

Bounding Box Regression Based Image Composition Recommendation

  • 摘要: 图像构图推荐旨在找到一幅图像中最具构图美学价值的裁剪,可以辅助拍照者拍摄出构图优美、雅致、协调的照片.由于该任务较难精确、完整地标注出所有优秀构图包围盒,故此前基于神经网络的方法多数并不直接回归构图包围盒,而是通过枚举预制的构图包围盒回归分数,取分数最大者为构图结果.而这一做法会对回归结果精度和算法效率造成负面影响.通过提出一个由特征提取模块、包围盒回归模块和分数回归模块组成的可回归构图包围盒的端到端神经网络模型,并设计相应的数据构造方法、训练方法和损失函数,克服上述难点.选择FCDB和FLMS这2个公开数据集作为测试集,与现有方法相比,该方法在IoU和Disp测试指标上均达到最优.

     

    Abstract: Image composition recommendation aims to find the most aesthetically valuable crop in an image,which can assist the photographer to take beautiful,elegant,and coordinated photos.However,most of the previous methods based on neural network do not generate the composition boxes directly because it is difficult to accurately and completely mark all excellent composition bounding boxes.Instead,these methods first enumerate some pre-made boxes,regress their score and return the box with highest score,which will have a negative impact on the accuracy of regression results and algorithm efficiency.By proposing an end-to-end neural network model consisting of a feature extraction module,a bounding box regression module,and a score regression module,which can regress the composition bounding box,and designing the corresponding data construction method,training method and loss function,the above difficulties have been successfully overcome.In this paper,two public datasets,FCDB and FLMS,are selected as the test set.Compared with the existing methods,this method achieves the best IoU and Disp.

     

/

返回文章
返回