Feedback Attention Model for Image Captioning

Lyu Fan; Hu Fuyuan; Zhang Yanning; Xia Zhenping; Sheng Victor S

doi:10.3724/SP.J.1089.2019.17505

Lyu Fan, Hu Fuyuan, Zhang Yanning, Xia Zhenping, Sheng Victor S. Feedback Attention Model for Image Captioning[J]. Journal of Computer-Aided Design & Computer Graphics, 2019, 31(7): 1122-1129. DOI: 10.3724/SP.J.1089.2019.17505


Abstract

Image captioning aims to have a machine generate a relevant sentence for a given image, and it has been applied to service robots. To improve captioning performance, some researchers leverage the attention mechanism. However, this mechanism often suffers from distraction and sentence disorder. In this paper, we propose an image captioning model based on a novel feedback attention mechanism. When generating the language for a given image, the proposed model uses attention feedback from the generated language. This feedback revises the attention heatmap of the original image, which in turn improves the generated sentence. We evaluate the proposed method on three benchmark datasets, Flickr8k, Flickr30k, and MSCOCO, and the experimental results show the superiority of the proposed method.
