Patrick McGuire: Image-Grounded Conversations: Multimodal Context for Natural Question and Response Generation. (arXiv:1701.08251v1 [cs.CL])

Monday, January 30, 2017

The popularity of image sharing on social media reflects the important role visual context plays in everyday conversation. In this paper, we present a novel task, Image-Grounded Conversations (IGC), in which natural-sounding conversations are generated about shared photographic images. We investigate this task using training data derived from image-grounded conversations on social media and introduce a new dataset of crowd-sourced conversations for benchmarking progress. Experiments using deep neural network models trained on social media data show that the combination of visual and textual context can enhance the quality of generated conversational turns. In human evaluation, a gap between human performance and that of both neural and retrieval architectures suggests that IGC presents an interesting challenge for vision and language research.

from cs.AI updates on arXiv.org http://ift.tt/2kaGK2v
via IFTTT

