Tuesday, February 14, 2017

Hadamard Product for Low-rank Bilinear Pooling. (arXiv:1610.04325v3 [cs.CV] UPDATED)

Bilinear models provide richer representations than linear models. They have been applied to various visual tasks, such as object recognition, segmentation, and visual question-answering, achieving state-of-the-art performance by taking advantage of the expanded representations. However, bilinear representations tend to be high-dimensional, which limits their applicability to computationally complex tasks. We propose low-rank bilinear pooling using the Hadamard product as an efficient attention mechanism for multimodal learning. We show that our model outperforms compact bilinear pooling on visual question-answering tasks, with state-of-the-art results on the VQA dataset, while being more parsimonious.
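The core idea can be illustrated with a small sketch. If the bilinear weight matrix W is constrained to rank d and factored as W = U V^T, the score x^T W y equals the sum of the element-wise (Hadamard) product of the two projections U^T x and V^T y, so W never has to be stored in full. The code below is only a minimal illustration of that identity, not the paper's full model (which is not described in this abstract); the function names, the dimensions N, M, d, and the random inputs are all assumptions made for the example.

```python
import random


def project(M, v):
    """Return M^T v, where M is a list of rows and v has one entry per row."""
    rows, cols = len(M), len(M[0])
    return [sum(M[i][k] * v[i] for i in range(rows)) for k in range(cols)]


def low_rank_bilinear(x, y, U, V):
    """Low-rank bilinear score: sum of the Hadamard product of U^T x and V^T y."""
    px = project(U, x)  # length d
    py = project(V, y)  # length d
    return sum(a * b for a, b in zip(px, py))


def full_bilinear(x, y, U, V):
    """Reference score x^T W y with W = U V^T built explicitly (N x M entries)."""
    d = len(U[0])
    W = [[sum(U[i][k] * V[j][k] for k in range(d)) for j in range(len(V))]
         for i in range(len(U))]
    return sum(x[i] * W[i][j] * y[j]
               for i in range(len(x)) for j in range(len(y)))


# Hypothetical sizes: x has N features, y has M features, rank d.
N, M, d = 6, 5, 3
rng = random.Random(0)
U = [[rng.uniform(-1, 1) for _ in range(d)] for _ in range(N)]
V = [[rng.uniform(-1, 1) for _ in range(d)] for _ in range(M)]
x = [rng.uniform(-1, 1) for _ in range(N)]
y = [rng.uniform(-1, 1) for _ in range(M)]

score_low_rank = low_rank_bilinear(x, y, U, V)
score_full = full_bilinear(x, y, U, V)

# Parameter counts: full W versus the two factors U and V.
params_full = N * M
params_low_rank = d * (N + M)
```

Both functions return the same score up to floating-point rounding. With these toy sizes the factored form actually uses more parameters than the full matrix (33 versus 30); the saving appears only when d is much smaller than N and M, as would be the case for high-dimensional image and question features.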



from cs.AI updates on arXiv.org http://ift.tt/2ed1inE
via IFTTT
