Patrick McGuire: Generating Factoid Questions With Recurrent Neural Networks: The 30M Factoid Question-Answer Corpus. (arXiv:1603.06807v1 [cs.CL])

Tuesday, March 22, 2016

Over the past decade, large-scale supervised learning corpora have enabled machine learning researchers to make substantial advances. However, to this date, there are no large-scale question-answer corpora available. In this paper we present the 30M Factoid Question-Answer Corpus, an enormous question-answer pair corpus produced by applying a novel neural network architecture on the knowledge base Freebase to transduce facts into natural language questions. The produced question-answer pairs are evaluated both by human evaluators and using automatic evaluation metrics, including well-established machine translation and sentence similarity metrics. Across all evaluation criteria the question-generation model outperforms the competing template-based baseline. Furthermore, when presented to human evaluators, the generated questions appear to be indistinguishable from real human-generated questions.
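To make the idea of turning a knowledge-base fact into a question-answer pair concrete, here is a minimal sketch of what a template-based baseline might look like. It is not the paper's method or its actual baseline: the `Fact` type, the relationship names, and the templates are all hypothetical, chosen only for illustration. It assumes a fact is a (subject, relationship, object) triple and that the object serves as the answer.

```python
from typing import NamedTuple, Optional, Tuple


class Fact(NamedTuple):
    """A knowledge-base fact as a (subject, relationship, object) triple."""
    subject: str
    relationship: str
    obj: str


# Hypothetical hand-written templates, one per relationship.
# These names and wordings are assumptions, not taken from Freebase or the paper.
TEMPLATES = {
    "place_of_birth": "Where was {subject} born?",
    "author": "Who wrote {subject}?",
}


def fact_to_qa(fact: Fact) -> Optional[Tuple[str, str]]:
    """Return a (question, answer) pair, or None if no template covers the relationship."""
    template = TEMPLATES.get(fact.relationship)
    if template is None:
        return None
    # The question is built from the subject; the object is the answer.
    return template.format(subject=fact.subject), fact.obj


if __name__ == "__main__":
    print(fact_to_qa(Fact("Marie Curie", "place_of_birth", "Warsaw")))
```

A fixed template per relationship produces repetitive phrasing and cannot handle relationships it has no template for, which is the kind of limitation a learned question-generation model is meant to address.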

from cs.AI updates on arXiv.org http://ift.tt/1Ujqzi0
via IFTTT
