|
Please use this identifier to cite or link to this item:
http://hdl.handle.net/10174/32120
|
Title: | A Portuguese Dataset for Evaluation of Semantic Question Answering |
Authors: | Araújo, Denis Rigo, Sandro Quaresma, Paulo Muniz, João Henrique |
Issue Date: | 2020 |
Publisher: | Springer |
Citation: | Denis Andrei de Araujo, Sandro José Rigo, Paulo Quaresma, and João Henrique Muniz.
A portuguese dataset for evaluation of semantic question answering. In Paulo Quaresma,
Renata Vieira, Sandra M. Aluı́sio, Helena Moniz, Fernando Batista, and Teresa Gonçalves,
editors, Computational Processing of the Portuguese Language - 14th International Con-
ference, PROPOR 2020, Evora, Portugal, March 2-4, 2020, Proceedings, volume 12037 of
Lecture Notes in Computer Science, pages 217–227. Springer, 2020. |
Abstract: | Research on question answering tools over open linked data is increas-
ing, and that brings the necessity of resources to allow for the evaluation and
comparison of such systems. Question Answering over Linked Data (QALD) is a
traditional benchmark event that occurs annually since 2011. However, although
the multilingual task is available since its third edition, there is a necessity to foster
the actual Portuguese Language resources present in this event benchmark. In this
paper, we describe the development of the Portuguese language as a QALD corpus
complement. The corpus is based on an existing QALD multilingual corpus and
comprises 258 sentences used for the event challenge in 2017. We constructed a
second corpus to allow direct comparison with the DBPedia Portuguese content.
The main topics to highlight are the adopted methodology, which results in corpus
related to frequent Brazilian Portuguese use of the language, and the work on
adapting the answers to the DBPedia PT knowledge base, providing a corpus to
evaluate Portuguese QA systems accurately. |
URI: | http://hdl.handle.net/10174/32120 |
Type: | article |
Appears in Collections: | INF - Publicações - Artigos em Revistas Internacionais Com Arbitragem Científica
|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
|