|
Please use this identifier to cite or link to this item:
http://hdl.handle.net/10174/27059
|
Title: | Event extraction and representation: A case study for the portuguese language |
Authors: | Quaresma, Paulo Nogueira, Vítor Raiyani, Kashyap Bayot, Roy |
Keywords: | Events Information extraction Natural language processing Ontologies population Text mining |
Issue Date: | Jun-2019 |
Publisher: | MDPI AG |
Abstract: | Text information extraction is an important natural language processing (NLP) task, which aims to automatically identify, extract, and represent information from text. In this context, event extraction plays a relevant role, allowing actions, agents, objects, places, and time periods to be identified and represented. The extracted information can be represented by specialized ontologies, supporting knowledge-based reasoning and inference processes. In this work, we will describe, in detail, our proposal for event extraction from Portuguese documents. The proposed approach is based on a pipeline of specialized natural language processing tools; namely, a part-of-speech tagger, a named entities recognizer, a dependency parser, semantic role labeling, and a knowledge extraction module. The architecture is language-independent, but its modules are language-dependent and can be built using adequate AI (i.e., rule-based or machine learning) methodologies. The developed system was evaluated with a corpus of Portuguese texts and the obtained results are presented and analysed. The current limitations and future work are discussed in detail. |
URI: | http://hdl.handle.net/10174/27059 |
Type: | article |
Appears in Collections: | INF - Publicações - Artigos em Revistas Internacionais Com Arbitragem Científica
|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
|