Please use this identifier to cite or link to this item: http://hdl.handle.net/10174/24454

Title: A multi- versus a single-classifier approach for the identification of modality in the portuguese language
Authors: Sequeira, João
Gonçalves, Teresa
Quaresma, Paulo
Mendes, Amália
Hendrickx, Iris
Issue Date: 2018
Publisher: ELRA - European Language Resources Association
Citation: oão Sequeira, Teresa Gonçalves, Paulo Quaresma, Amália Mendes, and Iris Hendrickx. A multi- versus a single-classifier approach for the identification of modality in the portuguese language. In Nicoletta Calzolari, Khalid Choukri, Christopher Cieri, Thierry De- clerck, Sara Goggi, Kôiti Hasida, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asunción Moreno, Jan Odijk, Stelios Piperidis, and Takenobu Tokunaga, editors, Proceedings of the Eleventh International Conference on Language Resources and Eval- uation, LREC 2018, Miyazaki, Japan, May 7-12, 2018. European Language Resources Association (ELRA), 2018.
Abstract: This work presents a comparative study between two different approaches to build an automatic classification system for Modality values in the Portuguese language. One approach uses a single multi-class classifier with the full dataset that includes eleven modal verbs; the other builds different classifiers, one for each verb. The performance is measured using precision, recall and F 1 . Due to the unbalanced nature of the dataset a weighted average approach was calculated for each metric. We use support vector machines as our classifier and experimented with various SVM kernels to find the optimal classifier for the task at hand. We experimented with several different types of feature attributes representing parse tree information and compare these complex feature representation against a simple bag-of-words feature representation as baseline. The best obtained F 1 values are above 0.60 and from the results it is possible to conclude that there is no significant difference between both approaches.
URI: http://hdl.handle.net/10174/24454
Type: lecture
Appears in Collections:INF - Comunicações - Em Congressos Científicos Internacionais

Files in This Item:

File Description SizeFormat
A_Multi__versus_a_Single_classifier_Approach_for_the_Identification_of_Modality_in_the_Portuguese_Language.pdf136.35 kBAdobe PDFView/OpenRestrict Access. You can Request a copy!
FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpaceOrkut
Formato BibTex mendeley Endnote Logotipo do DeGóis 

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Dspace Dspace
DSpace Software, version 1.6.2 Copyright © 2002-2008 MIT and Hewlett-Packard - Feedback
UEvora B-On Curriculum DeGois