Please use this identifier to cite or link to this item:
http://hdl.handle.net/10174/2558
|
Title: | The impact of NLP techniques in the multilabel text classification problem |
Authors: | Gonçalves, Teresa Quaresma, Paulo |
Keywords: | machine learning Text classification |
Issue Date: | 2004 |
Publisher: | Springer-Verlag |
Abstract: | Support Vector Machines have been used successfully to classify text documents into sets of concepts. However, typically, linguistic information is not being used in the classification process or its use has not been fully evaluated.
We apply and evaluate two basic linguistic procedures (stop-word removal and stemming/lemmatization) to the multilabel text classification problem.
These procedures are applied to the Reuters dataset and to the Portuguese juridical documents from Supreme Courts and Attorney General’s Office. |
URI: | http://hdl.handle.net/10174/2558 |
Type: | article |
Appears in Collections: | INF - Artigos em Livros de Actas/Proceedings
|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
|