Repositório Digital de Publicações Científicas: The impact of NLP techniques in the multilabel text classification problem

Please use this identifier to cite or link to this item: http://hdl.handle.net/10174/2558

Title:	The impact of NLP techniques in the multilabel text classification problem
Authors:	Gonçalves, Teresa; Quaresma, Paulo
Keywords:	machine learning; text classification
Issue Date:	2004
Publisher:	Springer-Verlag
Abstract:	Support Vector Machines have been used successfully to classify text documents into sets of concepts. However, typically, linguistic information is not being used in the classification process or its use has not been fully evaluated. We apply and evaluate two basic linguistic procedures (stop-word removal and stemming/lemmatization) to the multilabel text classification problem. These procedures are applied to the Reuters dataset and to the Portuguese juridical documents from Supreme Courts and Attorney General’s Office.
URI:	http://hdl.handle.net/10174/2558
Type:	article
Appears in Collections:	INF - Artigos em Livros de Actas/Proceedings

Files in This Item:

File	Description	Size	Format
tcg04a-impactNLP.pdf	Article	164.65 kB	Adobe PDF