|
Please use this identifier to cite or link to this item:
http://hdl.handle.net/10174/36338
|
Title: | Using terms and informal definitions to classify domain entities into top-level ontology concepts: An approach based on language models |
Authors: | Lopes, Alcides Carbonera, Joel Schmidt, Daniela Garcia, Luan Rodrigues, Fabricio Abel, Mara |
Keywords: | Ontology learning Top-level ontology Language model |
Issue Date: | 11-Feb-2023 |
Publisher: | Elsevier |
Citation: | Alcides Lopes, Joel Carbonera, Daniela Schmidt, Luan Garcia, Fabricio Rodrigues, Mara Abel,
Using terms and informal definitions to classify domain entities into top-level ontology concepts: An approach based on language models,
Knowledge-Based Systems,
Volume 265,
2023,
110385,
ISSN 0950-7051,
https://doi.org/10.1016/j.knosys.2023.110385.
(https://www.sciencedirect.com/science/article/pii/S0950705123001351)
Abstract: The classification of domain entities into top-level ontology concepts remains an activity performed manually by an ontology engineer. Although some works focus on automating this task by applying machine-learning approaches using textual sentences as input, they require the existence of the domain entities in external knowledge resources, such as pre-trained embedding models. In this context, this work proposes an approach that combines the term representing the domain entity and its informal definition into a single text sentence without requiring external knowledge resources. Thus, we use this sentence as the input of a deep neural network that contains a language model as a layer. Also, we present a methodology used to extract two novel datasets from the OntoWordNet ontology based on Dolce-Lite and Dolce-Lite-Plus top-level ontologies. Our experiments show that by using the transformer-based language models, we achieve promising results in classifying domain entities into 82 top-level ontology concepts, with 94% regarding micro F1-score.
Keywords: Ontology learning; Top-level ontology; Language model |
Abstract: | The classification of domain entities into top-level ontology concepts remains an activity performed manually by an ontology engineer. Although some works focus on automating this task by applying machine-learning approaches using textual sentences as input, they require the existence of the domain entities in external knowledge resources, such as pre-trained embedding models. In this context, this work proposes an approach that combines the term representing the domain entity and its informal definition into a single text sentence without requiring external knowledge resources. Thus, we use this sentence as the input of a deep neural network that contains a language model as a layer. Also, we present a methodology used to extract two novel datasets from the OntoWordNet ontology based on Dolce-Lite and Dolce-Lite-Plus top-level ontologies. Our experiments show that by using the transformer-based language models, we achieve promising results in classifying domain entities into 82 top-level ontology concepts, with 94% regarding micro F1-score. |
URI: | https://www.sciencedirect.com/science/article/pii/S0950705123001351?via%3Dihub http://hdl.handle.net/10174/36338 |
Type: | article |
Appears in Collections: | INF - Publicações - Artigos em Revistas Internacionais Com Arbitragem Científica
|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
|