Please use this identifier to cite or link to this item: http://hdl.handle.net/10174/36338

Title: Using terms and informal definitions to classify domain entities into top-level ontology concepts: An approach based on language models
Authors: Lopes, Alcides
Carbonera, Joel
Schmidt, Daniela
Garcia, Luan
Rodrigues, Fabricio
Abel, Mara
Keywords: Ontology learning
Top-level ontology
Language model
Issue Date: 11-Feb-2023
Publisher: Elsevier
Citation: Alcides Lopes, Joel Carbonera, Daniela Schmidt, Luan Garcia, Fabricio Rodrigues, Mara Abel, Using terms and informal definitions to classify domain entities into top-level ontology concepts: An approach based on language models, Knowledge-Based Systems, Volume 265, 2023, 110385, ISSN 0950-7051, https://doi.org/10.1016/j.knosys.2023.110385. (https://www.sciencedirect.com/science/article/pii/S0950705123001351) Abstract: The classification of domain entities into top-level ontology concepts remains an activity performed manually by an ontology engineer. Although some works focus on automating this task by applying machine-learning approaches using textual sentences as input, they require the existence of the domain entities in external knowledge resources, such as pre-trained embedding models. In this context, this work proposes an approach that combines the term representing the domain entity and its informal definition into a single text sentence without requiring external knowledge resources. Thus, we use this sentence as the input of a deep neural network that contains a language model as a layer. Also, we present a methodology used to extract two novel datasets from the OntoWordNet ontology based on Dolce-Lite and Dolce-Lite-Plus top-level ontologies. Our experiments show that by using the transformer-based language models, we achieve promising results in classifying domain entities into 82 top-level ontology concepts, with 94% regarding micro F1-score. Keywords: Ontology learning; Top-level ontology; Language model
Abstract: The classification of domain entities into top-level ontology concepts remains an activity performed manually by an ontology engineer. Although some works focus on automating this task by applying machine-learning approaches using textual sentences as input, they require the existence of the domain entities in external knowledge resources, such as pre-trained embedding models. In this context, this work proposes an approach that combines the term representing the domain entity and its informal definition into a single text sentence without requiring external knowledge resources. Thus, we use this sentence as the input of a deep neural network that contains a language model as a layer. Also, we present a methodology used to extract two novel datasets from the OntoWordNet ontology based on Dolce-Lite and Dolce-Lite-Plus top-level ontologies. Our experiments show that by using the transformer-based language models, we achieve promising results in classifying domain entities into 82 top-level ontology concepts, with 94% regarding micro F1-score.
URI: https://www.sciencedirect.com/science/article/pii/S0950705123001351?via%3Dihub
http://hdl.handle.net/10174/36338
Type: article
Appears in Collections:INF - Publicações - Artigos em Revistas Internacionais Com Arbitragem Científica

Files in This Item:

File Description SizeFormat
1-s2.0-S0950705123001351-main.pdf1.48 MBAdobe PDFView/OpenRestrict Access. You can Request a copy!
FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpaceOrkut
Formato BibTex mendeley Endnote Logotipo do DeGóis 

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Dspace Dspace
DSpace Software, version 1.6.2 Copyright © 2002-2008 MIT and Hewlett-Packard - Feedback
UEvora B-On Curriculum DeGois