.

Combining offline and on-the-fly disambiguation to perform semantic-aware XML querying

LAUR Repository

Show simple item record

dc.contributor.author Tekli, Joe
dc.contributor.author Tekli, Gilbert
dc.contributor.author Chbeir, Richard
dc.date.accessioned 2024-08-20T10:47:53Z
dc.date.available 2024-08-20T10:47:53Z
dc.date.copyright 2023 en_US
dc.date.issued 2023
dc.identifier.issn 2406-1018 en_US
dc.identifier.uri http://hdl.handle.net/10725/15998
dc.description.abstract Many efforts have been deployed by the IR community to extend freetext query processing toward semi-structured XML search. Most methods rely on the concept of Lowest Comment Ancestor (LCA) between two or multiple structural nodes to identify the most specific XML elements containing query keywords posted by the user. Yet, few of the existing approaches consider XML semantics, and the methods that process semantics generally rely on computationally expensive word sense disambiguation (WSD) techniques, or apply semantic analysis in one stage only: performing query relaxation/refinement over the bag of words retrieval model, to reduce processing time. In this paper, we describe a new approach for XML keyword search aiming to solve the limitations mentioned above. Our solution first transforms the XML document collection (offline) and the keyword query (on-the-fly) into meaningful semantic representations using context-based and global disambiguation methods, specially designed to allow almost linear computation efficiency. We use a semantic-aware inverted index to allow semantic-aware search, result selection, and result ranking functionality. The semantically augmented XML data tree is processed for structural node clustering, based on semantic query concepts (i.e., key-concepts), in order to identify and rank candidate answer sub-trees containing related occurrences of query key-concepts. Dedicated weighting functions and various search algorithms have been developed for that purpose and will be presented here. Experimental results highlight the quality and potential of our approach. en_US
dc.language.iso en en_US
dc.title Combining offline and on-the-fly disambiguation to perform semantic-aware XML querying en_US
dc.type Article en_US
dc.description.version Published en_US
dc.author.school SOE en_US
dc.author.idnumber 201306321 en_US
dc.author.department Electrical And Computer Engineering en_US
dc.relation.journal Computer Science and Information Systems en_US
dc.journal.volume 20 en_US
dc.journal.issue 1 en_US
dc.article.pages 423-457 en_US
dc.keywords Semi-structured data en_US
dc.keywords XML en_US
dc.keywords Semantic Analysis en_US
dc.keywords Semantic Disambiguation en_US
dc.keywords Keyword Search en_US
dc.keywords Query Processing en_US
dc.identifier.doi https://doi.org/10.2298/CSIS220228063T en_US
dc.identifier.ctation Tekli, J., Tekli, G., & Chbeir, R. (2023). Combining offline and on-the-fly disambiguation to perform semantic-aware XML querying. Computer Science and Information Systems, 20(1), 423-457. en_US
dc.author.email joe.tekli@lau.edu.lb en_US
dc.identifier.tou http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php en_US
dc.identifier.url https://doiserbia.nb.rs/Article.aspx?id=1820-02142200063T en_US
dc.orcid.id https://orcid.org/0000-0003-3441-7974 en_US
dc.author.affiliation Lebanese American University en_US


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search LAUR


Advanced Search

Browse

My Account