.

Almost Linear Semantic XML Keyword Search

LAUR Repository

Show simple item record

dc.contributor.author Tekli, Joe
dc.contributor.author Tekli, Gilbert
dc.contributor.author Chbeir, Richard
dc.date.accessioned 2024-11-05T09:54:41Z
dc.date.available 2024-11-05T09:54:41Z
dc.date.copyright 2021 en_US
dc.date.issued 2021-11-09
dc.identifier.isbn 9781450383141 en_US
dc.identifier.uri http://hdl.handle.net/10725/16278
dc.description.abstract Many efforts have been deployed by the IR community to extend free-text query processing toward semi-structured XML search. Most methods rely on the concept of Lowest Comment Ancestor (LCA) between two or multiple structural nodes to identify the most specific XML elements containing query keywords posted by the user. Yet, few of the existing approaches consider XML semantics, and the methods that process semantics generally rely on computationally expensive word sense disambiguation (WSD) techniques, or apply semantic analysis in one stage only: performing query relaxation/refinement over the bag of words retrieval model, to reduce processing time. In this paper, we describe the building blocks of a new approach for XML keyword search aiming to solve the limitations mentioned above. Our solution first transforms the XML document collection (offline) and the keyword query (on-the-fly) into meaningful semantic representations using context-based and global disambiguation methods, specially designed to allow almost linear computation efficiency. Consequently, the semantically augmented XML data tree is processed for structural node clustering, based on semantic query concepts (i.e., key-concepts), in order to identify and rank candidate answer sub-trees containing related occurrences of query key-concepts. Preliminary experiments highlight the quality and potential of our approach. en_US
dc.description.sponsorship ACM en_US
dc.description.sponsorship SIGAPP en_US
dc.language.iso en en_US
dc.publisher The Association for Computing Machinery en_US
dc.subject Computer security -- Congresses en_US
dc.subject Big data -- Congresses en_US
dc.title Almost Linear Semantic XML Keyword Search en_US
dc.type Conference Paper / Proceeding en_US
dc.author.school SOE en_US
dc.author.idnumber 201306321 en_US
dc.author.department Electrical and Computer Engineering en_US
dc.publication.place New York, NY en_US
dc.description.bibliographiccitations Includes bibliographical references. en_US
dc.identifier.doi https://doi.org/10.1145/3444757.3485079 en_US
dc.identifier.ctation Tekli, J., Tekli, G., & Chbeir, R. (2021, November). Almost linear semantic XML keyword search. In Proceedings of the 13th international conference on management of digital EcoSystems (pp. 129-138). New York: ACM en_US
dc.author.email joe.tekli@lau.edu.lb en_US
dc.conference.date 1-3 November, 2021 en_US
dc.conference.place Tunisia (Virtual event) en_US
dc.conference.title MEDES '21: Proceedings of the 13th International Conference on Management of Digital EcoSystems en_US
dc.identifier.tou http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php en_US
dc.identifier.url https://dl.acm.org/doi/abs/10.1145/3444757.3485079 en_US
dc.orcid.id https://orcid.org/0000-0003-3441-7974 en_US
dc.publication.date 2021 en_US
dc.author.affiliation Lebanese American University en_US


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search LAUR


Advanced Search

Browse

My Account