
Building semantic trees from XML documents

LAUR Repository


dc.contributor.author Tekli, Joe
dc.contributor.author Charbel, Nathalie
dc.contributor.author Chbeir, Richard
dc.date.accessioned 2017-01-27T08:12:32Z
dc.date.available 2017-01-27T08:12:32Z
dc.date.copyright 2016 en_US
dc.date.issued 2016-05-13
dc.identifier.issn 1570-8268 en_US
dc.identifier.uri http://hdl.handle.net/10725/5081
dc.description.abstract The distributed nature of the Web, as a decentralized system exchanging information between heterogeneous sources, has underlined the need to manage interoperability, i.e., the ability to automatically interpret information in Web documents exchanged between different sources, necessary for efficient information management and search applications. In this context, XML was introduced as a data representation standard that simplifies the tasks of interoperation and integration among heterogeneous data sources, allowing data to be represented in (semi-)structured documents consisting of hierarchically nested elements and atomic attributes. However, while XML has been shown to be most effective in exchanging data, i.e., in syntactic interoperability, it has proven limited when it comes to handling semantics, i.e., semantic interoperability, since it only specifies the syntactic and structural properties of the data without any further semantic meaning. As a result, XML semantic-aware processing has become a motivating challenge in Web data management, requiring dedicated semantic analysis and disambiguation methods to assign well-defined meaning to XML elements and attributes. In this context, most existing approaches: (i) ignore the problem of identifying ambiguous XML elements/nodes, (ii) only partially consider their structural relationships/context, (iii) use syntactic information in processing XML data regardless of the semantics involved, and (iv) are static in adopting fixed disambiguation constraints, thus limiting user involvement. In this paper, we provide a new XML Semantic Disambiguation Framework titled XSDF, designed to address each of the above limitations, taking as input an XML document and producing as output a semantically augmented XML tree made of unambiguous semantic concepts extracted from a reference machine-readable semantic network.
XSDF consists of four main modules for: (i) linguistic pre-processing of simple/compound XML node labels and values, (ii) selecting ambiguous XML nodes as targets for disambiguation, (iii) representing target nodes as special sphere neighborhood vectors including all XML structural relationships within a (user-chosen) range, and (iv) running context vectors through a hybrid disambiguation process, combining two approaches: concept-based and context-based disambiguation, allowing the user to tune disambiguation parameters according to her needs. Conducted experiments demonstrate the effectiveness and efficiency of our approach in comparison with alternative methods. We also discuss some practical applications of our method, ranging over semantic-aware query rewriting, semantic document clustering and classification, Mobile and Web services search and discovery, as well as blog analysis and event detection in social networks and tweets. © 2016 Elsevier B.V. All rights reserved. en_US
dc.language.iso en en_US
dc.title Building semantic trees from XML documents en_US
dc.type Article en_US
dc.description.version Published en_US
dc.author.school SOE en_US
dc.author.idnumber 201306321 en_US
dc.author.department Electrical And Computer Engineering en_US
dc.description.embargo N/A en_US
dc.relation.journal Journal of Web Semantics en_US
dc.journal.volume 37-38 en_US
dc.article.pages 1-24 en_US
dc.keywords Context representation en_US
dc.keywords Knowledge bases en_US
dc.keywords Semantic ambiguity en_US
dc.keywords Semantic-aware processing en_US
dc.keywords Word sense disambiguation en_US
dc.keywords XML and Semi-structured data en_US
dc.identifier.doi http://dx.doi.org/10.1016/j.websem.2016.03.002 en_US
dc.identifier.ctation Tekli, J., Charbel, N., & Chbeir, R. (2016). Building semantic trees from XML documents. Web Semantics: Science, Services and Agents on the World Wide Web, 37, 1-24. en_US
dc.author.email joe.tekli@lau.edu.lb en_US
dc.identifier.tou http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php en_US
dc.identifier.url http://www.sciencedirect.com/science/article/pii/S1570826816000202 en_US
dc.orcid.id https://orcid.org/0000-0003-3441-7974 en_US
dc.author.affiliation Lebanese American University en_US
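
The abstract describes representing each target node as a "sphere neighborhood vector" that covers all XML structural relationships within a user-chosen range. The following is only a rough sketch of that idea, not the XSDF implementation: the function name `sphere_neighborhood`, the use of plain label counts (with the target node itself included), and the sample document are all assumptions made for illustration. The record above does not say how neighbors are weighted or how labels are pre-processed.

```python
import xml.etree.ElementTree as ET
from collections import Counter, deque


def sphere_neighborhood(root, target, radius):
    """Count the labels of all nodes within `radius` edges of `target`.

    Hypothetical helper: distance is the number of parent/child edges
    between two nodes, and every node in range counts equally.
    """
    # ElementTree keeps no parent pointers, so build a child -> parent map.
    parent = {child: node for node in root.iter() for child in node}

    seen = {target}
    queue = deque([(target, 0)])
    vector = Counter()

    # Breadth-first walk over the tree, treated as an undirected graph.
    while queue:
        node, dist = queue.popleft()
        vector[node.tag] += 1
        if dist == radius:
            continue  # do not expand beyond the chosen range
        neighbors = list(node)  # children
        if node in parent:
            neighbors.append(parent[node])
        for nxt in neighbors:
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, dist + 1))
    return vector


# Made-up sample document, used only to exercise the sketch.
doc = ET.fromstring(
    "<films><picture><cast><star>Kelly</star></cast><plot/></picture></films>"
)
star = doc.find("./picture/cast/star")
context = sphere_neighborhood(doc, star, radius=2)
```

With `radius=2` the vector for `star` covers `star`, `cast`, and `picture`; widening the range to 3 also pulls in `plot` and `films`. A fuller version would presumably weight neighbors by distance and feed the vector to the disambiguation step, but those details are not given in this record.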

