A fine-grained XML structural comparison approach

dc.contributor.author Tekli, Joe
dc.contributor.author Chbeir, Richard
dc.contributor.author Yetongnon, Kokou
dc.date.accessioned 2017-06-30T08:07:31Z
dc.date.available 2017-06-30T08:07:31Z
dc.date.issued 2017-06-30
dc.identifier.uri http://hdl.handle.net/10725/5856
dc.description.abstract As the Web continues to grow and evolve, more and more information is being placed in structurally rich documents, XML documents in particular, so as to improve the efficiency of similarity clustering, information retrieval and data management applications. Various algorithms for comparing hierarchically structured data, e.g., XML documents, have been proposed in the literature. Most of them make use of techniques for finding the edit distance between tree structures, XML documents being modeled as ordered labeled trees. Nevertheless, a thorough investigation of current approaches led us to identify several structural similarity aspects, i.e., sub-tree related similarities, which are not sufficiently addressed while comparing XML documents. In this paper, we provide an improved comparison method to deal with fine-grained sub-trees and leaf node repetitions, without increasing overall complexity with respect to current XML comparison methods. Our approach consists of two main algorithms for discovering the structural commonality between sub-trees and computing tree-based edit operation costs. A prototype has been developed to evaluate the optimality and performance of our method. Experimental results, on both real and synthetic XML data, demonstrate better performance with respect to alternative XML comparison methods. en_US
dc.language.iso en en_US
dc.publisher Springer en_US
dc.title A fine-grained XML structural comparison approach en_US
dc.type Conference Paper / Proceeding en_US
dc.author.school SOE en_US
dc.author.idnumber 201306321 en_US
dc.author.department Electrical And Computer Engineering en_US
dc.description.embargo N/A en_US
dc.keywords XML en_US
dc.keywords Semi-structured data en_US
dc.keywords Structural similarity en_US
dc.keywords Tree edit distance en_US
dc.identifier.doi http://dx.doi.org/10.1007/978-3-540-75563-0_39 en_US
dc.identifier.ctation Tekli, J., Chbeir, R., & Yetongnon, K. (2007, November). A fine-grained XML structural comparison approach. In International Conference on Conceptual Modeling (pp. 582-598). Springer, Berlin, Heidelberg. en_US
dc.author.email joe.tekli@lau.edu.lb en_US
dc.conference.date 5-9 November 2007 en_US
dc.conference.pages 582-598 en_US
dc.conference.place Auckland, New Zealand en_US
dc.conference.title International Conference on Conceptual Modeling en_US
dc.identifier.tou http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php en_US
dc.identifier.url https://link.springer.com/chapter/10.1007/978-3-540-75563-0_39 en_US
dc.author.affiliation Lebanese American University en_US
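
The abstract refers to edit distance between ordered labeled trees as the basis of most XML comparison methods. As a rough illustration of that baseline only, the sketch below computes a unit-cost edit distance (insert node, delete node, relabel node) using the textbook rightmost-root forest recurrence with memoization. It is not the paper's two algorithms and does not handle sub-tree commonality or leaf node repetitions. The names `tree`, `forest_size`, `forest_distance` and `tree_edit_distance`, the tuple representation, and the unit costs are all assumptions made for this example.

```python
from functools import lru_cache

# Hypothetical representation (an assumption for this sketch):
# a tree is (label, children) where children is a tuple of trees,
# and a forest is a tuple of trees in document order.

def tree(label, *children):
    """Build an ordered labeled tree as a hashable nested tuple."""
    return (label, tuple(children))

def forest_size(forest):
    """Count all nodes in a forest."""
    return sum(1 + forest_size(children) for _, children in forest)

@lru_cache(maxsize=None)
def forest_distance(f1, f2):
    """Unit-cost edit distance between two ordered forests."""
    if not f1:
        return forest_size(f2)   # insert every remaining node
    if not f2:
        return forest_size(f1)   # delete every remaining node
    (label1, kids1), (label2, kids2) = f1[-1], f2[-1]
    # Delete the rightmost root of f1; its children take its place.
    delete = forest_distance(f1[:-1] + kids1, f2) + 1
    # Insert the rightmost root of f2; its children take its place.
    insert = forest_distance(f1, f2[:-1] + kids2) + 1
    # Match the two rightmost roots, relabeling if the labels differ.
    match = (forest_distance(kids1, kids2)
             + forest_distance(f1[:-1], f2[:-1])
             + (0 if label1 == label2 else 1))
    return min(delete, insert, match)

def tree_edit_distance(t1, t2):
    """Edit distance between two single trees."""
    return forest_distance((t1,), (t2,))

# Toy documents with made-up element names.
doc_a = tree("paper", tree("title"), tree("author"), tree("author"))
doc_b = tree("paper", tree("title"), tree("author"))
doc_c = tree("article", tree("title"), tree("author"))

print(tree_edit_distance(doc_a, doc_b))
print(tree_edit_distance(doc_b, doc_c))
```

Here `doc_a` and `doc_b` differ only by a repeated `author` leaf. A plain node-level distance treats this as one generic deletion, which is the kind of case the abstract says sub-tree and repetition-aware comparison aims to handle more finely.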
