A hybrid approach for XML similarity

LAUR Repository

dc.contributor.author Tekli, Joe
dc.contributor.author Chbeir, Richard
dc.contributor.author Yetongnon, Kokou
dc.date.accessioned 2018-02-08T13:27:48Z
dc.date.available 2018-02-08T13:27:48Z
dc.date.copyright 2007 en_US
dc.date.issued 2018-02-08
dc.identifier.isbn 978-3-540-69507-3 en_US
dc.identifier.uri http://hdl.handle.net/10725/7058
dc.description.abstract In the past few years, XML has been established as an effective means for information management, and has been widely exploited for complex data representation. Owing to the unparalleled growth in the use of the XML standard, developing efficient techniques for comparing XML-based documents has become essential in information retrieval (IR) research. Various algorithms for comparing hierarchically structured data, e.g. XML documents, have been proposed in the literature. However, to our knowledge, most of them focus exclusively on comparing documents based on structural features, overlooking the semantics involved. In this paper, we integrate IR semantic similarity assessment into an edit distance algorithm, seeking to amend similarity judgments when comparing XML-based documents. Our approach consists of an original edit distance operation cost model that introduces the semantic relatedness of XML element/attribute labels into traditional edit distance computations. A prototype has been developed to evaluate our model’s performance. Experiments yielded notable results. en_US
dc.language.iso en en_US
dc.publisher Springer en_US
dc.title A hybrid approach for XML similarity en_US
dc.type Conference Paper / Proceeding en_US
dc.author.school SOE en_US
dc.author.idnumber 201306321 en_US
dc.author.department Electrical And Computer Engineering en_US
dc.description.embargo N/A en_US
dc.keywords Editing en_US
dc.keywords Mili en_US
dc.identifier.doi https://doi.org/10.1007/978-3-540-69507-3_68 en_US
dc.identifier.citation Tekli J., Chbeir R., Yetongnon K. (2007) A Hybrid Approach for XML Similarity. In: van Leeuwen J., Italiano G.F., van der Hoek W., Meinel C., Sack H., Plášil F. (eds) SOFSEM 2007: Theory and Practice of Computer Science. SOFSEM 2007. Lecture Notes in Computer Science, vol 4362. Springer, Berlin, Heidelberg en_US
dc.author.email joe.tekli@lau.edu.lb en_US
dc.conference.date January 20-26, 2007 en_US
dc.conference.pages 783-795 en_US
dc.conference.place Harrachov, Czech Republic en_US
dc.conference.title 33rd Conference on Current Trends in Theory and Practice of Computer Science en_US
dc.identifier.tou http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php en_US
dc.identifier.url https://link.springer.com/chapter/10.1007/978-3-540-69507-3_68 en_US
dc.publication.date 2007 en_US
dc.author.affiliation Lebanese American University en_US
