LAUR Repository

Show simple item record

dc.contributor.author Tekli, J.
dc.contributor.author Harazaki, M.
dc.contributor.author Yokoyama, S.
dc.contributor.author Fukuta, N.
dc.contributor.author Chbeir, R.
dc.contributor.author Ishikawa, H.
dc.date.accessioned 2017-07-04T11:23:06Z
dc.date.available 2017-07-04T11:23:06Z
dc.date.issued 2017-07-04
dc.identifier.isbn 978-364228806-7 en_US
dc.identifier.uri http://hdl.handle.net/10725/5869
dc.description.abstract XML datasets of various sizes and properties are needed to evaluate the correctness and efficiency of XML-based algorithms and applications. While several downloadable datasets can be found online, these are predefined by system experts and might not be suitable to evaluate every algorithm. Tools for generating synthetic XML documents underline an alternative solution, promoting flexibility and adaptability in generating synthetic document collections. Nonetheless, the usefulness of existing XML generators remains rather limited due to the restricted levels of expressiveness allowed to users. In this paper, we develop a novel XML By example Generator (XBeGene) for producing synthetic XML data which closely reflect the user's requirements. Inspired by the query-by-example paradigm in information retrieval, Our generator system i)allows the user to provide her own sample XML documents as input, ii) analyzes the structure, occurrence frequencies, and content distributions for each XML element in the user input documents, and iii) produces synthetic XML documents which closely concur, in both structural and content features, to the user's input data. The size of each synthetic document as well as that of the entire document collection are also specified by the user. Clustering experiments demonstrate high correlation levels between the specified user requirements and the characteristics of the generated XML data, while timing results confirm our approach's scalability to large scale document collections. en_US
dc.language.iso en en_US
dc.publisher Scopus en_US
dc.title XBeGene en_US
dc.type Conference Paper / Proceeding en_US
dc.title.subtitle scalable XML documents generator by example based on real data en_US
dc.author.school SOE en_US
dc.author.idnumber 201306321 en_US
dc.author.department Electrical And Computer Engineering en_US
dc.description.embargo N/A en_US
dc.keywords Content distribution en_US
dc.keywords Data sets en_US
dc.keywords Document collection en_US
dc.keywords Example based en_US
dc.keywords Generator systems en_US
dc.keywords Input datas en_US
dc.keywords Query-by-example en_US
dc.keywords User input en_US
dc.keywords User requirements en_US
dc.keywords XLM data en_US
dc.identifier.doi http://dx.doi.org/10.1007/978-3-642-28807-4_63 en_US
dc.identifier.ctation Harazaki, M., Tekli, J., Yokoyama, S., Fukuta, N., Chbeir, R., & Ishikawa, H. (2013). XBeGene: scalable XML documents generator by example based on real data. Recent Progress in DEIT, 1, 449-460. en_US
dc.conference.date 5-17 March 2012 en_US
dc.conference.pages 449-460 en_US
dc.conference.place Bali; Indonesia en_US
dc.conference.title International Conference on Data Engineering and Internet Technology en_US
dc.identifier.tou http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php en_US
dc.identifier.url https://www.scopus.com/record/display.uri?eid=2-s2.0-84865663010&doi=10.1007%2f978-3-642-28807-4_63&origin=inward&txGid=52B8F74D6C7097F8BA3452BAE0A513BE.wsnAw8kcdt7IPYLO0V48gA%3a2 en_US
dc.volume 2013 en_US
dc.author.affiliation Lebanese American University en_US

Files in this item

This item appears in the following Collection(s)

Show simple item record

Search LAUR

Advanced Search


My Account