.

An enhanced k-means clustering algorithm for pattern discovery in healthcare data

LAUR Repository

Show simple item record

dc.contributor.author Haraty, Ramzi A.
dc.contributor.author Dimishkieh, Mohamad
dc.contributor.author Masud, Mehedi
dc.date.accessioned 2017-02-01T08:10:16Z
dc.date.available 2017-02-01T08:10:16Z
dc.date.copyright 2015 en_US
dc.date.issued 2017-02-01
dc.identifier.issn 1550-1329 en_US
dc.identifier.uri http://hdl.handle.net/10725/5135
dc.description.abstract The huge amounts of data generated by media sensors in health monitoring systems, by medical diagnosis that produce media (audio, video, image, and text) content, and from health service providers are too complex and voluminous to be processed and analyzed by traditional methods. Data mining approaches offer the methodology and technology to transform these heterogeneous data into meaningful information for decision making. This paper studies data mining applications in healthcare. Mainly, we study k-means clustering algorithms on large datasets and present an enhancement to k-means clustering, which requires k or a lesser number of passes to a dataset. The proposed algorithm, which we call G-means, utilizes a greedy approach to produce the preliminary centroids and then takes k or lesser passes over the dataset to adjust these center points. Our experimental results, which were used in an increasing manner on the same dataset, show that G-means outperforms k-means in terms of entropy and F-scores. The experiments also yield better results for G-means in terms of the coefficient of variance and the execution time. en_US
dc.language.iso en en_US
dc.title An enhanced k-means clustering algorithm for pattern discovery in healthcare data en_US
dc.type Article en_US
dc.description.version Published en_US
dc.author.school SAS en_US
dc.author.idnumber 199729410 en_US
dc.author.department Computer Science and Mathematics en_US
dc.description.embargo N/A en_US
dc.relation.journal International Journal of Distributed Sensor Networks en_US
dc.journal.volume 2015 en_US
dc.article.pages 1-11
dc.identifier.doi http://dx.doi.org/10.1155/2015/615740 en_US
dc.identifier.ctation Haraty, R. A., Dimishkieh, M., & Masud, M. (2015). An enhanced k-means clustering algorithm for pattern discovery in healthcare data. International Journal of Distributed Sensor Networks, 2015, 1-11 en_US
dc.author.email rharaty@lau.edu.lb en_US
dc.identifier.tou http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php en_US
dc.identifier.url http://journals.sagepub.com/doi/full/10.1155/2015/615740 en_US
dc.orcid.id https://orcid.org/0000-0002-6978-3627
dc.author.affiliation Lebanese American University en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search LAUR


Advanced Search

Browse

My Account