dc.contributor.author |
Makarem, Norma |
|
dc.date.accessioned |
2023-10-24T09:18:21Z |
|
dc.date.available |
2023-10-24T09:18:21Z |
|
dc.date.copyright |
2023 |
en_US |
dc.date.issued |
2023-05-19 |
|
dc.identifier.uri |
http://hdl.handle.net/10725/15136 |
|
dc.description.abstract |
Cluster Editing, a known model for correlation clustering, has garnered significant consideration in the parameterized complexity area and has been utilized in a range of practical contexts. In certain situations, the requirement for clusters to be cliques was deemed excessively stringent, leading to the proposal of alternative relaxed clique models for dense subgraphs, such as s-club. In this work, we implement three approaches to tackle the 2-club clustering via edge deletion: a heuristic approach based on the influence of the edge to resolve maximum conflicts, a parameterized algorithm in which by deleting a maximum of k edges, the graph can be transformed into a 2-club cluster based on a branching algorithm, and the approach in Integer Linear Programming to find the optimized solution in an integer formulation. We run these algorithms and cross-compare the three experiment results to the fastest Cluster Editing algorithm which runs in O(1.62k) time. Our 2-club Cluster Edge Deletion approach showed better performance than Cluster Editing in terms of running time, and significantly less cost of modifications while preserving the information of the underlying structure. Finally, we consider the 3-clubs Cluster Edge Deletion which did not have much focus in the literature. The problem can be solved in time O(4k). We present a first fixed-parameter algorithm that breaks this 4k barrier by solving the problem in O(3.65k) time. In addition, we implement this 3CCED algorithm in the heuristic approach and compare the experiments to Cluster Editing and 2CCED. The results show that the 3-club Cluster Edge Deletion presents the highest performance in terms of running time and cost of modifications to reach the desired results. This shows that deletion into a graph whose components are of bounded diameter can be a better model for correlation clustering. |
en_US |
dc.language.iso |
en |
en_US |
dc.subject |
Cluster analysis -- Data processing |
en_US |
dc.subject |
Cluster analysis -- Mathematical models |
en_US |
dc.subject |
Computational complexity |
en_US |
dc.subject |
Computer algorithms |
en_US |
dc.subject |
Lebanese American University -- Dissertations |
en_US |
dc.subject |
Lebanese American University -- Dissertations |
en_US |
dc.title |
Correlation Clustering via s-Club Cluster Edge Deletion |
en_US |
dc.type |
Thesis |
en_US |
dc.title.subtitle |
Theory and Experiments |
en_US |
dc.term.submitted |
Spring |
en_US |
dc.author.degree |
MS in Computer Science |
en_US |
dc.author.school |
SAS |
en_US |
dc.author.idnumber |
200402205 |
en_US |
dc.author.commembers |
Hanna, Eileen-Marie |
|
dc.author.commembers |
Haraty, Ramzi |
|
dc.author.department |
Computer Science And Mathematics |
en_US |
dc.description.physdesc |
1 online resource (xi, 46 leaves) |
en_US |
dc.author.advisor |
Abu Khzam, Faisal |
|
dc.keywords |
Correlation Clustering |
en_US |
dc.keywords |
Cluster Editing |
en_US |
dc.keywords |
s-Club Cluster Edge Deletion |
en_US |
dc.keywords |
2- club |
en_US |
dc.keywords |
3-club |
en_US |
dc.description.bibliographiccitations |
Bibliography: leaves 41-46. |
en_US |
dc.identifier.doi |
https://doi.org/10.26756/th.2023.621 |
|
dc.author.email |
norma.makarem@lau.edu |
en_US |
dc.identifier.tou |
http://libraries.lau.edu.lb/research/laur/terms-of-use/thesis.php |
en_US |
dc.publisher.institution |
Lebanese American University |
en_US |
dc.author.affiliation |
Lebanese American University |
en_US |