dc.contributor.author |
Abu-Khzam, F.N. |
|
dc.contributor.author |
Samatova, N.F. |
|
dc.contributor.author |
Ostrouchov, G. |
|
dc.contributor.author |
Langston, M.A. |
|
dc.contributor.author |
Al Geist, G. |
|
dc.date.accessioned |
2018-04-24T09:27:43Z |
|
dc.date.available |
2018-04-24T09:27:43Z |
|
dc.date.copyright |
2002 |
en_US |
dc.date.issued |
2018-04-24 |
|
dc.identifier.uri |
http://hdl.handle.net/10725/7497 |
|
dc.description.abstract |
It is well known that information retrieval, clustering and visualization can often be improved by reducing the dimensionality of high dimensional data. Classical techniques offer optimality but are much too slow for extremely large databases. The problem becomes harder yet when data are distributed across geographically dispersed machines. To address this need, an effective distributed dimension reduction algorithm is developed. Motivated by the success of the serial (non-distributed) FastMap heuristic of Faloutsos and Lin, the distributed method presented here is intended to be fast, accurate and reliable. It runs in linear time and requires very little data transmission. A series of experiments is conducted to gauge how the algorithm’s emphasis on minimal data transmission affects solution quality. Stress function measurements indicate that the distributed algorithm is highly competitive with the original FastMap heuristic. |
en_US |
dc.language.iso |
en |
en_US |
dc.title |
Distributed dimension reduction algorithms for widely dispersed data |
en_US |
dc.type |
Conference Paper / Proceeding |
en_US |
dc.author.school |
SAS |
en_US |
dc.author.idnumber |
200302941 |
en_US |
dc.author.department |
Computer Science and Mathematics |
en_US |
dc.description.embargo |
N/A |
en_US |
dc.keywords |
Parallel and distributed algorithms |
en_US |
dc.keywords |
Data mining |
en_US |
dc.keywords |
Information systems |
en_US |
dc.keywords |
Distributed databases |
en_US |
dc.identifier.ctation |
Abu-Khzam, F. N., Samatova, N. F., Ostrouchov, G., Langston, M. A., & Geist, A. (2002). Distributed Dimension Reduction Algorithms for Widely Dispersed Data. In IASTED PDCS (pp. 167-174). |
en_US |
dc.author.email |
faisal.abukhzam@lau.edu.lb |
en_US |
dc.conference.date |
November 4 – 6, 2002 |
en_US |
dc.conference.pages |
167-174 |
en_US |
dc.conference.place |
Cambridge, USA |
en_US |
dc.conference.title |
Parallel and distributed computing and systems |
en_US |
dc.identifier.tou |
http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php |
en_US |
dc.identifier.url |
http://www.actapress.com/Abstract.aspx?paperId=24561 |
en_US |
dc.orcid.id |
https://orcid.org/0000-0001-5221-8421 |
en_US |
dc.author.affiliation |
Lebanese American University |
en_US |