dc.contributor.author |
Hajjar, Ali |
|
dc.contributor.author |
Tekli, Joe |
|
dc.contributor.editor |
Chiusano, Silvia |
|
dc.contributor.editor |
Cerquitelli, Tania |
|
dc.contributor.editor |
Wrembel, Robert |
|
dc.date.accessioned |
2024-11-08T10:16:53Z |
|
dc.date.available |
2024-11-08T10:16:53Z |
|
dc.date.copyright |
2022 |
en_US |
dc.date.issued |
2022-08-29 |
|
dc.identifier.uri |
http://hdl.handle.net/10725/16287 |
|
dc.description.abstract |
Large texts are not always entirely meaningful: they might include repetitions and useless details, and might not be easy to interpret by humans. Automatic text summarization aims to simplify text by making it shorter and (possibly) more informative. This paper describes a new solution for extractive text summarization, designed to efficiently process flat (unstructured) text. It performs unsupervised frequency-based document processing to identify the candidate sentences having the highest potential to represent informative content in the document. It introduces a dedicated feature vector representation for sentences to evaluate the relative impact of different sentence terms. The sentence feature vectors are run through a partitional k-means clustering process, to build the extractive summary based on the cluster representatives. Experimental results highlight the quality and efficiency of our approach. |
en_US |
dc.language.iso |
en |
en_US |
dc.publisher |
Springer International |
en_US |
dc.subject |
Database management -- Congresses |
en_US |
dc.subject |
Artificial intelligence -- Congresses |
en_US |
dc.title |
Unsupervised Extractive Text Summarization Using Frequency-Based Sentence Clustering |
en_US |
dc.type |
Conference Paper / Proceeding |
en_US |
dc.author.school |
SOE |
en_US |
dc.author.idnumber |
201306321 |
en_US |
dc.author.department |
Electrical and Computer Engineering |
en_US |
dc.description.physdesc |
1 online resource (332 pages) |
en_US |
dc.publication.place |
Cham |
en_US |
dc.keywords |
Automatic text summarization |
en_US |
dc.keywords |
Extractive summaries |
en_US |
dc.keywords |
Word space model |
en_US |
dc.keywords |
Feature representation |
en_US |
dc.keywords |
k-means clustering |
en_US |
dc.description.bibliographiccitations |
Includes bibliographical references. |
en_US |
dc.identifier.doi |
https://doi.org/10.1007/978-3-031-15743-1_23 |
en_US |
dc.identifier.ctation |
Hajjar, A., & Tekli, J. (2022, August). Unsupervised extractive text summarization using frequency-based sentence clustering. In European conference on advances in databases and information systems (pp. 245-255). Cham: Springer International Publishing. |
en_US |
dc.author.email |
joe.tekli@lau.edu.lb |
en_US |
dc.conference.date |
5–8 September, 2022 |
en_US |
dc.conference.pages |
245–255 |
en_US |
dc.conference.place |
Turin, Italy |
en_US |
dc.conference.title |
New trends in database and information systems : ADBIS 2021 Short Papers |
en_US |
dc.identifier.tou |
http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php |
en_US |
dc.identifier.url |
https://link.springer.com/chapter/10.1007/978-3-031-15743-1_23 |
en_US |
dc.orcid.id |
https://orcid.org/0000-0003-3441-7974 |
en_US |
dc.publication.date |
2022 |
en_US |
dc.author.affiliation |
Lebanese American University |
en_US |
dc.relation.numberofseries |
CCIS 1652 |
en_US |
dc.title.volume |
Communications in Computer and Information Science |
en_US |