Unsupervised Extractive Text Summarization Using Frequency-Based Sentence Clustering

Hajjar, Ali; Tekli, Joe

dc.contributor.author	Hajjar, Ali
dc.contributor.author	Tekli, Joe
dc.contributor.editor	Chiusano, Silvia
dc.contributor.editor	Cerquitelli, Tania
dc.contributor.editor	Wrembel, Robert
dc.date.accessioned	2024-11-08T10:16:53Z
dc.date.available	2024-11-08T10:16:53Z
dc.date.copyright	2022	en_US
dc.date.issued	2022-08-29
dc.identifier.uri	http://hdl.handle.net/10725/16287
dc.description.abstract	Large texts are not always entirely meaningful: they might include repetitions and useless details, and might not be easy to interpret by humans. Automatic text summarization aims to simplify text by making it shorter and (possibly) more informative. This paper describes a new solution for extractive text summarization, designed to efficiently process flat (unstructured) text. It performs unsupervised frequency-based document processing to identify the candidate sentences having the highest potential to represent informative content in the document. It introduces a dedicated feature vector representation for sentences to evaluate the relative impact of different sentence terms. The sentence feature vectors are run through a partitional k-means clustering process, to build the extractive summary based on the cluster representatives. Experimental results highlight the quality and efficiency of our approach.	en_US
dc.language.iso	en	en_US
dc.publisher	Springer International	en_US
dc.subject	Database management -- Congresses	en_US
dc.subject	Artificial intelligence -- Congresses	en_US
dc.title	Unsupervised Extractive Text Summarization Using Frequency-Based Sentence Clustering	en_US
dc.type	Conference Paper / Proceeding	en_US
dc.author.school	SOE	en_US
dc.author.idnumber	201306321	en_US
dc.author.department	Electrical and Computer Engineering	en_US
dc.description.physdesc	1 online resource (332 pages)	en_US
dc.publication.place	Cham	en_US
dc.keywords	Automatic text summarization	en_US
dc.keywords	Extractive summaries	en_US
dc.keywords	Word space model	en_US
dc.keywords	Feature representation	en_US
dc.keywords	k-means clustering	en_US
dc.description.bibliographiccitations	Includes bibliographical references.	en_US
dc.identifier.doi	https://doi.org/10.1007/978-3-031-15743-1_23	en_US
dc.identifier.ctation	Hajjar, A., & Tekli, J. (2022, August). Unsupervised extractive text summarization using frequency-based sentence clustering. In European conference on advances in databases and information systems (pp. 245-255). Cham: Springer International Publishing.	en_US
dc.author.email	joe.tekli@lau.edu.lb	en_US
dc.conference.date	5–8 September, 2022	en_US
dc.conference.pages	245–255	en_US
dc.conference.place	Turin, Italy	en_US
dc.conference.title	New trends in database and information systems : ADBIS 2022 Short Papers	en_US
dc.identifier.tou	http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php	en_US
dc.identifier.url	https://link.springer.com/chapter/10.1007/978-3-031-15743-1_23	en_US
dc.orcid.id	https://orcid.org/0000-0003-3441-7974	en_US
dc.publication.date	2022	en_US
dc.author.affiliation	Lebanese American University	en_US
dc.relation.numberofseries	CCIS 1652	en_US
dc.title.volume	Communications in Computer and Information Science	en_US