.

An Innovative Automatic Indexing Method for Arabic Text

LAUR Repository

Show simple item record

dc.contributor.author Masri, Nour
dc.date.accessioned 2022-09-01T06:45:12Z
dc.date.available 2022-09-01T06:45:12Z
dc.date.copyright 2020 en_US
dc.date.issued 2020-01-29
dc.identifier.uri http://hdl.handle.net/10725/13983
dc.description.abstract Automatic indexing and texts retrieval methods for languages have been studied for a long time. Compared to other languages, there is still limited research which has been conducted for the automated Arabic Text Categorization. In this work, we present an innovative method to reinforce the accuracy of automatic indexing of Arabic texts by introducing a Thesaurus. Our model extracts new relevant words by referring to the introduced thesaurus which identi es words correlation. The Thesaurus is built through an NLTK toolkit which contains a library that lists the synonyms of a certain word available in WordNet library. The words having the same meaning and that frequently appear together were grouped under one umbrella using a JSON dictionary making it easier to identify the texts topic. Our results exhibit notable improvement in accuracy and efficiency compared to previous works. en_US
dc.language.iso en en_US
dc.subject Automatic indexing en_US
dc.subject Arabic language -- Data processing en_US
dc.subject Information storage and retrieval systems en_US
dc.subject Lebanese American University -- Dissertations en_US
dc.subject Dissertations, Academic en_US
dc.title An Innovative Automatic Indexing Method for Arabic Text en_US
dc.type Thesis en_US
dc.term.submitted Fall en_US
dc.author.degree MS in Computer Science en_US
dc.author.school SAS en_US
dc.author.idnumber 201203406 en_US
dc.author.commembers Habre, Samer
dc.author.commembers Mourad, Azzam
dc.author.department Computer Science And Mathematics en_US
dc.description.physdesc ix, 60 leaves: ill. en_US
dc.author.advisor Haraty, Ramzi
dc.keywords Automatic Indexing en_US
dc.keywords Arabic Text en_US
dc.keywords Building Thesaurus en_US
dc.keywords Frequent Sets en_US
dc.keywords Synonyms en_US
dc.keywords JSON Dictionary en_US
dc.description.bibliographiccitations Bibliography: leaves 49-54. en_US
dc.identifier.doi https://doi.org/10.26756/th.2022.445
dc.author.email nour.masri@lau.edu.lb en_US
dc.identifier.tou http://libraries.lau.edu.lb/research/laur/terms-of-use/thesis.php en_US
dc.publisher.institution Lebanese American University en_US
dc.author.affiliation Lebanese American University en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search LAUR


Advanced Search

Browse

My Account