Abstract:
In this work, we propose a new model to enhance auto-indexing Arabic texts. Our model denotes extracting new relevant words by relating those chosen by the previous classical methods, to new words using data mining rules. The model uses the Apriori Algorithm - an association rule algorithm for extracting frequent sets containing related items - to extract relations between words in the texts to be indexed with words from texts that belong to the same category. These associations of words extracted are illustrated as sets of words that appear frequently together. Our results show significant improvement in terms of accuracy, efficiency and reliability when compared to previous works.