A Deep Learning Model for Identifying and Analyzing Sarcasm and Emotions in Lebanese Arabizi from Instagram and Twitter Data

Rachid, Jinan

dc.contributor.author	Rachid, Jinan
dc.date.accessioned	2024-06-27T06:38:38Z
dc.date.available	2024-06-27T06:38:38Z
dc.date.copyright	2023	en_US
dc.date.issued	2023-12-17
dc.identifier.uri	http://hdl.handle.net/10725/15805
dc.description.abstract	People use informal language on microblog platforms to share their opinions on products, events, sports, or politics. Moreover, microblog platforms often harbor instances of hate speech and cyberbullying, resulting in a massive amount of data available for natural language processing applications. Most studies have predominantly focused on common languages like English for tasks such as hate speech detection, sentiment analysis, and emotion analysis. Dialectal Arabic presents additional challenges due to its morphological richness and complexity, making NLP applications more intricate. While recent research has explored Arabic and Arabizi dialects, there has been limited attention given to Lebanese Arabizi. To address this gap, our objective was to construct a substantial Lebanese Arabizi dataset and make it accessible for NLP research. Additionally, we sought to develop a new approach to Arabizi detection and explored the identification of sarcasm and emotion recognition. The dataset comprised 11,000 rows, a combination of comments collected from Instagram and tweets. We utilized a pre-trained DziriBERT model for Arabizi identification and sarcasm detection, comparing the performances of contextual embedding and semantic embedding models. The word embeddings were then input into a Bidirectional Long Short-Term Memory (BiLSTM) model for emotion recognition. The Arabizi identification model achieved an impressive macro F1 score of 98%, while the sarcasm detection model achieved an average macro F1 score of 63%. This Arabizi detection model not only contributes to expanding the Arabizi dataset but also holds potential for broader applications. Sarcasm detection is crucial for microblog platforms to filter content, particularly since it heavily relies on the manual reporting of offensive material. Additionally, emotion recognition assists companies in understanding customers’ opinions about their products and services.	en_US
dc.language.iso	en	en_US
dc.subject	Lebanese American University--Dissertations	en_US
dc.subject	Dissertations, Academic	en_US
dc.subject	Arabic language--Lexicology--Data processing	en_US
dc.subject	Arabic language--Transliteration--Data processing	en_US
dc.subject	Sentiment analysis--Data processing	en_US
dc.subject	Social media--Data processing	en_US
dc.title	A Deep Learning Model for Identifying and Analyzing Sarcasm and Emotions in Lebanese Arabizi from Instagram and Twitter Data	en_US
dc.type	Thesis	en_US
dc.term.submitted	Fall	en_US
dc.author.degree	MS in Computer Science	en_US
dc.author.school	SAS	en_US
dc.author.idnumber	201604608	en_US
dc.author.commembers	El Khatib, Nader
dc.author.commembers	Nour, Chadi
dc.author.department	Computer Science And Mathematics	en_US
dc.description.physdesc	1 online resource (xi, 58 leaves) : col. ill.	en_US
dc.author.advisor	Harmanani, Haidar
dc.keywords	Arabizi	en_US
dc.keywords	BERT	en_US
dc.keywords	Sarcasm detection	en_US
dc.keywords	Emotion recognition	en_US
dc.keywords	Deep learning	en_US
dc.description.bibliographiccitations	Bibliography: leaves 49-58.	en_US
dc.identifier.doi	https://doi.org/10.26756/th.2023.664	en_US
dc.identifier.tou	http://libraries.lau.edu.lb/research/laur/terms-of-use/thesis.php	en_US
dc.publisher.institution	Lebanese American University	en_US
dc.author.affiliation	Lebanese American University	en_US