Dr Alaa Marshan


Senior Lecturer in Intelligent Data Analysis
BSc, MSc and PhD
WED: 11.00 - 16.00, THU: 11am - 3pm, and FRI: 11-3pm

About

My qualifications

2018
PhD in Computer Science
Brunel University London
2012
MSc in Business Systems Integration
Brunel University London
2003
BSc in Computer Engineering
University of Aleppo

Affiliations and memberships

Fellow
Higher Education Academy

Research

Research interests

Teaching

Publications

Highlights

My Google Scholar Page

Alaa Marshan, Farah Nasreen Mohamed Nizar, Athina Ioannou, Konstantina Spanaki (2023)Comparing Machine Learning and Deep Learning Techniques for Text Analytics: Detecting the Severity of Hate Comments Online, In: Information systems frontiers : a journal of research and innovation

Social media platforms have become an increasingly popular tool for individuals to share their thoughts and opinions with other people. However, very often people tend to misuse social media posting abusive comments. Abusive and harassing behaviours can have adverse effects on people's lives. This study takes a novel approach to combat harassment in online platforms by detecting the severity of abusive comments, that has not been investigated before. The study compares the performance of machine learning models such as Naïve Bayes, Random Forest, and Support Vector Machine, with deep learning models such as Convolutional Neural Network (CNN) and Bi-directional Long Short-Term Memory (Bi-LSTM). Moreover, in this work we investigate the effect of text pre-processing on the performance of the machine and deep learning models, the feature set for the abusive comments was made using unigrams and bigrams for the machine learning models and word embeddings for the deep learning models. The comparison of the models’ performances showed that the Random Forest with bigrams achieved the best overall performance with an accuracy of (0.94), a precision of (0.91), a recall of (0.94), and an F1 score of (0.92). The study develops an efficient model to detect severity of abusive language in online platforms, offering important implications both to theory and practice.