ac

A Comparative Analysis of Deep Learning Models for SMS Spam Detection: CNN-LSTM, CNN-GRU, and ResNet Approaches

Authors

  • Gregorius Airlangga Atma Jaya Catholic University of Indonesia

DOI:

10.47709/cnahpc.v6i4.4827

Keywords:

SMS Spam Detection, CNN-LSTM, CNN-GRU, ResNet, Deep Learning Models

Dimension Badge Record



Abstract

Spam messages have become a growing challenge in mobile communication, threatening user security and data privacy. Traditional spam detection methods, including rule-based and machine learning techniques, are increasingly insufficient due to the evolving sophistication of spam tactics. This research evaluates the effectiveness of advanced deep learning models such as CNN-LSTM, CNN-GRU, and ResNet for SMS spam detection. The dataset used consists of diverse SMS messages labeled as either spam or legitimate (ham), ensuring broad coverage of real-world spam patterns. The study employs a robust ten-fold cross-validation approach to assess the generalization capabilities of the models, measuring performance based on accuracy, precision, recall, and F1 score. The results indicate that ResNet outperformed the other models, achieving an average accuracy of 99.08% and an F1 score of 0.9646, making it the most reliable model for spam detection. CNN-GRU demonstrated competitive performance with a balance between accuracy (98.97%) and computational efficiency, making it suitable for real-time applications. CNN-LSTM, while highly accurate (98.92%), showed a slightly lower recall compared to the other models, indicating a more cautious approach to detecting spam. These findings highlight the potential of hybrid deep learning models in addressing the complexities of SMS spam detection. Future research could focus on optimizing these models for deployment in resource-constrained environments, such as mobile devices, and further exploring the integration of residual connections for more effective spam filtering.

Downloads

Download data is not yet available.
Google Scholar Cite Analysis
Abstract viewed = 30 times

References

Abbas, A. M. (2021). Social network analysis using deep learning: applications and schemes. Social Network Analysis and Mining, 11(1), 106.

Agarwal, R., Dhoot, A., Kant, S., Bisht, V. S., Malik, H., Ansari, M. F., Afthanorhan, A. & Hossaini, M. A. (2024). A novel approach for spam detection using natural language processing with AMALS models. IEEE Access.

Ahmadzadeh, E., Kim, H., Jeong, O., Kim, N. & Moon, I. (2022). A deep bidirectional LSTM-GRU network model for automated ciphertext classification. IEEE Access, 10, 3228–3237.

Ahmed, S. F., Alam, M. S. Bin, Hassan, M., Rozbu, M. R., Ishtiak, T., Rafa, N., Mofijur, M., Shawkat Ali, A. B. M. & Gandomi, A. H. (2023). Deep learning modelling techniques: current progress, applications, advantages, and challenges. Artificial Intelligence Review, 56(11), 13521–13617.

Akhter, M. P., Jiangbin, Z., Naqvi, I. R., Abdelmajeed, M., Mehmood, A. & Sadiq, M. T. (2020). Document-level text classification using single-layer multisize filters convolutional neural network. IEEE Access, 8, 42689–42707.

Alkhalil, Z., Hewage, C., Nawaf, L. & Khan, I. (2021). Phishing attacks: A recent comprehensive study and a new anatomy. Frontiers in Computer Science, 3, 563060.

Ansari, L., Ji, S., Chen, Q. & Cambria, E. (2022). Ensemble hybrid learning methods for automated depression detection. IEEE Transactions on Computational Social Systems, 10(1), 211–219.

Bukhari, S. M. S., Zafar, M. H., Abou Houran, M., Moosavi, S. K. R., Mansoor, M., Muaaz, M. & Sanfilippo, F. (2024). Secure and privacy-preserving intrusion detection in wireless sensor networks: Federated learning with SCNN-Bi-LSTM for enhanced reliability. Ad Hoc Networks, 155, 103407.

Cai, H., Lin, J., Lin, Y., Liu, Z., Tang, H., Wang, H., Zhu, L. & Han, S. (2022). Enable deep learning on mobile devices: Methods, systems, and applications. ACM Transactions on Design Automation of Electronic Systems (TODAES), 27(3), 1–50.

Cao, Y., Geddes, T. A., Yang, J. Y. H. & Yang, P. (2020). Ensemble deep learning in bioinformatics. Nature Machine Intelligence, 2(9), 500–508.

Dapat, V. (2024). SMS Spam Detection Dataset. https://www.kaggle.com/datasets/vishakhdapat/sms-spam-detection-dataset/data

Daraghmi, E. Y., Qadan, S., Daraghmi, Y., Yussuf, R., Cheikhrouhou, O. & Baz, M. (2024). From Text to Insight: An Integrated CNN-BiLSTM-GRU Model for Arabic Cyberbullying Detection. IEEE Access.

Das, S., Mandal, S. & Basak, R. (2023). Spam email detection using a novel multilayer classification-based decision technique. International Journal of Computers and Applications, 45(9), 587–599.

Dey, A., Nayak, S., Kumar, R. & Mohanty, S. N. (2024). How Machine Learning is Innovating Today’s World: A Concise Technical Guide. John Wiley & Sons.

Do, N. Q., Selamat, A., Krejcar, O., Herrera-Viedma, E. & Fujita, H. (2022). Deep learning for phishing detection: Taxonomy, current challenges and future directions. Ieee Access, 10, 36429–36463.

Dua, N., Singh, S. N. & Semwal, V. B. (2021). Multi-input CNN-GRU based human activity recognition using wearable sensors. Computing, 103(7), 1461–1478.

Durga, B. K. & Rajesh, V. (2022). A ResNet deep learning based facial recognition design for future multimedia applications. Computers and Electrical Engineering, 104, 108384.

Gasparetto, A., Marcuzzo, M., Zangari, A. & Albarelli, A. (2022). A survey on text classification algorithms: From text to predictions. Information, 13(2), 83.

Gaurav, D., Tiwari, S. M., Goyal, A., Gandhi, N. & Abraham, A. (2020). Machine intelligence-based algorithms for spam filtering on document labeling. Soft Computing, 24(13), 9625–9638.

Guo, Z., Yang, C., Wang, D. & Liu, H. (2023). A novel deep learning model integrating CNN and GRU to predict particulate matter concentrations. Process Safety and Environmental Protection, 173, 604–613.

Hua, H., Liu, M., Li, Y., Deng, S. & Wang, Q. (2023). An ensemble framework for short-term load forecasting based on parallel CNN and GRU with improved ResNet. Electric Power Systems Research, 216, 109057.

Islam, M. S., Kabir, M. N., Ghani, N. A., Zamli, K. Z., Zulkifli, N. S. A., Rahman, M. M. & Moni, M. A. (2024). Challenges and future in deep learning for sentiment analysis: a comprehensive review and a proposed novel hybrid approach. Artificial Intelligence Review, 57(3), 62.

Jafari, S. & Byun, Y.-C. (2023). A CNN-GRU Approach to the Accurate Prediction of Batteries’ Remaining Useful Life from Charging Profiles. Computers, 12(11), 219.

Jain, A. (2021). SPAM filtering using artificial intelligence. Artificial Intelligence and Data Mining Approaches in Security Frameworks, 261–291.

Karasoy, O. & Balli, S. (2022). Spam SMS detection for Turkish language with deep text analysis and deep learning methods. Arabian Journal for Science and Engineering, 47(8), 9361–9377.

Kernbach, J. M. & Staartjes, V. E. (2022). Foundations of machine learning-based clinical prediction modeling: Part II—Generalization and overfitting. Machine Learning in Clinical Neuroscience: Foundations and Applications, 15–21.

Kigerl, A. (2020). Spam-based scams. The Palgrave Handbook of International Cybercrime and Cyberdeviance, 877–897.

Klemm, C. & Vennemann, P. (2021). Modeling and optimization of multi-energy systems in mixed-use districts: A review of existing methods and approaches. Renewable and Sustainable Energy Reviews, 135, 110206.

Kumari, P. & Toshniwal, D. (2021). Extreme gradient boosting and deep neural network based ensemble learning approach to forecast hourly solar irradiance. Journal of Cleaner Production, 279, 123285.

Li, H., Wang, S. X., Shang, F., Niu, K. & Song, R. (2024). Applications of large language models in cloud computing: An empirical study using real-world data. International Journal of Innovative Research in Computer Science & Technology, 12(4), 59–69.

Lu, L., Zhang, C., Cao, K., Deng, T. & Yang, Q. (2022). A multichannel CNN-GRU model for human activity recognition. IEEE Access, 10, 66797–66810.

Mehnatkesh, H., Jalali, S. M. J., Khosravi, A. & Nahavandi, S. (2023). An intelligent driven deep residual learning framework for brain tumor classification using MRI images. Expert Systems with Applications, 213, 119087.

Mienye, I. D., Swart, T. G. & Obaido, G. (2024). Recurrent Neural Networks: A Comprehensive Review of Architectures, Variants, and Applications. Information, 15(9), 517.

Minu, M. S. & Canessane, R. A. (2022). Deep learning-based aerial image classification model using inception with residual network and multilayer perceptron. Microprocessors and Microsystems, 95, 104652.

Nandwani, P. & Verma, R. (2021). A review on sentiment analysis and emotion detection from text. Social Network Analysis and Mining, 11(1), 81.

Naseem, U., Razzak, I., Khan, S. K. & Prasad, M. (2021). A comprehensive survey on word representation models: From classical to state-of-the-art word representation language models. Transactions on Asian and Low-Resource Language Information Processing, 20(5), 1–35.

Oruh, J., Viriri, S. & Adegun, A. (2022). Long short-term memory recurrent neural network for automatic speech recognition. IEEE Access, 10, 30069–30079.

Rao, S., Verma, A. K. & Bhatia, T. (2021). A review on social spam detection: Challenges, open issues, and future directions. Expert Systems with Applications, 186, 115742.

Rayan, A. (2022). Analysis of e-Mail Spam Detection Using a Novel Machine Learning-Based Hybrid Bagging Technique. Computational Intelligence and Neuroscience, 2022(1), 2500772.

Remya, S., Pillai, M. J., Nair, K. K., Subbareddy, S. R. & Cho, Y. Y. (2024). An Effective Detection Approach for Phishing URL Using ResMLP. IEEE Access.

Sabeeh, V., Zohdy, M., Mollah, A. & Al Bashaireh, R. (2020). Fake news detection on social media using deep learning and semantic knowledge sources. International Journal of Computer Science and Information Security (IJCSIS), 18(2), 45–68.

Sabir, B., Ullah, F., Babar, M. A. & Gaire, R. (2021). Machine learning for detecting data exfiltration: A review. ACM Computing Surveys (CSUR), 54(3), 1–47.

Saidani, N. (2021). A learning approach for spam detection using semantic representation. Université du Québec en Outaouais.

Salman, M., Ikram, M. & Kaafar, M. A. (2024). Investigating Evasive Techniques in SMS Spam Filtering: A Comparative Analysis of Machine Learning Models. IEEE Access.

Sharmin, T., Di Troia, F., Potika, K. & Stamp, M. (2020). Convolutional neural networks for image spam detection. Information Security Journal: A Global Perspective, 29(3), 103–117.

Thakur, P., Joshi, K., Jain, S. & Thakral, P. (2023). Spam Detection in Emails using Machine Learning.

Tubishat, M., Al-Obeidat, F., Sadiq, A. S. & Mirjalili, S. (2023). An Improved Dandelion Optimizer Algorithm for Spam Detection: Next-Generation Email Filtering System. Computers, 12(10), 196.

Vankdothu, R. & Hameed, M. A. (2022). Brain tumor MRI images identification and classification based on the recurrent convolutional neural network. Measurement: Sensors, 24, 100412.

Vijayakumar, B. & Thomas, C. (2024). The ethics of envisioning spam free email inboxes. AI and Ethics, 1–24.

Yin, X., Liu, Q., Pan, Y., Huang, X., Wu, J. & Wang, X. (2021). Strength of stacking technique of ensemble learning in rockburst prediction with imbalanced data: Comparison of eight single and ensemble models. Natural Resources Research, 30, 1795–1815.

Downloads

ARTICLE Published HISTORY

Submitted Date: 2024-10-16
Accepted Date: 2024-10-17
Published Date: 2024-10-31

How to Cite

Airlangga, G. (2024). A Comparative Analysis of Deep Learning Models for SMS Spam Detection: CNN-LSTM, CNN-GRU, and ResNet Approaches. Journal of Computer Networks, Architecture and High Performance Computing, 6(4), 1952-1960. https://doi.org/10.47709/cnahpc.v6i4.4827