Recognition of Arabic Vocabulary Based on Machine Learning Using a Convolutional Neural Network on Mobile Devices

Authors

  • Rahmat Rahmat Institut Agama Islam Negeri Ternate, Indonesia
  • Fauzi Institut Agama Islam Negeri Ternate, Indonesia
  • Muhammad Husni Mubarak Institut Agama Islam Negeri Manado, Indonesia

DOI:

https://doi.org/10.70115/semesta.v4i2.439

Keywords:

mufradat recognition, machine learning , mobile device, convolutional neural networks, transfer learning

Abstract

This study develops and evaluates machine-learning models based on Convolutional Neural Networks (CNNs) for recognizing images of Arabic vocabulary (mufradat) and for deploying these models on resource-constrained mobile devices. Whereas most prior research on Arabic-script recognition has concentrated on isolated characters executed on desktop hardware, the recognition of whole words—whose connected and visually similar glyphs increase classification difficulty—remains comparatively underexplored, particularly for on-device educational use. To address this gap, the study contributes (i) a purpose-built image dataset of fifteen academic Arabic words, (ii) a systematic comparison between a CNN trained from scratch and a MobileNetV2 transfer-learning model, and (iii) a quantified analysis of mobile deployment. An experimental approach was adopted using 3,000 images (200 per class) compiled from tablet handwriting and Microsoft Word screen-captured images, partitioned through a stratified 70/15/15 training, validation, and testing split. Both models were trained using the Adam optimizer (learning rate 1×10⁻⁴), a batch size of 32, and 50 epochs. The from-scratch five-convolution model attained 94.4% test accuracy (loss 0.26; macro-averaged F1-score 0.95), whereas the MobileNetV2 model attained 99.1% accuracy (loss 0.20; macro-averaged F1-score 0.99). After conversion to TensorFlow Lite, the MobileNetV2 model required only 9.1 MB of storage and 42 ms per inference on a mid-range Android device, compared with 103 MB and 180 ms for the from-scratch model, confirming its suitability for real-time use. The findings demonstrate that transfer learning achieves higher accuracy with markedly fewer parameters and a smaller computational footprint, providing an efficient foundation for mobile-assisted Arabic vocabulary learning.

References

Agusten, D., & Supriyatin, W. (2015). Rancang bangun aplikasi huruf hijaiyah dan angka Arab sebagai media pembelajaran interaktif menggunakan Adobe Flash CS 5.5. Proceedings of KOMMIT.

Akil, I., & Chaidir, I. (2021). Deteksi karakter huruf Arab dengan menggunakan Convolutional Neural Network. INTI Nusa Mandiri, 15(2), 183–188. https://doi.org/10.33480/inti.v15i2.2179

Altwaijry, N., & Al-Turaiki, I. (2021). Arabic handwriting recognition system using convolutional neural network. Neural Computing and Applications, 33(7), 2249–2261. https://doi.org/10.1007/s00521-020-05070-8

Arhmawati, R. A., Azzahroh J. R., S., & Faizin, M. (2025). Metode pembelajaran dalam pendidikan Islam: Strategi, pendekatan, dan tantangan di era digital. TAMADDUN: Jurnal Ilmu Sosial, Seni, dan Humaniora, 3(3), 147–157. https://ejournal.ahs-edu.org/index.php/tamaddun/article/view/367

Bin Durayhim, A., Al-Ajlan, A., Al-Turaiki, I., & Altwaijry, N. (2023). Towards accurate children's Arabic handwriting recognition via deep learning. Applied Sciences, 13(3), 1692. https://doi.org/10.3390/app13031692

Buduma, N., & Locascio, N. (n.d.). Fundamentals of deep learning. O'Reilly Media.

El Khayati, M., Maafiri, A., Himeur, Y., Alkhazaleh, H. A., Atalla, S., & Mansoor, W. (2025). Leveraging transfer learning and mobile-enabled convolutional neural networks for improved Arabic handwritten character recognition. arXiv preprint arXiv:2509.05019.

El-Sawy, A., Loey, M., & El-Bakry, H. (2017). Arabic handwritten characters recognition using convolutional neural network. WSEAS Transactions on Computer Research, 5, 11–19.

Faizullah, S., Ayub, M. S., Hussain, S., & Khan, M. A. (2023). A survey of OCR in Arabic language: Applications, techniques, and challenges. Applied Sciences, 13(7), 4584. https://doi.org/10.3390/app13074584

Fauzi, F., Permanasari, A. E., & Setiawan, N. A. (2021). Butterfly image classification using convolutional neural network (CNN). 2021 3rd International Conference on Electronics Representation and Algorithm (ICERA), 66–70. https://doi.org/10.1109/ICERA53111.2021.9538686

Gulzar, Y. (2023). Fruit image classification model based on MobileNetV2 with deep transfer learning technique. Sustainability, 15(3), 1906. https://doi.org/10.3390/su15031906

Halim, A. F. (2018). Multimedia pembelajaran bahasa Arab berbasis mobile. Jurnal Al-Fawa'id: Jurnal Agama dan Bahasa, 8(1). https://doi.org/10.54214/alfawaid.Vol8.Iss1.112

Haniah, H. (2014). Pemanfaatan teknologi informasi dalam mengatasi masalah belajar bahasa Arab. Al-Ta'rib: Jurnal Ilmiah Program Studi Pendidikan Bahasa Arab IAIN Palangka Raya, 2(1). https://doi.org/10.23971/altarib.v2i1.588

Hjaiej, M., Cheikh, I. B., & Abbes, H. (2025). Deep learning for Arabic word classification: Leveraging transfer learning and Grad-CAM for morphological analysis. In Pattern Recognition. ICPR 2024 (Lecture Notes in Computer Science, Vol. 15331, pp. 295–309). Springer. https://doi.org/10.1007/978-3-031-78119-3_22

Istiqomah, Rofiq, M. H., & Hasanah, K. D. (2024). Pengaruh media komik Sahabat Anak Muslim dalam peningkatan motivasi belajar peserta didik mata pelajaran Pendidikan Agama Islam di SDN Gondang. SEMESTA: Jurnal Ilmu Pendidikan dan Pengajaran, 2(2), 76–82. https://ejournal.ahs-edu.org/index.php/semesta/article/view/128

Kasim, N., & Nugraha, G. S. (2021). Pengenalan pola tulisan tangan aksara Arab menggunakan metode convolution neural network. Jurnal Teknologi Informasi, Komputer, dan Aplikasinya (JTIKA), 3(1), 85–95. https://doi.org/10.29303/jtika.v3i1.136

Khan, S., Rahmani, H., Ali Shah, S. A., & Bennamoun, M. (n.d.). A guide to convolutional neural networks for computer vision. Morgan & Claypool.

Lahiani, H., & Frikha, M. (2024). Exploring CNN-based transfer learning approaches for Arabic alphabets sign language recognition using the ArSL2018 dataset. International Journal of Intelligent Engineering Informatics, 12(2), 236–260. https://doi.org/10.1504/IJIEI.2024.138858

Lamsaf, A., Ait Kerroum, M., Boulaknadel, S., & Fakhri, Y. (2022). Recognition of Arabic handwritten words using convolutional neural network. Indonesian Journal of Electrical Engineering and Computer Science, 26(2), 1148–1155. https://doi.org/10.11591/ijeecs.v26.i2.pp1148-1155

Mosbah, L., Moalla, I., Hamdani, T. M., Neji, B., Beyrouthy, T., & Alimi, A. M. (2024). ADOCRNet: A deep learning OCR for Arabic documents recognition. IEEE Access, 12, 55620–55631. https://doi.org/10.1109/ACCESS.2024.3379530

Mudhsh, M., & Almodfer, R. (2017). Arabic handwritten alphanumeric character recognition using very deep neural network. Information, 8(3), 105. https://doi.org/10.3390/info8030105

Mustofa, S. (2017). Strategi pembelajaran bahasa Arab inovatif (Vol. 2). UIN-Maliki Press.

Najam, R., & Faizullah, S. (2023). Analysis of recent deep learning techniques for Arabic handwritten-text OCR and post-OCR correction. Applied Sciences, 13(13), 7568. https://doi.org/10.3390/app13137568

Nurrahmah, Turmuzi, A., Deni, S., Yakin, N., & Anam, M. C. (2024). Penggunaan media pembelajaran dalam meningkatkan motivasi belajar siswa bidang studi IPS. TAMADDUN: Jurnal Ilmu Sosial, Seni, dan Humaniora, 2(3), 146–152. https://ejournal.ahs-edu.org/index.php/tamaddun/article/view/313

Rahal, N., Tounsi, M., Hussain, A., & Alimi, A. M. (2021). Deep sparse auto-encoder features learning for Arabic text recognition. IEEE Access, 9, 18569–18584. https://doi.org/10.1109/ACCESS.2021.3053618

Riswadi, Amrullah, Z., & Ulum, B. (2025). Integrasi metode active learning dalam penguatan nilai-nilai Islami pada pembelajaran PAI. SEMESTA: Jurnal Ilmu Pendidikan dan Pengajaran, 3(3), 102–112. https://ejournal.ahs-edu.org/index.php/semesta/article/view/319

Shareef, S. R., & Irhayim, Y. F. (2021). A review: Isolated Arabic words recognition using artificial intelligent techniques. Journal of Physics: Conference Series, 1897(1), 012026. https://doi.org/10.1088/1742-6596/1897/1/012026

Sugiyono. (2013). Metode penelitian kuantitatif, kualitatif, dan R&D. Alfabeta.

Sumiyati, Muafaq, M. F., & Susilowati, S. (2024). Media for effective instruction of Islamic education. SEMESTA: Jurnal Ilmu Pendidikan dan Pengajaran, 2(2), 83–90. https://ejournal.ahs-edu.org/index.php/semesta/article/view/157

The Royal Islamic Strategic Studies Centre. (n.d.). Home. Retrieved September 29, 2022, from https://rissc.jo/

Ullah, Z., & Jamjoom, M. (2022). An intelligent approach for Arabic handwritten letter recognition using convolutional neural network. PeerJ Computer Science, 8, e995. https://doi.org/10.7717/peerj-cs.995

Wagaa, N., Kallel, H., & Mellouli, N. (2022). Improved Arabic alphabet characters classification using convolutional neural networks (CNN). Computational Intelligence and Neuroscience, 2022, 9965426. https://doi.org/10.1155/2022/9965426

Downloads

Published

2026-06-04

Issue

Section

Articles

How to Cite

Recognition of Arabic Vocabulary Based on Machine Learning Using a Convolutional Neural Network on Mobile Devices. (2026). SEMESTA: Jurnal Ilmu Pendidikan Dan Pengajaran, 4(2), 154-172. https://doi.org/10.70115/semesta.v4i2.439