Recognition of Arabic Vocabulary Based on Machine Learning Using a Convolutional Neural Network on Mobile Devices
DOI:
https://doi.org/10.70115/semesta.v4i2.439Keywords:
mufradat recognition, machine learning , mobile device, convolutional neural networks, transfer learningAbstract
This study develops and evaluates machine-learning models based on Convolutional Neural Networks (CNNs) for recognizing images of Arabic vocabulary (mufradat) and for deploying these models on resource-constrained mobile devices. Whereas most prior research on Arabic-script recognition has concentrated on isolated characters executed on desktop hardware, the recognition of whole words—whose connected and visually similar glyphs increase classification difficulty—remains comparatively underexplored, particularly for on-device educational use. To address this gap, the study contributes (i) a purpose-built image dataset of fifteen academic Arabic words, (ii) a systematic comparison between a CNN trained from scratch and a MobileNetV2 transfer-learning model, and (iii) a quantified analysis of mobile deployment. An experimental approach was adopted using 3,000 images (200 per class) compiled from tablet handwriting and Microsoft Word screen-captured images, partitioned through a stratified 70/15/15 training, validation, and testing split. Both models were trained using the Adam optimizer (learning rate 1×10⁻⁴), a batch size of 32, and 50 epochs. The from-scratch five-convolution model attained 94.4% test accuracy (loss 0.26; macro-averaged F1-score 0.95), whereas the MobileNetV2 model attained 99.1% accuracy (loss 0.20; macro-averaged F1-score 0.99). After conversion to TensorFlow Lite, the MobileNetV2 model required only 9.1 MB of storage and 42 ms per inference on a mid-range Android device, compared with 103 MB and 180 ms for the from-scratch model, confirming its suitability for real-time use. The findings demonstrate that transfer learning achieves higher accuracy with markedly fewer parameters and a smaller computational footprint, providing an efficient foundation for mobile-assisted Arabic vocabulary learning.
References
Agusten, D., & Supriyatin, W. (2015). Rancang bangun aplikasi huruf hijaiyah dan angka Arab sebagai media pembelajaran interaktif menggunakan Adobe Flash CS 5.5. Proceedings of KOMMIT.
Akil, I., & Chaidir, I. (2021). Deteksi karakter huruf Arab dengan menggunakan Convolutional Neural Network. INTI Nusa Mandiri, 15(2), 183–188. https://doi.org/10.33480/inti.v15i2.2179
Altwaijry, N., & Al-Turaiki, I. (2021). Arabic handwriting recognition system using convolutional neural network. Neural Computing and Applications, 33(7), 2249–2261. https://doi.org/10.1007/s00521-020-05070-8
Arhmawati, R. A., Azzahroh J. R., S., & Faizin, M. (2025). Metode pembelajaran dalam pendidikan Islam: Strategi, pendekatan, dan tantangan di era digital. TAMADDUN: Jurnal Ilmu Sosial, Seni, dan Humaniora, 3(3), 147–157. https://ejournal.ahs-edu.org/index.php/tamaddun/article/view/367
Bin Durayhim, A., Al-Ajlan, A., Al-Turaiki, I., & Altwaijry, N. (2023). Towards accurate children's Arabic handwriting recognition via deep learning. Applied Sciences, 13(3), 1692. https://doi.org/10.3390/app13031692
Buduma, N., & Locascio, N. (n.d.). Fundamentals of deep learning. O'Reilly Media.
El Khayati, M., Maafiri, A., Himeur, Y., Alkhazaleh, H. A., Atalla, S., & Mansoor, W. (2025). Leveraging transfer learning and mobile-enabled convolutional neural networks for improved Arabic handwritten character recognition. arXiv preprint arXiv:2509.05019.
El-Sawy, A., Loey, M., & El-Bakry, H. (2017). Arabic handwritten characters recognition using convolutional neural network. WSEAS Transactions on Computer Research, 5, 11–19.
Faizullah, S., Ayub, M. S., Hussain, S., & Khan, M. A. (2023). A survey of OCR in Arabic language: Applications, techniques, and challenges. Applied Sciences, 13(7), 4584. https://doi.org/10.3390/app13074584
Fauzi, F., Permanasari, A. E., & Setiawan, N. A. (2021). Butterfly image classification using convolutional neural network (CNN). 2021 3rd International Conference on Electronics Representation and Algorithm (ICERA), 66–70. https://doi.org/10.1109/ICERA53111.2021.9538686
Gulzar, Y. (2023). Fruit image classification model based on MobileNetV2 with deep transfer learning technique. Sustainability, 15(3), 1906. https://doi.org/10.3390/su15031906
Halim, A. F. (2018). Multimedia pembelajaran bahasa Arab berbasis mobile. Jurnal Al-Fawa'id: Jurnal Agama dan Bahasa, 8(1). https://doi.org/10.54214/alfawaid.Vol8.Iss1.112
Haniah, H. (2014). Pemanfaatan teknologi informasi dalam mengatasi masalah belajar bahasa Arab. Al-Ta'rib: Jurnal Ilmiah Program Studi Pendidikan Bahasa Arab IAIN Palangka Raya, 2(1). https://doi.org/10.23971/altarib.v2i1.588
Hjaiej, M., Cheikh, I. B., & Abbes, H. (2025). Deep learning for Arabic word classification: Leveraging transfer learning and Grad-CAM for morphological analysis. In Pattern Recognition. ICPR 2024 (Lecture Notes in Computer Science, Vol. 15331, pp. 295–309). Springer. https://doi.org/10.1007/978-3-031-78119-3_22
Istiqomah, Rofiq, M. H., & Hasanah, K. D. (2024). Pengaruh media komik Sahabat Anak Muslim dalam peningkatan motivasi belajar peserta didik mata pelajaran Pendidikan Agama Islam di SDN Gondang. SEMESTA: Jurnal Ilmu Pendidikan dan Pengajaran, 2(2), 76–82. https://ejournal.ahs-edu.org/index.php/semesta/article/view/128
Kasim, N., & Nugraha, G. S. (2021). Pengenalan pola tulisan tangan aksara Arab menggunakan metode convolution neural network. Jurnal Teknologi Informasi, Komputer, dan Aplikasinya (JTIKA), 3(1), 85–95. https://doi.org/10.29303/jtika.v3i1.136
Khan, S., Rahmani, H., Ali Shah, S. A., & Bennamoun, M. (n.d.). A guide to convolutional neural networks for computer vision. Morgan & Claypool.
Lahiani, H., & Frikha, M. (2024). Exploring CNN-based transfer learning approaches for Arabic alphabets sign language recognition using the ArSL2018 dataset. International Journal of Intelligent Engineering Informatics, 12(2), 236–260. https://doi.org/10.1504/IJIEI.2024.138858
Lamsaf, A., Ait Kerroum, M., Boulaknadel, S., & Fakhri, Y. (2022). Recognition of Arabic handwritten words using convolutional neural network. Indonesian Journal of Electrical Engineering and Computer Science, 26(2), 1148–1155. https://doi.org/10.11591/ijeecs.v26.i2.pp1148-1155
Mosbah, L., Moalla, I., Hamdani, T. M., Neji, B., Beyrouthy, T., & Alimi, A. M. (2024). ADOCRNet: A deep learning OCR for Arabic documents recognition. IEEE Access, 12, 55620–55631. https://doi.org/10.1109/ACCESS.2024.3379530
Mudhsh, M., & Almodfer, R. (2017). Arabic handwritten alphanumeric character recognition using very deep neural network. Information, 8(3), 105. https://doi.org/10.3390/info8030105
Mustofa, S. (2017). Strategi pembelajaran bahasa Arab inovatif (Vol. 2). UIN-Maliki Press.
Najam, R., & Faizullah, S. (2023). Analysis of recent deep learning techniques for Arabic handwritten-text OCR and post-OCR correction. Applied Sciences, 13(13), 7568. https://doi.org/10.3390/app13137568
Nurrahmah, Turmuzi, A., Deni, S., Yakin, N., & Anam, M. C. (2024). Penggunaan media pembelajaran dalam meningkatkan motivasi belajar siswa bidang studi IPS. TAMADDUN: Jurnal Ilmu Sosial, Seni, dan Humaniora, 2(3), 146–152. https://ejournal.ahs-edu.org/index.php/tamaddun/article/view/313
Rahal, N., Tounsi, M., Hussain, A., & Alimi, A. M. (2021). Deep sparse auto-encoder features learning for Arabic text recognition. IEEE Access, 9, 18569–18584. https://doi.org/10.1109/ACCESS.2021.3053618
Riswadi, Amrullah, Z., & Ulum, B. (2025). Integrasi metode active learning dalam penguatan nilai-nilai Islami pada pembelajaran PAI. SEMESTA: Jurnal Ilmu Pendidikan dan Pengajaran, 3(3), 102–112. https://ejournal.ahs-edu.org/index.php/semesta/article/view/319
Shareef, S. R., & Irhayim, Y. F. (2021). A review: Isolated Arabic words recognition using artificial intelligent techniques. Journal of Physics: Conference Series, 1897(1), 012026. https://doi.org/10.1088/1742-6596/1897/1/012026
Sugiyono. (2013). Metode penelitian kuantitatif, kualitatif, dan R&D. Alfabeta.
Sumiyati, Muafaq, M. F., & Susilowati, S. (2024). Media for effective instruction of Islamic education. SEMESTA: Jurnal Ilmu Pendidikan dan Pengajaran, 2(2), 83–90. https://ejournal.ahs-edu.org/index.php/semesta/article/view/157
The Royal Islamic Strategic Studies Centre. (n.d.). Home. Retrieved September 29, 2022, from https://rissc.jo/
Ullah, Z., & Jamjoom, M. (2022). An intelligent approach for Arabic handwritten letter recognition using convolutional neural network. PeerJ Computer Science, 8, e995. https://doi.org/10.7717/peerj-cs.995
Wagaa, N., Kallel, H., & Mellouli, N. (2022). Improved Arabic alphabet characters classification using convolutional neural networks (CNN). Computational Intelligence and Neuroscience, 2022, 9965426. https://doi.org/10.1155/2022/9965426
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Rahmat, Fauzi, Muhammad Husni Mubarak

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.







