A novel method is proposed to recognize the Arab/Jawi and Roman digits. This new method is based on features from the triangle geometry, normalized into nine features. The features are used for zoning which results in five and 25 zones. The algorithm is validated by using three standard datasets which are publicly available and used by researchers in this field. The first dataset is HODA that contains 60,000 images for training and 20,000 images for testing. The second dataset is IFHCDB. This dataset has 52,380 isolated characters and 17,740 digits. Only the 17,740 images of digits are used for this research. For the roman digit, MNIST are chosen. MNIST dataset has 60,000 images for training and 10,000 images for testing. Supervised (SML) and Unsupervised Machine Learning (UML) are used to test the nine features. The SML used are Neural Network (NN) and Support Vector Machine (SVM). Whereas the UML uses Euclidean Distance Method with data mining algorithms; namely Mean Average Precision (eMAP) and Frequency Based (eFB). Results for SML testing for HODA dataset are 98.07% accuracy for SVM, and 96.73% for NN. For IFHCDB and MNIST the accuracy are 91.75% and 93.095% respectively. For the UML tests, HODA dataset is 93.91%, IFHCDB 85.94% and MNIST 86.61%. The train and test images are selected using both random and the original dataset's distribution. The results show that the accuracy of proposed algorithm is over 90% for each SML trained datasets where the highest result is the one that uses 25 zones features.
Skip Nav Destination
Article navigation
22 April 2013
PROCEEDINGS OF THE 20TH NATIONAL SYMPOSIUM ON MATHEMATICAL SCIENCES: Research in Mathematical Sciences: A Catalyst for Creativity and Innovation
18–20 December 2012
Palm Garden Hotel, Putrajaya, Malaysia
Research Article|
April 22 2013
Digit recognition for Arabic/Jawi and Roman using features from triangle geometry
Mohd Sanusi Azmi;
Mohd Sanusi Azmi
Faculty of Information and Communication Technology, Universiti Teknikal Malaysia, 76100 Melaka,
Malaysia
Search for other works by this author on:
Khairuddin Omar;
Khairuddin Omar
Faculty of Information Sciences and Technology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor DE,
Malaysia
Search for other works by this author on:
Mohamad Faidzul Nasrudin;
Mohamad Faidzul Nasrudin
Faculty of Information Sciences and Technology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor DE,
Malaysia
Search for other works by this author on:
Bahari Idrus;
Bahari Idrus
Faculty of Information Sciences and Technology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor DE,
Malaysia
Search for other works by this author on:
Khadijah Wan Mohd Ghazali
Khadijah Wan Mohd Ghazali
Faculty of Information and Communication Technology, Universiti Teknikal Malaysia, 76100 Melaka,
Malaysia
Search for other works by this author on:
AIP Conf. Proc. 1522, 526–537 (2013)
Citation
Mohd Sanusi Azmi, Khairuddin Omar, Mohamad Faidzul Nasrudin, Bahari Idrus, Khadijah Wan Mohd Ghazali; Digit recognition for Arabic/Jawi and Roman using features from triangle geometry. AIP Conf. Proc. 22 April 2013; 1522 (1): 526–537. https://doi.org/10.1063/1.4801171
Download citation file:
Sign in
Don't already have an account? Register
Sign In
You could not be signed in. Please check your credentials and make sure you have an active account and try again.
Could not validate captcha. Please try again.
Sign in via your Institution
Sign in via your InstitutionPay-Per-View Access
$40.00
Citing articles via
Related Content
Preliminary study towards the development of copying skill assessment on dyslexic children in Jawi handwriting
AIP Conference Proceedings (May 2015)
The identity of Kelantan peranakan Chinese through clothing: An aesthetic morphology approach
AIP Conference Proceedings (July 2021)
Non‐natives’ identification of English consonants in noise
J Acoust Soc Am (May 1997)
Roman domination matrix
AIP Conference Proceedings (June 2023)
Characterization of the Roman curse tablet
AIP Conference Proceedings (August 2017)