Recognition of Nastaliq Urdu Text using Multi-SVM
Herleen Kour1, Mehvish Yasin2, Naveen Gondhi3
1Herleen Kour*, Computer Science and Engineering, Shri Mata Vaishno Devi University, Katra, India.
2Mehvish Yasin, Computer Science and Engineering, Shri Mata Vaishno Devi University, Katra, India.
3Dr. Naveen Gondhi, Computer Science and Engineering, Shri Mata Vaishno Devi University, Katra, India.
Manuscript received on January 02, 2020. | Revised Manuscript received on January 15, 2020. | Manuscript published on January 30, 2020. | PP: 5665-5674 | Volume-8 Issue-5, January 2020. | Retrieval Number: E6949018520/2020©BEIESP | DOI: 10.35940/ijrte.E6949.018520

© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license.

Abstract: Optical Character Recognition (OCR) has emerged as an attractive research field. A lot of work has been done on Urdu script using various approaches, and diverse methodologies have been put forward for the Nastaliq font style. In Nastaliq, Urdu is written diagonally from top to bottom. This feature makes Urdu highly cursive and more sensitive, leading to a difficult recognition problem. Because of the peculiarities of the Nastaliq style of writing, we have chosen the ligature as the basic unit of recognition in order to reduce the complexity of the system. The accuracy of ligature recognition in Urdu text depends on how well the ligatures are segmented. In addition to extracting connected components, the ligature segmentation takes into consideration factors such as baseline information, height, width, and centroid. In this paper, ligature recognition is performed using a multi-SVM (Support Vector Machine) approach, which gives an accuracy of 97% when 903 text images are fed to it.
Keywords: OCR, Nastaliq, Segmentation, Recognition, SVM
Scope of the Article: Pattern Recognition
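
Illustrative sketch: the abstract names connected components, baseline information, height, width, and centroid as inputs to ligature segmentation, but does not give the procedure. The Python below is only a minimal illustration of how such quantities could be computed on a toy binary image. The function names (`connected_components`, `describe`, `estimate_baseline`), the 8-connectivity choice, and the "densest row" baseline heuristic are assumptions made for this example, not the authors' method, and the classification step with multi-SVM is not shown.

```python
from collections import deque


def connected_components(image):
    """Group 8-connected foreground pixels (value 1) into components.

    Returns a list of components, each a list of (row, col) pixels,
    in the order they are first met by a row-major scan.
    """
    rows, cols = len(image), len(image[0])
    seen = [[False] * cols for _ in range(rows)]
    components = []
    for r in range(rows):
        for c in range(cols):
            if image[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill from this unvisited foreground pixel.
            queue = deque([(r, c)])
            seen[r][c] = True
            pixels = []
            while queue:
                y, x = queue.popleft()
                pixels.append((y, x))
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and image[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            components.append(pixels)
    return components


def describe(pixels):
    """Bounding-box height and width plus centroid of one component."""
    ys = [y for y, _ in pixels]
    xs = [x for _, x in pixels]
    return {
        "height": max(ys) - min(ys) + 1,
        "width": max(xs) - min(xs) + 1,
        "centroid": (sum(ys) / len(ys), sum(xs) / len(xs)),
    }


def estimate_baseline(image):
    """Assumed heuristic: the row holding the most foreground pixels."""
    counts = [sum(row) for row in image]
    return counts.index(max(counts))


# Toy 4x6 binary image: an isolated dot, a short vertical stroke,
# and a horizontal stroke lying on the densest row.
toy = [
    [0, 1, 0, 0, 0, 0],
    [0, 0, 0, 0, 1, 0],
    [1, 1, 1, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
]

components = connected_components(toy)
features = [describe(p) for p in components]
baseline = estimate_baseline(toy)
```

In a setup like this, small components far from the baseline (such as the isolated dot) could be attached to the nearest larger component to form a ligature, and the resulting feature vectors could then be passed to a multi-class SVM; both of those steps are left out here because the abstract does not specify them.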