Rule Based Parts of Speech Tagger for Chhattisgarhi Language
Vikas Pandey1, M.V Padmavati2, Ramesh Kumar3

1Vikas Pandey, Department of Information Technology, Bhilai Institute of Technology, (Chhattisgarh), India.
2Dr. M.V Padmavati, Department of Computer Science and Engg., Bhilai Institute of Technology, (Chhattisgarh), India.
3Dr. Ramesh Kumar, Department of Computer Science and Engg., Bhilai Institute of Technology, Durg, (Chhattisgarh), India.

Manuscript received on 24 September 2018 | Revised Manuscript received on 30 September 2018 | Manuscript published on 30 November 2018 | PP: 192-194 | Volume-7 Issue-4, November 2018 | Retrieval Number: E1826017519©BEIESP
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (

Abstract: There is an increasing demand for machine translation systems for various regional languages of India. Chhattisgarhi being the language of the young Chhattisgarh state requires automatic languages translating system. Various types of natural language processing (NLP) tools are required for developing Chhattisgarhi to Hindi machine translation (MT) system. In this paper, we are presenting rule based parts of speech tagger for Chhattisgarhi language. Parts of Speech tagging is a procedure in which each word of sentence is assigned a tag from tag set. The Parts of Speech tagger is based on rule base which is formed by taken into consideration the grammatical structure of Chhattisgarhi language. The system is constructed over corpus size of 40,000 words with tag set consists of 30 different parts of speech tags. The corpus is taken from various Chhattisgarhi stories. The system achieves an accuracy of 78%.
Keywords: Chhattisgarhi, Machine Translation, Natural Language Processing, Parts of Speech tagger, Rule Based System

Scope of the Article: Machine/ Deep Learning with IoT & IoE