Speech Recognition System for Isolated Tamil Words using Random Forest Algorithm
Nivetha S1, Rathinavelu A2, Gayathri S3

1Nivetha S, Department of Computer Science and Engineering, Dr Mahalingam College of Engineering and Technology, Pollachi, India.
2Dr. Rathinavelu A, Department of Computer Science and Engineering, Dr Mahalingam College of Engineering and Technology, Pollachi, India.
3Ms. Gayathri S, Department of Computer Science and Engineering, Dr Mahalingam College of Engineering and Technology, Pollachi, India.

Manuscript received on April 30, 2020. | Revised Manuscript received on May 06, 2020. | Manuscript published on May 30, 2020. | PP: 2431-2435 | Volume-9 Issue-1, May 2020. | Retrieval Number: A1467059120/2020©BEIESP | DOI: 10.35940/ijrte.A1467.059120
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: ASR is the use of system software and hardware based techniques to identify and process human voice. In this research, Tamil words are analyzed, segmented as syllables, followed by feature extraction and recognition. Syllables are segmented using short term energy and segmentation is done in order to minimize the corpus size. The algorithm for syllable segmentation works by performing the STE function of the continuous speech signal. The proposed approach for speech recognition uses the combination of Mel-Frequency Cepstral Coefficients (MFCC) and Linear Predictive Coding (LPC). MFCC features are used to extract a feature vector containing all information about the linguistic message. The LPC affords a robust, dependable and correct technique for estimating the parameters that signify the vocal tract system.LPC features can reduce the bit rate of speech (i.e reducing the measurement of transmitting signal).The combined feature extraction technique will minimize the size of transmitting signal. Then the proposed FE algorithm is evaluated on the speech corpus using the Random forest approach. Random forest is an effective algorithm which can build a reliable training model as its training time is less because the classifier works on the subset of features alone. 
Keywords: Automatic Speech Recognition (ASR), LPC, MFCC, Random forest, Recognition rate, Short term energy
Scope of the Article: Automatic Speech Recognition