Hierarchical Picture of existing Audio-Visual Speech Database
Bibish Kumar K T¹, Sunil John², Muraleedharan K M³, R K Sunil Kumar⁴
¹Bibish Kumar K T, Computer Speech & intelligence Research Centre, Department of Physics, Government College, Madappally, Vadakara, Calicut, Kerala, India.
²Sunil John, Computer Speech & intelligence Research Centre, Department of Physics, Government College, Madappally, Vadakara, Calicut, Kerala, India.
³Muraleedharan K M, Computer Speech & intelligence Research Centre, Department of Physics, Government College, Madappally, Vadakara, Calicut, Kerala, India.
⁴R K Sunil Kumar, School of Information Science and Technology, Kannur University, Kerala, India.
Manuscript received on 05 August 2019. | Revised Manuscript received on 16 August 2019. | Manuscript published on 30 September 2019. | PP: 8372-8379 | Volume-8 Issue-3 September 2019 | Retrieval Number: C6483098319/2019©BEIESP | DOI: 10.35940/ijrte.C6483.098319

Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: Despite the technological improvement and arrival of new methodologies in the different process of a speech-based applications, a parallel development is not observed in the availability of audio-visual speech database. This paper provides a detailed hierarchical picture of the existing audio-visual speech database. Since the performance of a speech-based application deeply depends on the different parameters like the number of speakers, speaker variability, phonetically balanced sentences, recording quality etc. involved in the creation of a database to attain specific task. This paper gave more importance to these parameters involved in the exciting audio-visual speech database rather than the experimental side which need linguistic knowledge about the concerned language in the feature extraction task and classification task. This paper is arranged in such a way that a new face in this realm can capture the needy things to build a speech database in his language. In addition, this paper differs from other review papers in the aspect that it gives equal importance to the available audio-visual speech database in the resourced and under-resourced languages.
Keywords: Audio-Visual Speech Database, Speech-Based Applications, Video Parameters and Audio Parameters.
Scope of the Article: Visual Analytics

Download PDF

JOURNAL

REQUIREMENTS

PRODUCT

CONTACT US