Automatic Geminate Insertion Algorithm for Japanese Audio Data
Hirofumi Maeda1, Kenta Yamamoto2

1Hirofumi Maeda*, Department of Information Science and Technology, National Institute of Technology (KOSEN), Yuge College, Ehime Prefecture, Japan.
2Kenta Yamamoto, Department of General Education, National Institute of Technology (KOSEN), Yuge College, Ehime Prefecture, Japan.

Manuscript received on July 17, 2021. | Revised Manuscript received on July 21, 2021. | Manuscript published on July 30, 2021. | PP: 163-169 | Volume-10 Issue-2, July 2021. | Retrieval Number: 100.1/ijrte.B62840710221| DOI: 10.35940/ijrte.B6284.0710221
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: Generally, it is quite difficult for Japanese language learners to acquire Japanese special morae, namely, geminate, syllabic nasals and long vowels compared to independent morae. Among these three special morae, geminate is particularly difficult, and it takes much longer to fully acquire both production and perception of it. Especially for learners of Chinese native speakers, previous studies has shown that both production and perception of geminate are difficult in terms of the fact that not only no geminate is found in Chinese language, but also the phonological interaction between Japanese accent and Chinese tones. However, in the field of Japanese speech acquisition, research has not making progress because of a major problem, that is, researchers themselves manually create the acoustic experiment stimuli. Therefore, in this study, as a method to solve this problem, we propose an algorithm that automatically inserts geminate into the audio data used in Japanese speech acquisition research. This algorithm automates the insertion of geminate by performing three processes in order: mora extraction by noise removal, matching of original audio data and extracted mora, and insertion of soundless duration and geminate. The algorithm makes it possible to remove the noise, which is -50 dBFS and continues for 10ms or more, and replace it with soundless duration instead, allowing Japanese native speakers to percept it as geminate. The accuracy was equivalent as a result of comparing the data that was manually modified by a phonology researcher with the data that was generated by the algorithm. The result shows that the algorithm can be a practical solution for the automation of geminate insertion. 
Keywords: Acoustic processing, Japanese language learning, Geminate, Noise reduction, Algorithm.