Indonesian Vowel Feature Selection for Lip Sync Animation (Pemilihan Ciri Vokal Indonesia untuk Animasi Lip Sync)

Anung Rachman

Institut Seni Indonesia (ISI) Surakarta

Keywords: animation, voice recognition, vowel reference, formant frequency

Abstract

Voice recognition technology is now widely used to produce lip sync animation. However, current speech recognition systems do not use Indonesian as their reference language, so their results for Indonesian speech are inaccurate. To develop a more suitable system, Indonesian vowels should be used as the reference in the speech recognition system. Because every syllable contains a vowel, vowels play an important role in lip sync animation. Vowel features therefore need to be selected carefully to increase the accuracy of the system. This selection can be performed by comparing the mean formant values of a vowel at each position in a word: the front, the middle, and the back. The results show different values for the three positions. If the values differ significantly, the formant values are unfit to serve as a vowel reference for a voice recognition system. This research shows that the formant values of the vowels 'i' and 'o' do not differ significantly in any position. The formant values of the vowels 'a' and 'u' do not differ significantly in only one position. The formant values of the vowel 'e' differ significantly between every position.
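
The comparison of mean formant values across word positions can be illustrated with a minimal Python sketch. It is not the procedure used in this research: the abstract does not name the statistical test, so Welch's t statistic is assumed here, the cut-off `T_THRESHOLD` is an assumed value, the choice of the first formant (F1) is an assumption, and the measurements in `f1_by_position` are invented for illustration only.

```python
from itertools import combinations
from math import sqrt
from statistics import mean, variance

# Hypothetical F1 measurements (Hz) of one vowel, grouped by position in the word.
# These numbers are invented for illustration; they are not data from the study.
f1_by_position = {
    "front": [310.0, 325.0, 298.0, 315.0, 305.0],
    "middle": [312.0, 320.0, 301.0, 318.0, 309.0],
    "back": [365.0, 372.0, 358.0, 380.0, 369.0],
}


def welch_t(a, b):
    """Welch's t statistic for two samples with possibly unequal variances."""
    se = sqrt(variance(a) / len(a) + variance(b) / len(b))
    return (mean(a) - mean(b)) / se


# Assumed cut-off: |t| above this counts as "significantly different".
# A real analysis would derive a p-value from the t distribution instead.
T_THRESHOLD = 2.5


def compare_positions(samples, threshold=T_THRESHOLD):
    """Return {(pos_a, pos_b): (t, differs)} for every pair of positions."""
    results = {}
    for pos_a, pos_b in combinations(samples, 2):
        t = welch_t(samples[pos_a], samples[pos_b])
        results[(pos_a, pos_b)] = (t, abs(t) > threshold)
    return results


results = compare_positions(f1_by_position)
for (pos_a, pos_b), (t, differs) in results.items():
    print(f"{pos_a} vs {pos_b}: t = {t:.2f}, differs = {differs}")

# Following the criterion stated in the abstract, the vowel is treated as a
# usable reference only if no pair of positions differs significantly.
usable_as_reference = not any(differs for _, differs in results.values())
print("usable as reference:", usable_as_reference)
```

With these invented values the back position differs from the other two, so the vowel would be rejected as a reference. A vowel whose formants stay stable across all three positions would pass.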
