Teachable Machine: East Sumba Dialect (Kambera) Detection Using Open Source Services
Abstract
This research develops a phonetic detection system for Kambera, the local language dialect of East Sumba, built on the TensorFlow framework and deployed in a mobile application. As part of this work, a representative dataset of Kambera phonetic samples was compiled, with the main objective of improving precision in phonetic recognition. Using the Kambera dialect as a case study, the data were extracted and the model was trained with the open-source Teachable Machine service. The approach combines a convolutional neural network (CNN) with Mel-frequency cepstral coefficients (MFCC) for more accurate feature extraction. After data collection, model training, and testing, the model was integrated into an Android application for members of the public who wish to understand the Kambera dialect of East Sumba. The system was designed to detect and interpret the phonetics of the Kambera dialect, contributing to better phonetic recognition and providing a dataset for further research. It also serves as an accessible linguistics education tool and supports linguistic inclusion and diversity in digital technology. Empirical evaluation showed that the average dialect detection precision ranged from 98.3% to 99.6%, and the user satisfaction rate reached 99.33%. These results indicate that the developed system detects the Kambera dialect efficiently and with high precision.
© Jurnal Nasional Teknik Elektro dan Teknologi Informasi, under the terms of the Creative Commons Attribution-ShareAlike 4.0 International License.