Tamil Natural Language Voice Classification using Recurrent Neural Networks

Authors

  • Nishaanth Kanna Ravichandran Student, Department of Computer Science and Engineering, PSG College of Technology, Coimbatore, India

Keywords:

Deep Recurrent Neural Network (DRNN), Long-Short Term Memory (LSTM), Mel Frequency Cepstral Coefficient (MFCC), TensorFlow Lite

Abstract

Audio Classification systems are used to classify the given audio into N outputs. Various models can accurately classify a given sound. But few can accurately classify a given Natural Language (Tamil) Voice, especially Tamil Vowels pronounced by Children with learning disabilities. Voice classification is done by recording the audio and converting it into digital data. The audio samples then undergo a feature extraction process to extra Mel Frequency Cepstral Coefficients. These coefficients are then used as input to the Deep Recurrent Neural Networks (DRNN) (LSTM) to accurately classify them into Tamil vowels. This paper focuses on a learning system (android app), with an on-device inference model developed using TensorFlow Lite, that records the children’s voice data, classifies them, and provides accuracy and training to improve their pronunciation.

Downloads

Download data is not yet available.

Downloads

Published

18-01-2022

How to Cite

[1]
N. K. Ravichandran, “Tamil Natural Language Voice Classification using Recurrent Neural Networks”, IJRESM, vol. 5, no. 1, pp. 79–82, Jan. 2022.

Issue

Section

Articles