Automatic Speech Grading using a Multimodal Deep Learning Framework using Bert and Whisper

Reddy, M. Hemantheswar and Rishitha, K. and Raj, P. Bharath and Pandiri, D N Kiran and Srinivas, U. Thulasi (2025) Automatic Speech Grading using a Multimodal Deep Learning Framework using Bert and Whisper. In: Proceedings of the 4th International Conference on Information Technology, Civil Innovation, Science, and Management, ICITSM 2025, 28-29 April 2025, Tiruchengode, Tamil Nadu, India, Part I.

[thumbnail of 79390.pdf] PDF
79390.pdf

Download (361kB)

Abstract

This paper proposes a Natural Language Processing (NLP-based) program of speech grading for not only the audio but also the video portion that quantitatively evaluates speech in terms of grammar, vocabulary, pronunciation, fluency and accuracy. These conventional speech evaluation methods are prone

Item Type: Conference or Workshop Item (UNSPECIFIED)
Date Deposited: 04 Mar 2026 20:17
Last Modified: 16 Apr 2026 15:59
URI: http://eprints.eai.eu/id/eprint/60036

Actions (login required)

View Item
View Item