Reddy, M. Hemantheswar and Rishitha, K. and Raj, P. Bharath and Pandiri, D N Kiran and Srinivas, U. Thulasi (2025) Automatic Speech Grading using a Multimodal Deep Learning Framework using Bert and Whisper. In: Proceedings of the 4th International Conference on Information Technology, Civil Innovation, Science, and Management, ICITSM 2025, 28-29 April 2025, Tiruchengode, Tamil Nadu, India, Part I.
79390.pdf
Download (361kB)
Abstract
This paper proposes a Natural Language Processing (NLP-based) program of speech grading for not only the audio but also the video portion that quantitatively evaluates speech in terms of grammar, vocabulary, pronunciation, fluency and accuracy. These conventional speech evaluation methods are prone
| Item Type: | Conference or Workshop Item (UNSPECIFIED) |
|---|---|
| Date Deposited: | 04 Mar 2026 20:17 |
| Last Modified: | 16 Apr 2026 15:59 |
| URI: | http://eprints.eai.eu/id/eprint/60036 |
