Mulukutla, Vamsi Krishna and Pavarala, Sai Supriya and Rudraraju, Srinivasa Raju and Bonthu, Sridevi (2025) Evaluating Open-Source Vision Language Models for Facial Emotion Recognition Against Traditional Deep Learning Models. EAI Endorsed Transactions on AI and Robotics.
Abstract
Facial Emotion Recognition (FER) is crucial for applications such as human-computer interaction and mental health diagnostics. This study presents the first empirical comparison of open-source Vision-Language Models (VLMs), including Phi-3.5 Vision and CLIP, against traditional deep learning models—
| Item Type: | Article |
|---|---|
| Date Deposited: | 04 Mar 2026 20:14 |
| Last Modified: | 10 Apr 2026 18:17 |
| URI: | http://eprints.eai.eu/id/eprint/59887 |
