Evaluating Open-Source Vision Language Models for Facial Emotion Recognition Against Traditional Deep Learning Models

Mulukutla, Vamsi Krishna and Pavarala, Sai Supriya and Rudraraju, Srinivasa Raju and Bonthu, Sridevi (2025) Evaluating Open-Source Vision Language Models for Facial Emotion Recognition Against Traditional Deep Learning Models. EAI Endorsed Transactions on AI and Robotics.

[thumbnail of 79233.pdf] PDF
79233.pdf

Download (1MB)

Abstract

Facial Emotion Recognition (FER) is crucial for applications such as human-computer interaction and mental health diagnostics. This study presents the first empirical comparison of open-source Vision-Language Models (VLMs), including Phi-3.5 Vision and CLIP, against traditional deep learning models—

Item Type: Article
Date Deposited: 04 Mar 2026 20:14
Last Modified: 10 Apr 2026 18:17
URI: http://eprints.eai.eu/id/eprint/59887

Actions (login required)

View Item
View Item