Riccardo Paolini, Davide Femia, Alessandro D’Amico, Sfarzo El Husseini
The purpose of this paper is to provide guidelines for implementing a multimodal model that combines textual and audio features. Specifically, we focus on the differences between small and large language models, comparing their performance on a sentiment analysis task (emotion recognition on the IEMOCAP dataset). To highlight the advantages and disadvantages of each approach and to provide meaningful evidence of the differences between the two types of models, we implement and compare single-modality models (audio or text) against a bimodal model that integrates the best model for each modality, and finally analyze the effectiveness of classic fusion methods.