A Fair and Comprehensive Comparison of Multimodal Tweet Sentiment Analysis Methods

Opinion and sentiment analysis is a vital task for characterizing subjective information in social media posts. In this paper, we present a comprehensive experimental evaluation and comparison of six state-of-the-art methods, one of which we have re-implemented. In addition, we investigate different textual and visual feature embeddings that cover different aspects of the content, as well as the recently introduced multimodal CLIP embeddings. Experimental results are presented for two publicly available benchmark datasets of tweets and corresponding images. In contrast to the evaluation methodology of previous work, we introduce a reproducible and fair evaluation scheme to make results comparable. Finally, we conduct an error analysis to outline the limitations of the methods and possibilities for future work.

Keywords

Multimodal Sentiment Analysis, Information Retrieval, Social Media, Computer Vision, Natural Language Processing, Transformer Models

Conference

Workshop on Multi-Modal Pre-Training for Multimedia Understanding (MMPT 2021), November 16-19, 2021, Taipei, Taiwan

Publication Type

ConferenceObject

Version

acceptedVersion

URI

https://oa.tib.eu/renate/handle/123456789/7789
https://doi.org/10.34657/6836

Collections

Informationswissenschaften

License

CC BY 4.0 Unported

https://creativecommons.org/licenses/by/4.0/
