Understanding image-text relations and news values for multimodal news analysis

dc.bibliographicCitation.articleNumber: 1125533
dc.bibliographicCitation.journalTitle: Frontiers in Artificial Intelligence
dc.bibliographicCitation.volume: 6
dc.contributor.author: Cheema, Gullal S.
dc.contributor.author: Hakimov, Sherzod
dc.contributor.author: Müller-Budack, Eric
dc.contributor.author: Otto, Christian
dc.contributor.author: Bateman, John A.
dc.contributor.author: Ewerth, Ralph
dc.date.accessioned: 2024-03-06T07:21:41Z
dc.date.available: 2024-03-06T07:21:41Z
dc.date.issued: 2023
dc.description.abstract: The analysis of news dissemination is of utmost importance, since the credibility of information and the identification of disinformation and misinformation affect society as a whole. Given the large amounts of news data published daily on the Web, both the empirical analysis of news with regard to research questions and the detection of problematic news content require computational methods that work at scale. Today's online news is typically disseminated in a multimodal form, including various presentation modalities such as text, image, audio, and video. Recent developments in multimodal machine learning now make it possible to capture basic "descriptive" relations between modalities, such as correspondences between words and phrases on the one hand and visual depictions of the verbally expressed information on the other. Although such advances have enabled tremendous progress in tasks like image captioning, text-to-image generation, and visual question answering, in domains such as news dissemination there is a need to go further. In this paper, we introduce a novel framework for the computational analysis of multimodal news. We motivate a set of more complex image-text relations as well as multimodal news values based on real examples of news reports, and consider their realization by computational approaches. To this end, we provide (a) an overview of existing literature from semiotics, where detailed proposals have been made for taxonomies covering diverse image-text relations generalisable to any domain; (b) an overview of computational work that derives models of image-text relations from data; and (c) an overview of a particular class of news-centric attributes developed in journalism studies, called news values. The result is a novel framework for multimodal news analysis that closes gaps in previous work while maintaining and combining the strengths of those accounts. We assess and discuss the elements of the framework with real-world examples and use cases, setting out research directions at the intersection of multimodal learning, multimodal analytics, and computational social sciences that can benefit from our approach.
dc.description.fonds: TIB_Fonds
dc.description.version: publishedVersion
dc.identifier.uri: https://oa.tib.eu/renate/handle/123456789/14495
dc.identifier.uri: https://doi.org/10.34657/13526
dc.language.iso: eng
dc.publisher: Lausanne : Frontiers Media
dc.relation.doi: https://doi.org/10.3389/frai.2023.1125533
dc.relation.essn: 2624-8212
dc.rights.license: CC BY 4.0
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/
dc.subject.ddc: 004
dc.subject.other: computational analytics
dc.subject.other: image-text relations
dc.subject.other: journalism
dc.subject.other: machine learning
dc.subject.other: multimodality
dc.subject.other: news analysis
dc.subject.other: news values
dc.subject.other: semiotics
dc.title: Understanding image-text relations and news values for multimodal news analysis
dc.type: Article
dc.type: Text
tib.accessRights: openAccess
wgl.contributor: TIB
wgl.subject: Informatik
wgl.type: Zeitschriftenartikel
Files
Original bundle
Name: 5frai-06-1125533.pdf
Size: 9.73 MB
Format: Adobe Portable Document Format