Multimodal news analytics using measures of cross-modal entity and context consistency

dc.bibliographicCitation.firstPage111eng
dc.bibliographicCitation.lastPage125eng
dc.bibliographicCitation.volume10eng
dc.contributor.authorMüller-Budack, Eric
dc.contributor.authorTheiner, Jonas
dc.contributor.authorDiering, Sebastian
dc.contributor.authorIdahl, Maximilian
dc.contributor.authorHakimov, Sherzod
dc.contributor.authorEwerth, Ralph
dc.date.accessioned2021-12-22T08:25:46Z
dc.date.available2021-12-22T08:25:46Z
dc.date.issued2021
dc.description.abstractThe World Wide Web has become a popular source to gather information and news. Multimodal information, e.g., supplement text with photographs, is typically used to convey the news more effectively or to attract attention. The photographs can be decorative, depict additional details, but might also contain misleading information. The quantification of the cross-modal consistency of entity representations can assist human assessors’ evaluation of the overall multimodal message. In some cases such measures might give hints to detect fake news, which is an increasingly important topic in today’s society. In this paper, we present a multimodal approach to quantify the entity coherence between image and text in real-world news. Named entity linking is applied to extract persons, locations, and events from news texts. Several measures are suggested to calculate the cross-modal similarity of the entities in text and photograph by exploiting state-of-the-art computer vision approaches. In contrast to previous work, our system automatically acquires example data from the Web and is applicable to real-world news. Moreover, an approach that quantifies contextual image-text relations is introduced. The feasibility is demonstrated on two datasets that cover different languages, topics, and domains.eng
dc.description.versionpublishedVersioneng
dc.identifier.urihttps://oa.tib.eu/renate/handle/123456789/7782
dc.identifier.urihttps://doi.org/10.34657/6829
dc.language.isoengeng
dc.publisherLondon : Springereng
dc.relation.doihttps://doi.org/10.1007/s13735-021-00207-4
dc.relation.essn2192-662X
dc.relation.ispartofseriesInternational journal of multimedia information retrieval 10 (2021)eng
dc.relation.issn2192-6611
dc.rights.licenseCC BY 4.0 Unportedeng
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/eng
dc.subjectCross-modal consistencyeng
dc.subjectNews analyticseng
dc.subjectImage-text relationseng
dc.subjectImage repurposing detectioneng
dc.subject.ddc004eng
dc.subject.ddc660eng
dc.subject.ddc020eng
dc.titleMultimodal news analytics using measures of cross-modal entity and context consistencyeng
dc.typearticleeng
dc.typeTexteng
dcterms.bibliographicCitation.journalTitleInternational Journal of Multimedia Information Retrievaleng
tib.accessRightsopenAccesseng
wgl.contributorTIBeng
wgl.subjectInformatikeng
wgl.typeZeitschriftenartikeleng
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Müller-Budack2021_Article_MultimodalNewsAnalyticsUsingMe.pdf
Size:
2.2 MB
Format:
Adobe Portable Document Format
Description: