Discriminative multimodal learning via conditional priors in generative models

Andrade Mancisidor, Rogelio; Kampffmeyer, Michael Christian; Aas, Kjersti; Jenssen, Robert

dc.contributor.author	Andrade Mancisidor, Rogelio
dc.contributor.author	Kampffmeyer, Michael Christian
dc.contributor.author	Aas, Kjersti
dc.contributor.author	Jenssen, Robert
dc.date.accessioned	2024-01-05T06:46:50Z
dc.date.available	2024-01-05T06:46:50Z
dc.date.created	2023-12-15T13:34:07Z
dc.date.issued	2023
dc.identifier.citation	Neural Networks. 2023, 169 .	en_US
dc.identifier.issn	0893-6080
dc.identifier.uri	https://hdl.handle.net/11250/3109965
dc.description.abstract	Deep generative models with latent variables have been used lately to learn joint representations and generative processes from multi-modal data, which depict an object from different viewpoints. These two learning mechanisms can, however, conflict with each other and representations can fail to embed information on the data modalities. This research studies the realistic scenario in which all modalities and class labels are available for model training, e.g. images or handwriting, but where some modalities and labels required for downstream tasks are missing, e.g. text or annotations. We show, in this scenario, that the variational lower bound limits mutual information between joint representations and missing modalities. We, to counteract these problems, introduce a novel conditional multi-modal discriminative model that uses an informative prior distribution and optimizes a likelihood-free objective function that maximizes mutual information between joint representations and missing modalities. Extensive experimentation demonstrates the benefits of our proposed model, empirical results show that our model achieves state-of-the-art results in representative problems such as downstream classification, acoustic inversion, and image and annotation generation.
dc.language.iso	eng	en_US
dc.title	Discriminative multimodal learning via conditional priors in generative models	en_US
dc.title.alternative	Discriminative multimodal learning via conditional priors in generative models	en_US
dc.type	Journal article	en_US
dc.type	Peer reviewed	en_US
dc.description.version	publishedVersion
cristin.ispublished	true
cristin.fulltext	original
cristin.qualitycode	2
dc.identifier.doi	10.1016/j.neunet.2023.10.048
dc.identifier.cristin	2214165
dc.source.journal	Neural Networks	en_US
dc.source.volume	169	en_US
dc.source.pagenumber	14	en_US

Tilhørende fil(er)

Filnavn:: 1-s2.0-S089360802300610X-main.pdf
Størrelse:: 2.431Mb
Format:: PDF

Åpne

Denne innførselen finnes i følgende samling(er)

Publikasjoner fra Cristin [284]
Vitenskapelige tidsskriftartikler og konferanseartikler med fagfellevurdering (NVI-kategori) [215]
Vitenskapelige tidsskriftartikler og konferanseartikler med fagfellevurdering (NVI-kategori)

Vis enkel innførsel