Vis enkel innførsel

dc.contributor.authorAndrade Mancisidor, Rogelio
dc.contributor.authorKampffmeyer, Michael Christian
dc.contributor.authorAas, Kjersti
dc.contributor.authorJenssen, Robert
dc.date.accessioned2024-01-05T06:46:50Z
dc.date.available2024-01-05T06:46:50Z
dc.date.created2023-12-15T13:34:07Z
dc.date.issued2023
dc.identifier.citationNeural Networks. 2023, 169 .en_US
dc.identifier.issn0893-6080
dc.identifier.urihttps://hdl.handle.net/11250/3109965
dc.description.abstractDeep generative models with latent variables have been used lately to learn joint representations and generative processes from multi-modal data, which depict an object from different viewpoints. These two learning mechanisms can, however, conflict with each other and representations can fail to embed information on the data modalities. This research studies the realistic scenario in which all modalities and class labels are available for model training, e.g. images or handwriting, but where some modalities and labels required for downstream tasks are missing, e.g. text or annotations. We show, in this scenario, that the variational lower bound limits mutual information between joint representations and missing modalities. We, to counteract these problems, introduce a novel conditional multi-modal discriminative model that uses an informative prior distribution and optimizes a likelihood-free objective function that maximizes mutual information between joint representations and missing modalities. Extensive experimentation demonstrates the benefits of our proposed model, empirical results show that our model achieves state-of-the-art results in representative problems such as downstream classification, acoustic inversion, and image and annotation generation.
dc.language.isoengen_US
dc.titleDiscriminative multimodal learning via conditional priors in generative modelsen_US
dc.title.alternativeDiscriminative multimodal learning via conditional priors in generative modelsen_US
dc.typeJournal articleen_US
dc.typePeer revieweden_US
dc.description.versionpublishedVersion
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.qualitycode2
dc.identifier.doi10.1016/j.neunet.2023.10.048
dc.identifier.cristin2214165
dc.source.journalNeural Networksen_US
dc.source.volume169en_US
dc.source.pagenumber14en_US


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel