Vis enkel innførsel

dc.contributor.authorLandsverk, Marius
dc.contributor.authorRiemer-Sørensen, Signe
dc.date.accessioned2023-03-15T13:27:43Z
dc.date.available2023-03-15T13:27:43Z
dc.date.created2023-03-09T20:39:31Z
dc.date.issued2022
dc.identifier.citationProceedings of the Northern Lights Deep Learning Workshop. 2022, 3.en_US
dc.identifier.urihttps://hdl.handle.net/11250/3058478
dc.description.abstractMeasuring model performance is a key issue for deep learning practitioners. However, we often lack the ability to explain why a specific architecture attains superior predictive accuracy for a given data set. Often, validation accuracy is used as a performance heuristic quantifying how well a network generalize to unseen data. Mutual information can be used as a measure of the quality of internal representations in deep learning models, and the information plane provide insights into whether the model exploits the available information in data. The information plane has previously been explored for fully connected neural networks and convolutional architectures. We present an architecture-agnostic method for tracking a network's internal representations during training, which are then used to create the mutual information plane. The method is exemplified for a graph convolutional neural network fitted on the Cora citation data. We compare how the inductive bias introduced in the graph convolutional architecture changes the mutual information plane relative to a fully connected neural network.en_US
dc.language.isoengen_US
dc.publisherSeptentrio Academic Publishingen_US
dc.rightsNavngivelse 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/deed.no*
dc.titleMutual information estimation for graph convolutional neural networksen_US
dc.typePeer revieweden_US
dc.typeJournal articleen_US
dc.description.versionpublishedVersionen_US
dc.rights.holder© The author(s).en_US
dc.source.volume3en_US
dc.source.journalProceedings of the Northern Lights Deep Learning Workshopen_US
dc.identifier.doi10.7557/18.6257
dc.identifier.doi10.7557/18.6257
dc.identifier.cristin2132882
dc.relation.projectNorges forskningsråd: 294544en_US
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.qualitycode1


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Navngivelse 4.0 Internasjonal
Med mindre annet er angitt, så er denne innførselen lisensiert som Navngivelse 4.0 Internasjonal