Making sense of multiple distance matrices through common and distinct components

Solberg, Lars Erik; Dahl, Tobias Gulden; Næs, Tormod

dc.contributor.author	Solberg, Lars Erik
dc.contributor.author	Dahl, Tobias Gulden
dc.contributor.author	Næs, Tormod
dc.date.accessioned	2022-04-29T12:46:46Z
dc.date.available	2022-04-29T12:46:46Z
dc.date.created	2021-11-17T13:53:44Z
dc.date.issued	2021
dc.identifier.citation	Journal of Chemometrics. 2021, 35 (11), e3372.	en_US
dc.identifier.issn	0886-9383
dc.identifier.uri	https://hdl.handle.net/11250/2993444
dc.description.abstract	Multiblock analysis attacks the problem of how to combine data from various data sources for purposes such as prediction, classification, clustering, or visual data analysis. A key concept is the distinction between “common” and “distinct” parts, that is, what information repeats itself across the blocks and what is unique to an individual block. The statistical field of multiblock analysis holds many different approaches, which leads to different treatments both of the terms distinct and common themselves and to differences in the numerical results. In this article, we extend the discussion of distinct and common in multiblock analysis to the domain of distance matrices, that is, the situation where data point sets, so-called configurations, are analyzed via relative distances either because configurations are not available directly or because a distance representation is favorable. Situations typical for chemometrics will be highlighted and illustrated in examples. When analyzing different methods, we have focused on three key aspects. First, during the transition from the distance to configuration domains, one needs to consider how multiple distance matrices are treated. Second, when extracting common and distinct parts, one needs to manage a tradeoff between explaining variance and ensuring similarity between subspaces. Third, there is a design choice to be made as to whether the subspace containing the common parts is “shared” between blocks or if separate subspaces are associated with each individual block. The three aspects help to categorize and explain well-known methods in the field. A selection of methods was analyzed and subsequently applied to examples.	en_US
dc.language.iso	eng	en_US
dc.publisher	Wiley	en_US
dc.rights	Navngivelse 4.0 Internasjonal	*
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/deed.no	*
dc.subject	Common	en_US
dc.subject	Consensus	en_US
dc.subject	Distances	en_US
dc.subject	Distinct	en_US
dc.subject	Multiblock	en_US
dc.subject	Multidimensional scaling	en_US
dc.title	Making sense of multiple distance matrices through common and distinct components	en_US
dc.type	Peer reviewed	en_US
dc.type	Journal article	en_US
dc.description.version	publishedVersion	en_US
dc.rights.holder	© 2021 The Authors. Journal of Chemometrics published by John Wiley & Sons Ltd. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.	en_US
dc.source.volume	35	en_US
dc.source.journal	Journal of Chemometrics	en_US
dc.source.issue	11	en_US
dc.identifier.doi	10.1002/cem.3372
dc.identifier.cristin	1955569
dc.relation.project	Nofima AS: 201702	en_US
dc.relation.project	Norges forskningsråd: 262308	en_US
dc.source.articlenumber	e3372	en_US
cristin.ispublished	true
cristin.fulltext	original
cristin.qualitycode	1

Files in this item

Name:: Journal+of+Chemometrics+-+2021 ...
Size:: 8.398Mb
Format:: PDF
Description:: Article

View/Open

This item appears in the following Collection(s)

Publikasjoner fra CRIStin - SINTEF AS [5673]
SINTEF Digital [2415]

Show simple item record

Except where otherwise noted, this item's license is described as Navngivelse 4.0 Internasjonal