Warning: pg_query(): Query failed: ERROR: missing chunk number 0 for toast value 29512337 in pg_toast_2619 in /dati/webiit-old/includes/database.pgsql.inc on line 138 Warning: ERROR: missing chunk number 0 for toast value 29512337 in pg_toast_2619 query: SELECT data, created, headers, expire, serialized FROM cache_page WHERE cid = 'https://www-old.iit.cnr.it/node/59368' in /dati/webiit-old/includes/database.pgsql.inc on line 159 Warning: pg_query(): Query failed: ERROR: missing chunk number 0 for toast value 29512337 in pg_toast_2619 in /dati/webiit-old/includes/database.pgsql.inc on line 138 Warning: ERROR: missing chunk number 0 for toast value 29512337 in pg_toast_2619 query: SELECT data, created, headers, expire, serialized FROM cache_page WHERE cid = 'https://www-old.iit.cnr.it/node/59368' in /dati/webiit-old/includes/database.pgsql.inc on line 159 Construction of the similarity matrix for the spectral clustering method: Numerical experiments | IIT - CNR - Istituto di Informatica e Telematica
IIT Home Page CNR Home Page

Construction of the similarity matrix for the spectral clustering method: Numerical experiments

Spectral clustering is a powerful method for finding structure in a dataset through the eigenvectors of a similarity matrix. It often outperforms traditional clustering algorithms such as k-means when the structure of the individual clusters is highly non-convex. Its accuracy depends on how the similarity between pairs of data points is defined. Two important items contribute to the construction of the similarity matrix: the sparsity of the underlying weighted graph, which depends mainly on the distances among datapoints, and the similarity function. When a Gaussian similarity function is used, the choice of the scale parameter σ can be critical. In this paper we examine both items, the sparsity and the selection of suitable σ’s, based either directly on the graph associated to the dataset or on the minimal spanning tree (MST) of the graph. An extensive numerical experimentation on artificial and real-world datasets has been carried out to compare the performances of the methods.


Journal of Computational and Applied Mathematics, 2020

Autori esterni: Grazia Lotti (Dip. di Matematica, Universita' di Parma), Ornella Menchi (Dip. di Informatica, Universita' di Pisa), Francesco Romani (Dip. di Informatica, Universita' di Pisa)
Autori IIT:

Tipo: Contributo in rivista ISI
Area di disciplina: Mathematics

File: 1-s2.0-S0377042720300868-JCAM.pdf

Attività: Metodi numerici per problemi di grandi dimensioni