Clustering constrained symbolic data

Dealing with multi-valued data has become quite common in both the framework of databases as well as data analysis. Such data can be constrained by domain knowledge provided by relations between the variables and these relations are expressed by rules. However, such knowledge can introduce a combina...

Full description

Saved in:
Bibliographic Details
Published in:Pattern recognition letters Vol. 30; no. 11; pp. 1037 - 1045
Main Authors: de Carvalho, Francisco de A.T., Csernel, Marc, Lechevallier, Yves
Format: Journal Article
Language:English
Published: Amsterdam Elsevier B.V 01-08-2009
Elsevier
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Dealing with multi-valued data has become quite common in both the framework of databases as well as data analysis. Such data can be constrained by domain knowledge provided by relations between the variables and these relations are expressed by rules. However, such knowledge can introduce a combinatorial increase in the computation time depending on the number of rules. In this paper, we present a way to cluster such data in polynomial time. The method is based on the following: a decomposition of the data according to the rules, a suitable dissimilarity function and a clustering algorithm based on dissimilarities.
ISSN:0167-8655
1872-7344
DOI:10.1016/j.patrec.2009.04.009