Similarity measures for multidimensional data

How similar are two data-cubes? In other words, the question under consideration is: given two sets of points in a multidimensional hierarchical space, what is the distance value between them? In this paper we explore various distance functions that can be used over multidimensional hierarchical spa...

Full description

Saved in:
Bibliographic Details
Published in:2011 IEEE 27th International Conference on Data Engineering pp. 171 - 182
Main Authors: Baikousi, E, Rogkakos, G, Vassiliadis, P
Format: Conference Proceeding
Language:English
Published: IEEE 01-04-2011
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract How similar are two data-cubes? In other words, the question under consideration is: given two sets of points in a multidimensional hierarchical space, what is the distance value between them? In this paper we explore various distance functions that can be used over multidimensional hierarchical spaces. We organize the discussed functions with respect to the properties of the dimension hierarchies, levels and values. In order to discover which distance functions are more suitable and meaningful to the users, we conducted two user study analysis. The first user study analysis concerns the most preferred distance function between two values of a dimension. The findings of this user study indicate that the functions that seem to fit better the user needs are characterized by the tendency to consider as closest to a point in a multidimensional space, points with the smallest shortest path with respect to the same dimension hierarchy. The second user study aimed in discovering which distance function between two data cubes, is mostly preferred by users. The two functions that drew the attention of users where (a) the summation of distances between every cell of a cube with the most similar cell of another cube and (b) the Hausdorff distance function. Overall, the former function was preferred by users than the latter; however the individual scores of the tests indicate that this advantage is rather narrow.
AbstractList How similar are two data-cubes? In other words, the question under consideration is: given two sets of points in a multidimensional hierarchical space, what is the distance value between them? In this paper we explore various distance functions that can be used over multidimensional hierarchical spaces. We organize the discussed functions with respect to the properties of the dimension hierarchies, levels and values. In order to discover which distance functions are more suitable and meaningful to the users, we conducted two user study analysis. The first user study analysis concerns the most preferred distance function between two values of a dimension. The findings of this user study indicate that the functions that seem to fit better the user needs are characterized by the tendency to consider as closest to a point in a multidimensional space, points with the smallest shortest path with respect to the same dimension hierarchy. The second user study aimed in discovering which distance function between two data cubes, is mostly preferred by users. The two functions that drew the attention of users where (a) the summation of distances between every cell of a cube with the most similar cell of another cube and (b) the Hausdorff distance function. Overall, the former function was preferred by users than the latter; however the individual scores of the tests indicate that this advantage is rather narrow.
Author Baikousi, E
Vassiliadis, P
Rogkakos, G
Author_xml – sequence: 1
  givenname: E
  surname: Baikousi
  fullname: Baikousi, E
  email: ebaikou@cs.uoi.gr
  organization: Dept. of Comput. Sci., Univ. of Ioannina, Ioannina, Greece
– sequence: 2
  givenname: G
  surname: Rogkakos
  fullname: Rogkakos, G
  email: grogkako@cs.uoi.gr
  organization: Dept. of Comput. Sci., Univ. of Ioannina, Ioannina, Greece
– sequence: 3
  givenname: P
  surname: Vassiliadis
  fullname: Vassiliadis, P
  email: pvassil@cs.uoi.gr
  organization: Dept. of Comput. Sci., Univ. of Ioannina, Ioannina, Greece
BookMark eNotj81KxDAURq86gp2xDyBu-gKpuUlvfpZSRx0YcKGCuyFtE4j0R5rOYt5ewfk2Z3HgwLeG1TiNHuAOeYnI7cOuftqWgiOWpJU2yl7AGitRVcYqjpeQCamJcaG-riC32pwdWbWCDLmSTEkjbiBP6Zv_zVaIxDNg73GIvZvjcioG79Jx9qkI01wMx36JXRz8mOI0ur7o3OJu4Tq4Pvn8zA18Pm8_6le2f3vZ1Y97FlHTwnQnDDlbCU1BC7RBBMNb3xKh7QIPMgRyDhuU3vPGyAZJcNMYkgZN61Fu4P6_G733h585Dm4-Hc7P5S_KdElQ
ContentType Conference Proceeding
DBID 6IE
6IH
CBEJK
RIE
RIO
DOI 10.1109/ICDE.2011.5767869
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan (POP) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library Online
IEEE Proceedings Order Plans (POP) 1998-present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library Online
  url: http://ieeexplore.ieee.org/Xplore/DynWel.jsp
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 1424489601
142448958X
9781424489589
9781424489602
EISSN 2375-026X
EndPage 182
ExternalDocumentID 5767869
Genre orig-research
GroupedDBID 6IE
6IH
CBEJK
RIE
RIO
ID FETCH-LOGICAL-i175t-7d285a94275f7219f2f80cec5519df0f3ff5aa1b13ee0b83b15208b853818ce13
IEDL.DBID RIE
ISBN 9781424489596
1424489598
ISSN 1063-6382
IngestDate Wed Jun 26 19:20:01 EDT 2024
IsPeerReviewed false
IsScholarly true
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i175t-7d285a94275f7219f2f80cec5519df0f3ff5aa1b13ee0b83b15208b853818ce13
PageCount 12
ParticipantIDs ieee_primary_5767869
PublicationCentury 2000
PublicationDate 2011-April
PublicationDateYYYYMMDD 2011-04-01
PublicationDate_xml – month: 04
  year: 2011
  text: 2011-April
PublicationDecade 2010
PublicationTitle 2011 IEEE 27th International Conference on Data Engineering
PublicationTitleAbbrev ICDE
PublicationYear 2011
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0000941150
ssj0000669055
Score 1.8992841
Snippet How similar are two data-cubes? In other words, the question under consideration is: given two sets of points in a multidimensional hierarchical space, what is...
SourceID ieee
SourceType Publisher
StartPage 171
SubjectTerms Cities and towns
Computer science
Europe
Lattices
Taxonomy
USA Councils
Weight measurement
Title Similarity measures for multidimensional data
URI https://ieeexplore.ieee.org/document/5767869
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV07T8MwED7RTkwFWsRbHhgxOLYT23MfKgtCKkhsVRzbUpHaIkr_P-fYLUJiYUuyxI5jf_fd4zuAW8YtsmTBqXe8pNIESY2oHG48VdsglXZNjOhOZ-rpTY_GUSbnbl8L471vk8_8fbxsY_lu3WyjqwzJOx6tlelARxmdarX2_hSETsNyjeV7ypiLxk4b7KwExd-M7-q6tCmN3sk95fsqRzwLZh4eh6NxEvfML_zVeaUFnknvf0M-gsFPBR953mPTMRz41Qn0di0cSN7RfaCzxXKB5BZtcbJM7sINQTuWtImGLkr_J9kOElNJB_A6Gb8MpzR3UKALNAu-qHJcl7WRXJUBqZ4JPGjW-AbNJOMCCyKEsq4LWwjvmdXCIpozbRHCEccbX4hT6K7WK38GpLFBBe2FYkUtZZA1j1wLP7HxrpQunEM_zn7-kUQy5nniF38_voTD5JyNKTBX0P363Ppr6Gzc9qZd1m9KRpou
link.rule.ids 310,311,782,786,791,792,798,27935,54769
linkProvider IEEE
linkToHtml http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV07T8MwED7RMsBUoEW88cCIwbGd2J77UCtKhdQisVVJbEtF6kO0_f-ck7QIiYUtyRI7jv3dd4_vAB4Yz5AlC06d5TGVxktqRGJx46k081Jpm4eIbn-sRh-60w0yOY_7WhjnXJF85p7CZRHLt8t8G1xlSN7xaE1MDQ5jqRJTVmvtPSoInoZVVZafZc5cMHeKcGciKP5ofFfZpU1s9E7wqbpPqphnxMzzoN3plvKe1St_9V4poKfX-N-gT6D1U8NH3vbodAoHbnEGjV0TB1Lt6SbQ8Ww-Q3qL1jiZlw7DNUFLlhSphjaI_5fCHSQkk7bgvdedtPu06qFAZ2gYbKiyXMepkVzFHsme8dxrlrscDSVjPfPC-zhNoywSzrFMiwzxnOkMQRyRPHeROIf6YrlwF0DyzCuvnVAsSqX0MuWBbeEnNs7G0vpLaIbZT1elTMa0mvjV34_v4ag_eR1Oh4PRyzUcl67akBBzA_XN19bdQm1tt3fFEn8Dp5udgQ
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2011+IEEE+27th+International+Conference+on+Data+Engineering&rft.atitle=Similarity+measures+for+multidimensional+data&rft.au=Baikousi%2C+E&rft.au=Rogkakos%2C+G&rft.au=Vassiliadis%2C+P&rft.date=2011-04-01&rft.pub=IEEE&rft.isbn=9781424489596&rft.issn=1063-6382&rft.eissn=2375-026X&rft.spage=171&rft.epage=182&rft_id=info:doi/10.1109%2FICDE.2011.5767869&rft.externalDocID=5767869
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1063-6382&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1063-6382&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1063-6382&client=summon