Ultrametric embedding: application to data fingerprinting and to fast data clustering

Fionn Murtagh

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Abstract

We begin with the pervasive ultrametricity that arises from high dimensionality and/or spatial sparsity. Quantifying the extent or degree of ultrametricity leads us to a discussion of varied practical cases in which ultrametricity is partially or locally present in data. We show how ultrametricity can be assessed in text or document collections and in time series signals. An important aspect here is that, to draw benefit from this perspective, the data may need to be recoded. Such data recoding can also be powerful in proximity searching, as we will show, where the data is embedded globally and not locally in an ultrametric space.
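As a rough illustration of what quantifying the degree of ultrametricity can mean, the sketch below samples triangles from a point cloud and counts how many satisfy the strong triangle inequality d(x, z) ≤ max(d(x, y), d(y, z)) up to a tolerance, which is equivalent to the two longest sides being nearly equal. This is not the coefficient defined in the chapter. The function names, the 5% relative tolerance, the uniform random data, and the sample sizes are all assumptions made for the example.

```python
import math
import random


def euclidean(p, q):
    """Euclidean distance between two equal-length coordinate lists."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def ultrametric_fraction(points, n_triplets=300, rel_tol=0.05, seed=0):
    """Estimate the share of sampled triangles that are approximately ultrametric.

    A triangle respects the strong triangle inequality exactly when its two
    longest sides are equal (isosceles with a small base, or equilateral).
    Here "equal" is relaxed to "within rel_tol of the longest side"; that
    tolerance is an assumption of this sketch.
    """
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_triplets):
        x, y, z = rng.sample(points, 3)
        sides = sorted([euclidean(x, y), euclidean(y, z), euclidean(x, z)])
        if sides[2] - sides[1] <= rel_tol * sides[2]:
            hits += 1
    return hits / n_triplets


def random_cloud(n_points, dim, seed=0):
    """Hypothetical test data: points drawn uniformly from the unit hypercube."""
    rng = random.Random(seed)
    return [[rng.random() for _ in range(dim)] for _ in range(n_points)]


# Compare a low-dimensional cloud with a high-dimensional one.
low = ultrametric_fraction(random_cloud(50, 2))
high = ultrametric_fraction(random_cloud(50, 1000))
print(f"dim=2:    {low:.2f}")
print(f"dim=1000: {high:.2f}")
```

Under these assumptions the high-dimensional cloud should yield a noticeably larger fraction than the two-dimensional one, because pairwise distances concentrate as the dimension grows and sampled triangles become close to equilateral.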
Original language: English
Title of host publication: Data Mining and Mathematical Programming
Subtitle of host publication: CRM Proceedings & Lecture Notes Vol. 45
Editors: Panos Pardalos, Pierre Hansen
Publisher: American Mathematical Society
Pages: 199-209
ISBN (Print): 10: 0-8218-4485-7, 13: 978-0-8218-4485-4
Publication status: Published - 2008

Publication series

Name: CRM Proceedings & Lecture Notes

Keywords

  • math.ST
  • stat.TH
  • 62H30; 68P30; 68P20
