Ultrametric embedding: application to data fingerprinting and to fast data clustering. / Murtagh, Fionn.

Data Mining and Mathematical Programming: CRM Proceedings & Lecture Notes Vol. 45. ed. / Panos Pardalos; Pierre Hansen. American Mathematical Society, 2008. p. 199-209 (CRM Proceedings & Lecture Notes).

Research output: Chapter in Book/Report/Conference proceedingChapter

Published

Documents

  • pdf

    153 KB, PDF document

Abstract

We begin with pervasive ultrametricity due to high dimensionality and/or spatial sparsity. How extent or degree of ultrametricity can be quantified leads us to the discussion of varied practical cases when ultrametricity can be partially or locally present in data. We show how the ultrametricity can be assessed in text or document collections, and in time series signals. An aspect of importance here is that to draw benefit from this perspective the data may need to be recoded. Such data recoding can also be powerful in proximity searching, as we will show, where the data is embedded globally and not locally in an ultrametric space.
Original languageEnglish
Title of host publicationData Mining and Mathematical Programming
Subtitle of host publicationCRM Proceedings & Lecture Notes Vol. 45
EditorsPanos Pardalos, Pierre Hansen
PublisherAmerican Mathematical Society
Pages199-209
ISBN (Print)10: 0-8218-4485-7, 13: 978-0-8218-4485-4
Publication statusPublished - 2008

Publication series

NameCRM Proceedings & Lecture Notes
This open access research output is licenced under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License.

ID: 1053752