Clustering in massive data sets. / Murtagh, Fionn; Abello, J (Editor); Pardalos, P M (Editor); Resende, M G C (Editor).

Handbook of Massive Data Sets. Norwell, MA, USA : Kluwer, 2002. p. 401-545.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

  • Fionn Murtagh
  • J Abello (Editor)
  • P M Pardalos (Editor)
  • M G C Resende (Editor)


We review the time and storage costs of search and clustering algorithms. We exemplify these, based on case-studies in astronomy, information retrieval, visual user interfaces, chemical databases, and other areas. Theoretical results developed as far back as the 1960s still very often remain topical. More recent work is also covered in this article. This includes a solution for the statistical question of how many clusters there are in a dataset. We also look at one line of inquiry in the use of clustering for human-computer user interfaces. Finally, the visualization of data leads to the consideration of data arrays as images, and we speculate on future results to be expected here.
Original language: English
Title of host publication: Handbook of Massive Data Sets
Place of Publication: Norwell, MA, USA
ISBN (Print): 1 4020 0489 3
Publication status: Published - 2002
This open access research output is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License.

ID: 890777