Grouping Multidimensional Data: Recent Advances in Clustering

Front Cover
Taylor & Francis, 2006 M02 10 - 268 pages

Clustering is one of the most fundamental and essential data analysis techniques. Clustering can be used as an independent data mining task to discern intrinsic characteristics of data, or as a preprocessing step with the clustering results then used for classification, correlation analysis, or anomaly detection.

Kogan and his co-editors have put together recent advances in clustering large and high-dimension data. Their volume addresses new topics and methods which are central to modern data analysis, with particular emphasis on linear algebra tools, opimization methods and statistical techniques. The contributions, written by leading researchers from both academia and industry, cover theoretical basics as well as application and evaluation of algorithms, and thus provide an excellent state-of-the-art overview.

The level of detail, the breadth of coverage, and the comprehensive bibliography make this book a perfect fit for researchers and graduate students in data mining and in many other important related application areas.

From inside the book

Contents

The Star Clustering Algorithm for Information Organization
1
A Survey of Clustering Data Mining Techniques
25
A Comparative Study
73
Clustering Very Large Data Sets
97
Clustering with EntropyLike kMeans Algorithms
127
Sampling Methods for Building Initial Partitions
161
A MATLAB Toolbox for Generating TermDocument
187
Criterion Functions for Clustering on HighDimensional Data
211
References
239
Index
265
Copyright

Other editions - View all

Common terms and phrases

Popular passages

Page 262 - In Proceedings of the 37th Annual Allerton Conference on Communication, Control and Computing, pages 368-377, 1999.
Page 260 - A. Silberschatz and A. Tuzhilin. What makes patterns interesting in Knowledge discovery systems.
Page 253 - Concept Indexing: A Fast Dimensionality Reduction Algorithm with Applications to Document Retrieval & Categorization.

Bibliographic information