Data clustering is an unsupervised process that divides a set of objects into homogeneous groups. Since the publication of the first edition of this book, development in the area has exploded, especially in clustering algorithms for big data and open-source software for cluster analysis. This second edition reflects these new developments.