with applications in r - university of washington · application of modern model-based clustering...

1
For more information, and to order, visit: www.cambridge.org/9781108494205 and enter the code MCCDS2019 at the checkout Model-Based Clustering and Classification for Data Science With Applications in R Cluster analysis finds groups in data automatically. Most methods have been heuristic and leave open such central questions as: how many clusters are there? Which method should I use? How should I handle outliers? Classification assigns new observations to groups given previously classified observations, and also has open questions about parameter tuning, robustness and uncertainty assessment. This book frames cluster analysis and classification in terms of statistical models, thus yielding principled estimation, testing and prediction methods, and sound answers to the central questions. It builds the basic ideas in an accessible but rigorous way, with extensive data examples and R code; describes modern approaches to high-dimensional data and networks; and explains such recent advances as Bayesian regularization, non-Gaussian model-based clustering, cluster merging, variable selection, semi-supervised and robust classification, clustering of functional data, text and images, and co-clustering. Written for advanced undergraduates in data science, as well as researchers and practitioners, it assumes basic knowledge of multivariate calculus, linear algebra, probability and statistics. on this title 31 May 2020 Expires Université Côte d’Azur Charles Bouveyron Inria Saclay Île-de-France Gilles Celeux University College Dublin T. Brendan Murphy University of Washington Adrian E. Raftery 1. Introduction; 2. Model-based clustering: basic ideas; 3. Dealing with difficulties; 4. Model- based classification; 5. Semi-supervised clustering and classification; 6. Discrete data clustering; 7. Variable selection; 8. High-dimensional data; 9. Non-Gaussian model-based clustering; 10. Network data; 11. Model-based clustering with covariates; 12. Other topics; List of R packages; Bibliography; Index. 20% Discount £59.99 £47.99 Discount price Original price $79.99 $63.99 Hardback 978-1-108-49420-5 'Bouveyron, Celeux, Murphy, and Raftery pioneered the theory, computation, and application of modern model-based clustering and discriminant analysis. Here they have produced an exhaustive yet accessible text, covering both the field's state of the art as well as its intellectual development. The authors develop a unified vision of cluster analysis, rooted in the theory and computation of mixture models. Embedded R code points the way for applied readers, while graphical displays develop intuition about both model construction and the critical but often-neglected estimation process. Building on a series of running examples, the authors gradually and methodically extend their core insights into a variety of exciting data structures, including networks and functional data. This text will serve as a backbone for graduate study as well as an important reference for applied data scientists interested in working with cutting-edge tools in semi- and unsupervised machine learning.' John S. Ahlquist, University of California, San Diego August 2019 253 x 177 mm c.446pp 40 b/w illus. 171 colour illus. 48 tables

Upload: others

Post on 07-Jun-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: With Applications in R - University of Washington · application of modern model-based clustering and discriminant analysis. Here they have produced an exhaustive yet accessible text,

For more information, and to order, visit: www.cambridge.org/9781108494205 and enter the code MCCDS2019 at the checkout

Model-Based Clustering and Classification for Data Science With Applications in R

Cluster analysis finds groups in data automatically. Most methods have been heuristic and leave open such central questions as: how many clusters are there? Which method should I use? How should I handle outliers? Classification assigns new observations to groups given previously classified observations, and also has open questions about parameter tuning, robustness and uncertainty assessment. This book frames cluster analysis and classification in terms of statistical models, thus yielding principled estimation, testing and prediction methods, and sound answers to the central questions. It builds the basic ideas in an accessible but rigorous way, with extensive data examples and R code; describes modern approaches to high-dimensional data and networks; and explains such recent advances as Bayesian regularization, non-Gaussian model-based clustering, cluster merging, variable selection, semi-supervised and robust classification, clustering of functional data, text and images, and co-clustering. Written for advanced undergraduates in data science, as well as researchers and practitioners, it assumes basic knowledge of multivariate calculus, linear algebra, probability and statistics.

on this title31 May 2020Expires

Université Côte d’AzurCharles Bouveyron

Inria Saclay Île-de-FranceGilles Celeux

University College DublinT. Brendan Murphy

University of WashingtonAdrian E. Raftery

1. Introduction; 2. Model-based clustering: basic ideas; 3. Dealing with difficulties; 4. Model-based classification; 5. Semi-supervised clustering and classification; 6. Discrete dataclustering; 7. Variable selection; 8. High-dimensional data; 9. Non-Gaussian model-basedclustering; 10. Network data; 11. Model-based clustering with covariates; 12. Other topics;List of R packages; Bibliography; Index.

20% Discount

£59.99 £47.99Discount priceOriginal price

$79.99 $63.99

Hardback 978-1-108-49420-5

'Bouveyron, Celeux, Murphy, and Raftery pioneered the theory, computation, and application of modern model-based clustering and discriminant analysis. Here they have produced an exhaustive yet accessible text, covering both the field's state of the art as well as its intellectual development. The authors develop a unified vision of cluster analysis, rooted in the theory and computation of mixture models. Embedded R code points the way for applied readers, while graphical displays develop intuition about both model construction and the critical but often-neglected estimation process. Building on a series of running examples, the authors gradually and methodically extend their core insights into a variety of exciting data structures, including networks and functional data. This text will serve as a backbone for graduate study as well as an important reference for applied data scientists interested in working with cutting-edge tools in semi- and unsupervised machine learning.' John S. Ahlquist, University of California, San Diego

August 2019253 x 177 mm c.446pp 40 b/w illus. 171colour illus. 48 tables