The following topics will be covered:
• Introduction into standard clustering techniques with a focus on k-means clustering
• Feature selection
• Cluster validation
• Visualization techniques for clustering results such as transformation or perturbation based approaches
• Brief introduction into the imputation of missing values in pre-processing or during clustering
• Clustering of mixed data types (numerical and categorical features)
The theoretical explanations will be accompanied by a practical example in R on a public data set showcasing a typical insurance application.
Registration deadline: 29 April 2021
Anmeldungen sind für diese Veranstaltung geschlossen.