Search Results

Now showing 1 - 3 of 3
  • Item
    Assessment of Stability in Partitional Clustering Using Resampling Techniques
    (Karlsruhe : KIT Scientific Publishing, 2016) Mucha, Hans-Joachim
    The assessment of stability in cluster analysis is strongly related to the main difficult problem of determining the number of clusters present in the data. The latter is subject of many investigations and papers considering different resampling techniques as practical tools. In this paper, we consider non-parametric resampling from the empirical distribution of a given dataset in order to investigate the stability of results of partitional clustering. In detail, we investigate here only the very popular K-means method. The estimation of the sampling distribution of the adjusted Rand index (ARI) and the averaged Jaccard index seems to be the most general way to do this. In addition, we compare bootstrapping with different subsampling schemes (i.e., with different cardinality of the drawn samples) with respect to their performance in finding the true number of clusters for both synthetic and real data.
  • Item
    Classification and clustering: models, software and applications
    (Berlin : Weierstraß-Institut für Angewandte Analysis und Stochastik, 2009) Mucha, Hans-Joachim; Ritter, Gunter
    We are pleased to present the report on the 30th Fall Meeting of the working group ``Data Analysis and Numerical Classification'' (AG-DANK) of the German Classification Society. The meeting took place at the Weierstrass Institute for Applied Analysis and Stochastics (WIAS), Berlin, from Friday Nov. 14 till Saturday Nov. 15, 2008. Already 12 years ago, WIAS had hosted a traditional Fall Meeting with special focus on classification and multivariate graphics (Mucha and Bock, 1996). This time, the special topics were stability of clustering and classification, mixture decomposition, visualization, and statistical software.
  • Item
    Big data clustering: Data preprocessing, variable selection, and dimension reduction
    (Berlin : Weierstraß-Institut für Angewandte Analysis und Stochastik, 2017) Mucha, Hans-Joachim
    [no abstract available]