この論文を読む/探す
抄録
Independent observations X_1, X_2,…, X_n are made on a distribution F on R^d. To devide these observations into k clusters, first choose a vector of optimal cluster centers b_n=(b_<n1>, b_<n2>, …, b_<nk>) to minimize [numerical formula] as a function of a=(a_1, a_2, …, a_k), then assign each observation to its nearest cluster center. Each b_<nj> is the mean of observations in its cluster. Pollard (1982) obtained a central limit theorem for the means of the k-clusters. In this paper, it is shown that the bootstrap distribution of the centered b_n has the same limiting distribution ; the argument rests on asymptotic behavior of empirical processes on Vapnik-Chervonenkis classes in triangular array setting. Advantages of the bootstrap methods are discussed and the performance of bootstrap confidence sets is compared with Pollard's confidence sets by Monte Carlo simulation.