Trend of the SQC Based on Data Clustering-Studies of Categorical Data Analysis Based on Principal Points for Multivariate Binary Distribution-

DOI Web Site Open Access

Bibliographic Information

Other Title
  • データのクラスタリングにもとづくSQCの動向~2値型主要点解析法を用いたカテゴリカルデータの解析に関する研究~
  • データ ノ クラスタリング ニ モトズク SQC ノ ドウコウ : 2チガタ シュヨウテン カイセキホウ オ モチイタ カテゴリカルデータ ノ カイセキ ニ カンスル ケンキュウ

Search this article

Abstract

In statistics,there are many studies on principal points.The concept of principal points which is proposed by Flury allows us to carry out such an analysis in a variety of applications and also properties of principal points have been studied. Although principal points of a multivariate distribution have widely studied,there is no discussion of principal points for a multivariate binary distribution.<BR>  Yamashita and Suzuki have define the principal points for a multivariate binary distribution. Since principal points for a multivariate binary distribution are selected from multivariate binary region,there is a problem of the amount of calculation,since this problem is an NP-hard problem. Yamashita and Suzuki have shown the submodularity of principal points for a multivariate binary distribution and proposed an approximation method based on the greedy algorithm.Using the property of submodularity of principal points for a multivariate binary distribution,the accuracy of approximations is at least(1-1/e)times the optimal solution proved by Nemhauser et al.Finally, we show the result of an application of the methods to questionnaire survey data.

Journal

Related Projects

See more

Details 詳細情報について

Report a problem

Back to top