A New Approach to Quantifying Cluster Analysis Results

УДК 519.6 + 519.25

  • Sergei V. Dronov Altai State University, Barnaul, Russia Email: dsv@math.asu.ru
  • Yulia A. Dudareva Altai State Medical University, Barnaul, Russia Email: julia.dudareva@mail.ru
  • Svyatoslav Yu. Eskov Altai State University, Barnaul, Russia Email: eskovslava13@gmail.com
Keywords: cluster variable, cluster quantification, post-hoc cluster analysis problem, latent class analysis

Abstract

The paper proposes a unified approach to several variants of solving the problem of quantifying clusters of a pre-built cluster partition of a finite set. Generally speaking, each cluster gets vector labels when any of the mentioned variants are applied. For this purpose, a technique close to the analysis of latent classes is used. The first latent-object group of methods identifies each object with a vector with coordinates equal to the of the observed characteristics of this object. The second latent-indicative group of methods replaces each characteristic of all objects in a given cluster by a vector of its values. After that, the central vector being the closest to all vectors of the bunch, is selected from the bundle of vectors of a given cluster. The latent-object approach declares such a vector the vector label of the cluster. When using the latent-indicative approach, the obtained labels of different clusters have different dimensions. Thus, possible methods to reduce them to the one-dimensional numeric form are provided along with the recommendations for reducing the dimensionality of data-object labels. A numerical example is considered.

Downloads

Download data is not yet available.

Author Biographies

Sergei V. Dronov, Altai State University, Barnaul, Russia

Candidate of Sciences in Physics and Mathematics, Associate Professor, Associate Professor of the Department of Mathematical Analysis

Yulia A. Dudareva, Altai State Medical University, Barnaul, Russia

Doctor of Sciences in Medicine, Professor of the Department of Obstetrics and Gynecology

Svyatoslav Yu. Eskov, Altai State University, Barnaul, Russia

Undergraduate Student of the Institute of Mathematics and Information Technology

References

Samuels J.A. One-Hot Encoding and Two-Hot Encoding: An Introduction. London: Imperial college London. 2024. 5 P. DOI: : 10.13140/RG.2.2.21459.76327

Manning c.D., Raghavan P., Schutze H. Introduction to Information Retrieval. cambridge: University Press. 2009. 581 p. DOI: : 10.1017/cBO9780511809071

Moreo A., Francisco M., Sebastiani F. Multi-label Quantification // AcM Transactions on Knowledge Discovery from Data. 2024. Vol. 18. No 1. DOI: : 10.48550/ arXiv.2211.08063

Bunse M., Moreo A., Sebastiani F., Senz M. Ordinal Quantification Through Regularization // Data Mining and Knowledge Discovery. 2024. Vol 38. No 6. P. 4076-4121. DOI: : 10.1007/978-3-031-26419-1-3

cheung Kwok-Wai, Tsui Kwok-ching, Liu Jiming. Extended Latent class Models for collaborative Recommendation // IEEE Transactions on Systems, Man, and cybernetics — Part A: Systems and Humans. 2004. Vol. 34. No 1. P. 143-148. DOI: : 10.1109/TSMcA.2003.818877

Rindskopf D. Latent Class Analysis. In: The SAGE Handbook of Quantitative Methods in Psychology. N.Y.: Sage. 2009. P. 226-244. DOI: : 10.4135/9780857020994

Дронов С.В., Шеларь А.Ю. Латентный кластерный анализ для случая двух кластеров // МАК: «Математики — Алтайскому краю» : сборник трудов Всероссийской конференции по математике. Барнаул: Изд-во Алтайского гос. университета. 2018. С. 23-26.

Дронов С.В., Еськов С.Ю. Геометрический подход в post-hoc задаче кластерного анализа // Труды семинара по геометрии и математическому моделированию. 2024. Т. 10. С. 48-53.

Dronov S.V., Sazonova A.S. Two Approaches to Cluster Variable Quantification // Model Assisted Statistics and Applications. 2015. Vol. 10. P. 155-162. DOI: : 10.3233/ MAS-140314

Gewers EL. et al. Principal Component Analysis: A Natural Approach to Data Exploration // ACM Computing Surveys. 2021. Vol. 54. No 4. P. 1-34. DOI: : 10.1145/3447755

Published
2025-09-15
How to Cite
Dronov S. V., Dudareva Y. A., Eskov S. Y. A New Approach to Quantifying Cluster Analysis Results // Izvestiya of Altai State University, 2025, № 4(144). P. 67-72 DOI: 10.14258/izvasu(2025)4-09. URL: https://izvestiya.asu.ru/article/view/%282025%294-09.