Human-controlled iterative subclustering analysis

Mauri Kaipainen, Olli Pekka Pitkänen

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

2 Citations (Scopus)


The accumulation and use of data are rapidly expanding. With it, new kinds of interactions emerge that go beyond traditional data analytics, to the point at which a whole new research area of human-data interaction is has been suggested. Our study reconsiders cluster analysis from this point of view. We aim to redesign the process to be more interactive and transparent for purposes beyond conventional data analysis. We address the core issue of cluster analysis, namely what criteria are to determine the homogeneity of a cluster by means of breaking the algorithm into a sequence of explorative subdivisions proceeding as a human-data dialogue. The system provides the human agent with a heuristic. It is formed by sorting the variables of the data set by descending orthogonality against the variable that was applied as the subdivision criterion of the previous iteration. This allows minimizing redundancy of the analysis while securing distinctions relevant for the analytic intention and contextuality, which go beyond the reach of algorithmic decision. The proposed method constitutes a quick and intuitive access to data mining, facilitating new insights and identifying actionable generalizations.
Original languageEnglish
Title of host publicationProceedings of 2017 IEEE International Conference on Big Data (IEEE BigData)
Publication date12.2017
Publication statusPublished - 12.2017
MoE publication typeA4 Article in conference proceedings
Event2017 IEEE International Conference on Big Data (IEEE BigData) - Boston, United States
Duration: 11.12.201714.12.2017


  • 512 Business and Management


Dive into the research topics of 'Human-controlled iterative subclustering analysis'. Together they form a unique fingerprint.

Cite this