On joint dimension reduction and clustering of categorical data

Alfonso Iodice D’Enza*, Michel Van de Velden, Francesco Palumbo

*Corresponding author for this work

Research output: Chapter/Conference proceedingConference proceedingAcademicpeer-review

3 Citations (Scopus)

Abstract

There exist several methods for clustering high-dimensional data. One popular approach is to use a two-step procedure. In the first step, a dimension reduction technique is used to reduce the dimensionality of the data. In the second step, cluster analysis is applied to the data in the reduced space. This method may be referred to as the tandem approach. An important drawback of this method is that the dimension reduction may distort or hide the cluster structure. As an alternative, various authors have proposed joint dimension reduction and clustering approaches. In this paper we review some of these existing joint dimension reduction and clustering methods for categorical data in a unified framework that facilitates comparison.

Original languageEnglish
Title of host publicationAnalysis and Modeling of Complex Data in Behavioral and Social Sciences
EditorsAkinori Okada, Claus Weihs, Donatella Vicari, Giancarlo Ragozini
Pages161-169
Number of pages9
DOIs
Publication statusPublished - 2014
EventJoint international meeting on Japanese Classification Society and the Classification and Data Analysis Group of the Italian Statistical Society, JCS-CLADAG 2012 - Capri Island, Italy
Duration: 3 Sept 20124 Sept 2012

Publication series

SeriesStudies in Classification, Data Analysis, and Knowledge Organization
Volume49
ISSN1431-8814

Conference

ConferenceJoint international meeting on Japanese Classification Society and the Classification and Data Analysis Group of the Italian Statistical Society, JCS-CLADAG 2012
Country/TerritoryItaly
CityCapri Island
Period3/09/124/09/12

Bibliographical note

Publisher Copyright:
© Springer International Publishing Switzerland 2014.

Fingerprint

Dive into the research topics of 'On joint dimension reduction and clustering of categorical data'. Together they form a unique fingerprint.

Cite this