Systems, Man & Cybernetics - April 2017 - 24

Differences Between
point in a multidimensional space.
Clustering and Coclustering
That is, the variable for each ele-
Although a data set
Clustering is a commonly used meth-
ment in the vector represents an
od in unsupervised learning. Given
axis in the multidimensional space.
can be very large,
the input data as a matrix, the task of
The dimension of the vector is the
it is possible that
clustering is to partition the matrix
number of elements in the vector.
meaningful and
into clusters along either the row or
However, in image processing and
the column direction, but not both
computer programming, we use
coherent patterns
at the same time [Figure 2(a)-(b)].
the terms like 2-D and 3-D images,
embedded in the
The rows and columns correspond
and 2-D and 3-D data arrays. Here,
to samples and features, respec-
dimension means the number of
data array are much
tively, in pattern recognition ap -
directions or indices in an image or
smaller in size.
pl i cations. Similar to clustering,
data array. In tensor algebra, each
coclustering deals with unlabeled
direction is called a mode, and the
data and performs unsupervised pat-
term multiway data array is often
tern classification. However, there is
used. For simplicity, we will use
a fundamental difference between clustering and coclustering.
the word multidimensional for both vector spaces and
In coclustering, the data are partitioned in both row and col-
data arrays when the context is clear. That is, for real
umn directions. Therefore, for 2-D data, a cluster consists of a
numbers, a multidimensional space corresponds to the set
M
subset of rows (or columns) with all columns (rows) present in
R , where M is the number of elements or dimensionality
each row (column), while a cocluster consists of a subset of
of the vectors in the space. An N-dimensional data array
rows and a subset of columns [Figures 2(c)].
(N-way data array, or Nth order, or N-mode tensor)
There are several other differences between clustering and
belongs to the set R M # M # g # M , where M 1, M 1, ..., M N are
coclustering [3], [16]. In hard clustering, the clusters do not
the numbers of elements in corresponding directions.
have any overlap [Figure 2(a)-(b)]. That is, a sample is
In the previous discussion, we used the term coherent
assigned to one and only one cluster unless it is considered as
pattern, which means that elements in the rows or columns
an outlier and is excluded in classification. In soft clustering, a
of the pattern are correlated and display certain common
sample is assigned to each cluster with a membership value in
properties. We will explain later that a coherent pattern can
fuzzy logic modeling. In coclustering, several coclusters can
be represented as a low-rank matrix or tensor and that our
partially overlap. For example, inĀ Figure 2(c), cocluster 3 over-
tasks of coclustering are to extract one or more coherent
laps with all the other three coclusters. In addition, an element
patterns embedded in the matrix or tensor.
in the data matrix may belong to none, one, or more coclus-
ters. As shown in Figure 1(b), a large matrix may contain
much smaller coclusters, and most elements in the matrix are
irrelevant. We need to identify the row and column indices of
relevant elements to detect the coclusters.
In hard clustering [Figure 2(a)-(b)], we can always
exchange the rows or columns so that each cluster occupies a
contiguous region. However, in coclustering, we may not be
able to display all coclusters as contiguous regions at the same
time. This is illustrated in Figure 2(c). If we exchange rows so
that three regions of cocluster 3 become contiguous, then
cocluster 2 or cocluster 4 would be split into more than one
region. Therefore, it is more difficult to visualize all coclusters
than clusters.
The concept of coclustering was introduced in the 1970s,
but it has gained much attention only recently due to its use-
ful applications to genomic data analysis [1]-[26]. In a
(a)
(b)
(c)
genome, genes usually work together to perform a biological
function. In gene expression data, the samples represent
Figure 1. an example of a cocluster. (a) a large
genes and features conditions. A condition can be a time
matrix that appears to contain just random data.
(b) a cocluster exists in the matrix. the elements of
point, an organ, a species, or a different experiment setup. A
the cocluster can be put together to show a coherent
cocluster represents a subset of genes that coexpress (or are
pattern by rearranging rows and columns in (a).
coregulated) under a subset of conditions. Coclustering anal-
(c) the locations of the elements in the cocluster in
ysis is useful for understanding diseases caused by aberrant
the original matrix in (a), with all elements outside
coexpressed genes.
the cocluster shown as a uniform gray background.
1

24

2

N

IEEE SyStEmS, man, & CybErnEtICS magazInE A pri l 2017



Table of Contents for the Digital Edition of Systems, Man & Cybernetics - April 2017

Systems, Man & Cybernetics - April 2017 - Cover1
Systems, Man & Cybernetics - April 2017 - Cover2
Systems, Man & Cybernetics - April 2017 - 1
Systems, Man & Cybernetics - April 2017 - 2
Systems, Man & Cybernetics - April 2017 - 3
Systems, Man & Cybernetics - April 2017 - 4
Systems, Man & Cybernetics - April 2017 - 5
Systems, Man & Cybernetics - April 2017 - 6
Systems, Man & Cybernetics - April 2017 - 7
Systems, Man & Cybernetics - April 2017 - 8
Systems, Man & Cybernetics - April 2017 - 9
Systems, Man & Cybernetics - April 2017 - 10
Systems, Man & Cybernetics - April 2017 - 11
Systems, Man & Cybernetics - April 2017 - 12
Systems, Man & Cybernetics - April 2017 - 13
Systems, Man & Cybernetics - April 2017 - 14
Systems, Man & Cybernetics - April 2017 - 15
Systems, Man & Cybernetics - April 2017 - 16
Systems, Man & Cybernetics - April 2017 - 17
Systems, Man & Cybernetics - April 2017 - 18
Systems, Man & Cybernetics - April 2017 - 19
Systems, Man & Cybernetics - April 2017 - 20
Systems, Man & Cybernetics - April 2017 - 21
Systems, Man & Cybernetics - April 2017 - 22
Systems, Man & Cybernetics - April 2017 - 23
Systems, Man & Cybernetics - April 2017 - 24
Systems, Man & Cybernetics - April 2017 - 25
Systems, Man & Cybernetics - April 2017 - 26
Systems, Man & Cybernetics - April 2017 - 27
Systems, Man & Cybernetics - April 2017 - 28
Systems, Man & Cybernetics - April 2017 - 29
Systems, Man & Cybernetics - April 2017 - 30
Systems, Man & Cybernetics - April 2017 - 31
Systems, Man & Cybernetics - April 2017 - 32
Systems, Man & Cybernetics - April 2017 - 33
Systems, Man & Cybernetics - April 2017 - 34
Systems, Man & Cybernetics - April 2017 - 35
Systems, Man & Cybernetics - April 2017 - 36
Systems, Man & Cybernetics - April 2017 - 37
Systems, Man & Cybernetics - April 2017 - 38
Systems, Man & Cybernetics - April 2017 - 39
Systems, Man & Cybernetics - April 2017 - 40
Systems, Man & Cybernetics - April 2017 - 41
Systems, Man & Cybernetics - April 2017 - 42
Systems, Man & Cybernetics - April 2017 - 43
Systems, Man & Cybernetics - April 2017 - 44
Systems, Man & Cybernetics - April 2017 - 45
Systems, Man & Cybernetics - April 2017 - 46
Systems, Man & Cybernetics - April 2017 - 47
Systems, Man & Cybernetics - April 2017 - 48
Systems, Man & Cybernetics - April 2017 - 49
Systems, Man & Cybernetics - April 2017 - 50
Systems, Man & Cybernetics - April 2017 - 51
Systems, Man & Cybernetics - April 2017 - 52
Systems, Man & Cybernetics - April 2017 - 53
Systems, Man & Cybernetics - April 2017 - 54
Systems, Man & Cybernetics - April 2017 - 55
Systems, Man & Cybernetics - April 2017 - 56
Systems, Man & Cybernetics - April 2017 - Cover3
Systems, Man & Cybernetics - April 2017 - Cover4
https://www.nxtbook.com/nxtbooks/ieee/smc_202310
https://www.nxtbook.com/nxtbooks/ieee/smc_202307
https://www.nxtbook.com/nxtbooks/ieee/smc_202304
https://www.nxtbook.com/nxtbooks/ieee/smc_202301
https://www.nxtbook.com/nxtbooks/ieee/smc_202210
https://www.nxtbook.com/nxtbooks/ieee/smc_202207
https://www.nxtbook.com/nxtbooks/ieee/smc_202204
https://www.nxtbook.com/nxtbooks/ieee/smc_202201
https://www.nxtbook.com/nxtbooks/ieee/smc_202110
https://www.nxtbook.com/nxtbooks/ieee/smc_202107
https://www.nxtbook.com/nxtbooks/ieee/smc_202104
https://www.nxtbook.com/nxtbooks/ieee/smc_202101
https://www.nxtbook.com/nxtbooks/ieee/smc_202010
https://www.nxtbook.com/nxtbooks/ieee/smc_202007
https://www.nxtbook.com/nxtbooks/ieee/smc_202004
https://www.nxtbook.com/nxtbooks/ieee/smc_202001
https://www.nxtbook.com/nxtbooks/ieee/smc_201910
https://www.nxtbook.com/nxtbooks/ieee/smc_201907
https://www.nxtbook.com/nxtbooks/ieee/smc_201904
https://www.nxtbook.com/nxtbooks/ieee/smc_201901
https://www.nxtbook.com/nxtbooks/ieee/smc_201810
https://www.nxtbook.com/nxtbooks/ieee/smc_201807
https://www.nxtbook.com/nxtbooks/ieee/smc_201804
https://www.nxtbook.com/nxtbooks/ieee/smc_201801
https://www.nxtbook.com/nxtbooks/ieee/systems_man_cybernetics_1017
https://www.nxtbook.com/nxtbooks/ieee/systems_man_cybernetics_0717
https://www.nxtbook.com/nxtbooks/ieee/systems_man_cybernetics_0417
https://www.nxtbook.com/nxtbooks/ieee/systems_man_cybernetics_0117
https://www.nxtbook.com/nxtbooks/ieee/systems_man_cybernetics_1016
https://www.nxtbook.com/nxtbooks/ieee/systems_man_cybernetics_0716
https://www.nxtbook.com/nxtbooks/ieee/systems_man_cybernetics_0416
https://www.nxtbook.com/nxtbooks/ieee/systems_man_cybernetics_0116
https://www.nxtbook.com/nxtbooks/ieee/systems_man_cybernetics_1015
https://www.nxtbook.com/nxtbooks/ieee/systems_man_cybernetics_0715
https://www.nxtbook.com/nxtbooks/ieee/systems_man_cybernetics_0415
https://www.nxtbook.com/nxtbooks/ieee/systems_man_cybernetics_0115
https://www.nxtbookmedia.com