Jooil Lee, Yanhua Jin, Won Suk Lee. SUBic: A Scalable Unsupervised Framework for Discovering High Quality Biclusters[J]. Journal of Computer Science and Technology, 2013, 28(4): 636-646. DOI: 10.1007/s11390-013-1364-y

SUBic: A Scalable Unsupervised Framework for Discovering High Quality Biclusters

  • Biclustering algorithms extend conventional clustering techniques to extract all of the meaningful subgroups of genes and conditions in the expression matrix of a microarray dataset. However, such algorithms are very sensitive to input parameters and show poor scalability. This paper proposes a scalable unsupervised biclustering framework, SUBic, to find high quality constant-row biclusters in an expression matrix effectively. A one-dimensional clustering algorithm is proposed to partition the attributes, that is, the columns of an expression matrix, into disjoint groups based on the similarity of their expression values. These groups form a set of short transactions, which are used to discover a set of frequent itemsets, each of which corresponds to a bicluster. However, a bicluster may still include attributes whose expression values are not similar enough to the others, so a bicluster refinement step enhances the quality of each bicluster by removing those attributes based on its distribution of expression values. The performance of the proposed method is comparatively analyzed through a series of experiments on synthetic and real datasets.
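
The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is one reading of the abstract, not the authors' implementation. It assumes that the one-dimensional clustering is applied to each row separately, that a new group starts wherever two neighbouring sorted values differ by more than a hypothetical `max_gap`, and that frequent itemsets can be found by brute-force enumeration, which stands in for a real frequent-itemset miner. The function names and the parameters `max_gap`, `min_rows`, and `min_cols` are illustrative, and the refinement step is omitted.

```python
from itertools import combinations


def row_groups(row, max_gap):
    """Partition the column indices of one non-empty row into disjoint groups.

    Assumption: columns are sorted by expression value, and a new group
    starts whenever two neighbouring values differ by more than max_gap.
    """
    order = sorted(range(len(row)), key=lambda j: row[j])
    groups, current = [], [order[0]]
    for prev, col in zip(order, order[1:]):
        if row[col] - row[prev] > max_gap:
            groups.append(frozenset(current))
            current = []
        current.append(col)
    groups.append(frozenset(current))
    return groups


def find_biclusters(matrix, max_gap, min_rows, min_cols):
    """Return (rows, columns) pairs for column sets shared by enough rows."""
    # Each sufficiently large group becomes one short transaction of columns.
    transactions = [
        (i, group)
        for i, row in enumerate(matrix)
        for group in row_groups(row, max_gap)
        if len(group) >= min_cols
    ]
    # Brute-force candidates: every subset of a transaction with at least
    # min_cols columns (a stand-in for a proper frequent-itemset miner).
    candidates = set()
    for _, group in transactions:
        for k in range(min_cols, len(group) + 1):
            candidates.update(frozenset(c) for c in combinations(sorted(group), k))
    # An itemset is frequent if at least min_rows rows contain it in one group.
    found = []
    for cols in candidates:
        rows = sorted({i for i, group in transactions if cols <= group})
        if len(rows) >= min_rows:
            found.append((rows, sorted(cols)))
    # Keep only closed itemsets: drop a column set if a strict superset
    # is supported by exactly the same rows.
    closed = [
        (rows, cols)
        for rows, cols in found
        if not any(rows == r2 and set(cols) < set(c2) for r2, c2 in found)
    ]
    return sorted(closed)
```

Calling `find_biclusters(matrix, max_gap=0.3, min_rows=2, min_cols=2)` on a small matrix returns each bicluster as a list of row indices paired with a list of column indices. The enumeration is exponential in the group size, so the sketch is only suitable for toy inputs.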