Xueqi Cheng, Songbo Tan, Lilian Tang. Using DragPushing to Refine Concept Index for Text Categorization[J]. Journal of Computer Science and Technology, 2006, 21(4): 592-596.

Using DragPushing to Refine Concept Index for Text Categorization

  • Concept index (CI) is a very fast and efficient feature extraction (FE) algorithm for text classification. The key approach in the CI scheme is to express each document as a function of the various concepts (centroids) present in the collection. However, the ability of the centroids to represent the categories of a corpus is often influenced by so-called model misfit, caused by a number of factors in the FE process ranging from feature selection to the similarity measure. To address this issue, this work employs the "DragPushing" strategy to refine the centroids that are used for the concept index. We present an extensive experimental evaluation of the refined concept index (RCI) on two English collections and one Chinese corpus using a state-of-the-art Support Vector Machine (SVM) classifier. The results indicate that in each case, RCI-based SVM yields much better performance than the normal CI-based SVM, but at a lower computation cost during the training and classification phases.
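The idea summarized in the abstract can be illustrated with a minimal sketch. The abstract only states that documents are expressed as a function of class centroids and that the centroids are refined with DragPushing; it does not give the update rule. The rule used here is an assumption: when a training document is misclassified, the centroid of its true class is dragged toward the document and the centroid of the wrongly predicted class is pushed away from it. All function names, the `rate` and `epochs` parameters, and the clipping of negative weights are hypothetical choices made for illustration, not details taken from the paper.

```python
import math

def cosine(u, v):
    """Cosine similarity between two dense term vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def centroids(docs, labels):
    """Mean term vector per class: the 'concepts' of the collection."""
    sums, counts = {}, {}
    for d, y in zip(docs, labels):
        acc = sums.setdefault(y, [0.0] * len(d))
        for i, x in enumerate(d):
            acc[i] += x
        counts[y] = counts.get(y, 0) + 1
    return {y: [x / counts[y] for x in acc] for y, acc in sums.items()}

def concept_index(doc, cents):
    """Represent a document by its similarity to each centroid (classes in sorted order)."""
    return [cosine(doc, cents[y]) for y in sorted(cents)]

def drag_push(docs, labels, cents, rate=0.5, epochs=5):
    """ASSUMED refinement rule (not stated in the abstract):
    for each misclassified training document, drag the true-class centroid
    toward it and push the wrongly predicted centroid away from it."""
    cents = {y: list(c) for y, c in cents.items()}  # work on a copy
    for _ in range(epochs):
        for d, y in zip(docs, labels):
            pred = max(sorted(cents), key=lambda c: cosine(d, cents[c]))
            if pred != y:
                cents[y] = [c + rate * x for c, x in zip(cents[y], d)]
                # Clipping at zero is an illustrative choice to keep weights non-negative.
                cents[pred] = [max(0.0, c - rate * x) for c, x in zip(cents[pred], d)]
    return cents
```

Under these assumptions, the refined concept index of a document would be `concept_index(doc, drag_push(docs, labels, centroids(docs, labels)))`, and the resulting low-dimensional vectors (one component per class) are what a downstream classifier such as an SVM would be trained on.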
