We use cookies to improve your experience with our site.
Xueqi Cheng, Songbo Tan, Lilian Tang. Using DragPushing to Refine Concept Index for Text CategorizationJ. Journal of Computer Science and Technology, 2006, 21(4): 592-596.
Citation: Xueqi Cheng, Songbo Tan, Lilian Tang. Using DragPushing to Refine Concept Index for Text CategorizationJ. Journal of Computer Science and Technology, 2006, 21(4): 592-596.

Using DragPushing to Refine Concept Index for Text Categorization

  • Concept index (CI) is a very fast and efficientfeature extraction (FE) algorithm for text classification. The keyapproach in CI scheme is to express each document as a function ofvarious concepts (centroids) present in the collection. However, therepresentative ability of centroids for categorizing corpus is ofteninfluenced by so-called model misfit caused by a number of factors inthe FE process including feature selection to similarity measure. Inorder to address this issue, this work employs the ``DragPushing''Strategy to refine the centroids that are used for concept index. Wepresent an extensive experimental evaluation of refined concept index(RCI) on two English collections and one Chinese corpus usingstate-of-the-art Support Vector Machine (SVM) classifier. The resultsindicate that in each case, RCI-based SVM yields a much betterperformance than the normal CI-based SVM but lower computation costduring training and classification phases.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return