Automatic Extraction of Words from Chinese Textual Data
-
Abstract
In addition to Chinese character I/O, one of the most important issues for Chinese information processing is automatic extraction of words from textual data. Having discussed the characteristics of Chinese words and sentences, we proved in this paper that this problem cannot be thoroughly resolved. Then, various algorithms for extraction of words from Chinese sentences are reviewed. Finally, a new algorithm is put forward, based on which a highly automatic Chinese information processing system has been deve…
-
-