Special Issue: Data Management and Data Mining

Linguistic Theory Based Contextual Evidence Mining for Statistical Chinese Co-Reference Resolution

Jun Zhao and Fei-Fan Liu   

  1. National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100080, China
  • Received: 2006-07-04 Revised: 2007-03-19 Online: 2007-07-10 Published: 2007-07-10

Within a statistical learning framework, this paper examines how traditional linguistic findings on anaphora resolution can guide the mining and organization of contextual features for Chinese co-reference resolution. The main contributions are as follows. (1) To model the "syntactic and semantic parallelism" factor, we extract "bag of word forms and POS tags" features and "bag of semes" features from the contexts of the entity mentions and add them to the baseline feature set. (2) Because bags of word forms, POS tags, and semes are too coarse to determine the syntactic and semantic parallelism between two entity mentions, we propose a method for reconstructing contextual features based on semantic similarity computation, so that the reconstructed features better approximate the "syntactic and semantic parallelism preferences" factor of anaphora resolution. (3) We use an entity-mention-based contextual feature representation instead of an isolated word-based one, and also enlarge the contextual window, to approximate the "selectional restriction" factor of anaphora resolution. Experiments show that the multi-level contextual features are useful for co-reference resolution, and that the statistical system incorporating these features performs well on the standard ACE datasets.
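
As a rough illustration of contribution (1), the following minimal Python sketch builds "bag of word forms" and "bag of POS tags" contexts around two mentions and compares them. It is not the authors' implementation: the function names, the window size, the pre-tokenized `(word, POS)` input, and the use of Jaccard overlap as the comparison are all assumptions made for this example.

```python
from collections import Counter

def context_bag(tokens, start, end, window=3):
    """Collect word forms and POS tags within `window` tokens of a mention.

    tokens: list of (word, pos) pairs; the mention spans tokens[start:end].
    The window size is an illustrative assumption, not a value from the paper.
    """
    left = tokens[max(0, start - window):start]
    right = tokens[end:end + window]
    words = Counter(w for w, _ in left + right)
    tags = Counter(p for _, p in left + right)
    return words, tags

def jaccard(a, b):
    """Jaccard overlap of the key sets of two bags (assumed similarity measure)."""
    keys_a, keys_b = set(a), set(b)
    union = keys_a | keys_b
    return len(keys_a & keys_b) / len(union) if union else 0.0

def parallelism_features(tokens, mention_a, mention_b, window=3):
    """Return hypothetical pairwise features for two mentions given as (start, end)."""
    words_a, tags_a = context_bag(tokens, *mention_a, window=window)
    words_b, tags_b = context_bag(tokens, *mention_b, window=window)
    return {
        "word_overlap": jaccard(words_a, words_b),
        "pos_overlap": jaccard(tags_a, tags_b),
    }
```

Features of this kind would be appended to a baseline feature vector for a mention-pair classifier. The abstract's point (2) is that such exact-match overlap is coarse, which is why the paper moves on to similarity-based feature reconstruction.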

Key words: multicast; message passing interface (MPI); multi-head worm; multistage interconnection networks (MINs); wormhole routing;


ISSN 1000-9000(Print)

CN 11-2296/TP

Journal of Computer Science and Technology
Institute of Computing Technology, Chinese Academy of Sciences
P.O. Box 2704, Beijing 100190 P.R. China
E-mail: jcst@ict.ac.cn
  Copyright ©2015 JCST, All Rights Reserved