[1] Qiao Lei, Li Cunhua, Zhong Zhaoman, et al. Research on People's Information Extraction Based on Rules[J]. Journal of Nanjing Normal University (Natural Science Edition), 2012, 35(04): 134-139.
Research on People's Information Extraction Based on Rules
Journal of Nanjing Normal University (Natural Science Edition) [ISSN: 1001-4616 / CN: 32-1239/N]
- Volume: 35
- Issue: 2012, No. 04
- Pages: 134-139
- Section: Computer Science
- Publication date: 2012-12-20
Article Info
- Title: Research on People's Information Extraction Based on Rules
- Author(s): Qiao Lei (乔磊) 1, 2; Li Cunhua (李存华) 2; Zhong Zhaoman (仲兆满) 2; Wang Jun (王俊) 2; Liu Dongdong (刘冬冬) 2
- Affiliations:
  1. School of Computer Science and Technology, China University of Mining and Technology, Xuzhou 221116, Jiangsu, China
  2. School of Computer Engineering, Huaihai Institute of Technology, Lianyungang 222005, Jiangsu, China
- Keywords: text information extraction; people's information extraction; rules of people's attributes; extraction algorithm
- Classification number (CLC): TP391.3
- Abstract: With the rapid development of the Internet, the volume of information is growing explosively, and obtaining the required information from massive amounts of text has become an important problem. Text processing technologies such as retrieval, classification, and extraction have made considerable progress, but automatic extraction of person attributes has received little attention. The rule-based person information extraction algorithm first describes, as rules, the information to be extracted, with emphasis on time, place, and native place. A person information extraction system is then developed on the basis of these rules, achieving automatic extraction of semi-structured person attribute information.
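The abstract does not list the actual rules, so the following is only a minimal sketch of what rule-based attribute extraction can look like, assuming each rule is a regular expression with one capture group. The rule names, the patterns, and the sample sentence (about a placeholder person, 张三) are all hypothetical and are not taken from the paper.

```python
import re

# Hypothetical rule set: attribute name -> regex with exactly one capture group.
# These patterns are illustrative assumptions, not the rules defined in the paper.
RULES = {
    # A date such as "1965年3月" or "1965年3月12日" directly followed by "出生" (born).
    "birth_date": re.compile(r"(\d{4}年(?:\d{1,2}月)?(?:\d{1,2}日)?)出生"),
    # A run of Chinese characters following "出生于" (born in).
    "birth_place": re.compile(r"出生于([\u4e00-\u9fff]+)"),
    # A run of Chinese characters following "籍贯" (native place), with an optional connector.
    "native_place": re.compile(r"籍贯(?:是|为|[::])?([\u4e00-\u9fff]+)"),
}


def extract_attributes(text):
    """Apply every rule to the text and return the first match for each attribute."""
    result = {}
    for name, pattern in RULES.items():
        match = pattern.search(text)
        if match:
            result[name] = match.group(1)
    return result


if __name__ == "__main__":
    sample = "张三,男,1965年3月出生于江苏徐州,籍贯江苏连云港。"
    print(extract_attributes(sample))
```

Keeping the rules in a dictionary separate from the matching loop means new attributes can be added without touching the extraction function, which suits a system whose behavior is driven by rule descriptions. A production system for Chinese text would likely also need word segmentation and more robust boundary handling than these patterns provide.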
Memo
- Corresponding author: Qiao Lei, master's student; research interest: artificial intelligence. E-mail: qiaony@163.com
Last Update: 2013-03-11