Information Extraction: A Pattern Mining Approach for Free-Form Text
Author: C.-H. Chang
Publish Date: 2003-12-04
Updated: March 30, 2025
Abstract
The vast amount of information available online has led to renewed interest in information extraction (IE) systems, which analyze input documents to produce a structured representation of selected information from them. However, the design of an IE system differs greatly according to its input, which ranges from unrestricted plain text to telegraphic passages to semi-structured Web documents. This paper gives a survey of IE systems and discusses the essentials of IE system design. A pattern mining approach is proposed as a general approach to IE system design.