Over the past few years, the authors have been working to build an end-to-end system at Wisconsin to manage unstructured data, using extraction, integration, and user interaction. This paper describes the key Information Extraction (IE) challenges that they have encountered and sketches their solutions. In particular, the authors discuss developing a declarative IE language, optimizing for this language, generating IE provenance, incorporating user feedback into the IE process, developing a novel wiki-based user interface for feedback, best-effort IE, pushing IE into RDBMSs, and more.
- Format: PDF
- Size: 145.6 KB