Adaptive Text Extraction and Mining
Papers from the 2004 AAAI Workshop
Ion Muslea, Program Chair
Technical Report WS-04-01 published by The AAAI Press, Menlo Park, California
This technical report is also available in book and CD format.
Contents
Workshop Organizing Committee / 1
Ion Muslea
Talks
Information Extraction by Convergent Boundary Classification / 1
Aidan Finn and Nicholas Kushmerick
IE Evaluation: Criticisms and Recommendations / 7
A. Lavelli, M. E. Califf, F. Ciravegna, D. Freitag, C. Giuliano, N. Kushmerick, and L. Romano
A Comparison of Keyword- and Keyterm-based Methods for Automatic Web Site Summarization / 15
Yongzheng Zhang, Evangelos Milios, and Nur Zincir-Heywood
The Use of Web-based Statistics to Validate Information Extraction / 21
Stephen Soderland, Oren Etzioni, Tal Shaked, and Daniel S. Weld
Using Soft-Matching Mined Rules to Improve Information Extraction / 27
Un Yong Nahm and Raymond J. Mooney
Populating the Semantic Web / 33
Kristina Lerman, Cenk Gazen, Steven Minton, and Craig Knoblock
Handling Irregularities in ROADRUNNER / 39
Valter Crescenzi, Giansalvatore Mecca, and Paolo Merialdo
Posters
A Model for Graded Levels of Generalizations in Intensional Query Answering / 45
Farah Benamara
Learning Text Patterns for Web Information Extraction and Assessment / 50
Doug Downey, Oren Etzioni, Stephen Soderland, and Daniel S. Weld
Using Statistical Techniques and WordNet to Reason with Noisy Data / 56
Rakesh Gupta and Mykel J. Kochenderfer
A Bootstrapping Approach to Information Extraction Domain Porting / 62
Cheng Niu, Wei Li, and Rohini K. Srihari
Class Extraction from the World Wide Web / 68
Ana-Maria Popescu, Alexander Yates, and Oren Etzioni
Automatic Model Structuring from Text using BioMedical Ontology / 74
Rohit Joshi, Xiaoli Li, Sreeram Ramachandaran, and Tze Yun Leong
Unsupervised Induction of IE Domain Knowledge using an Ontology / 80
Mark Stevenson
Lexical Semantics Domain Model for Information Extraction / 86
Patricia Lutsky
A Graphical Model for Shallow Parsing Sequences / 89
Adrian Silvescu and Vasant Honavar