Extracting structured data from web pages
"Most current work is deficient in providing users the meaning of the attributes of the extracted data." - also applies to our work, RoadRunner, and Structured Data Extraction in SIGMOD03.
last update: April 3, 2005