Research Interests

Web Content Mining, Text Classificaton; One-class Classification. Data Mining, Machine Learning, Artificial Intelligence; Semantic Web, Natual Language Processing.

Publication

  • Yanhong Zhai, and Bing Liu. "Automatic Wrapper Generation Using Tree Matching and Partial Tree Alignment" Accepted in AAAI Nectar Papers Track at the 21st National Conference on Artificial Intelligence (AAAI-06), Boston, USA, July 16 - 20, 2006.

  • Yanhong Zhai, and Bing Liu. "Extracting Web Data Using Instance-Based Learning" Proc. The 6th International Conference on Web Information Systems Engineering (WISE-2005), Nov 20-22, New York.
  • [pdf] (Best Paper Award)

  • Bing Liu, and Yanhong Zhai. "Extracting Data from Nested Data Records in Web Pages" Proc. The 6th International Conference on Web Information Systems Engineering (WISE-2005), Nov 20-22, New York.
  • [pdf]

  • Yanhong Zhai, and Bing Liu. "Web Data Extraction Based on Partial Tree Alignment" Proc. The 14th international World Wide Web conference (WWW-2005), May 10-14, 2005, in Chiba, Japan. [pdf]


  • Bing Liu, Robert Grossman, Yanhong Zhai. "Mining Data Records in Web Pages." Proc. The ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD-2003), Washington, DC, USA, August 24 - 27, 2003. [pdf-conference version][full version]

  • Yanhong Zhai, and Bing Liu. "Structured Data Extraction from the Web based on Partial Tree Alignment." accepted by IEEE Transactions on Knowledge and Data Engineering, 2006.

  • Bing Liu, Robert Grossman and Yanhong Zhai. "Mining Web Pages for Data Records," IEEE Intelligent Systems special issue on Mining the Web for Actionable Knowledge, 2004. [pdf]
  •  

    last update: June 9, 2006



    Research Projects

    Courses

    Reading Notes

    Bio & CV