A Framework for Automatic Annotation of Web Pages Using the Google Rich Snippets Vocabulary

J van der Meer, F Boon, Frederik Hogenboom, Flavius Frasincar, U Kaymak

Research output: Chapter/Conference proceedingConference proceedingAcademicpeer-review

9 Citations (Scopus)

Abstract

One of the latest developments for the Semantic Web is Google Rich Snippets, a service that uses Web page annotations for displaying search results in a visually appealing manner. In this paper we propose the Automatic Review Recognition and annOtation of Web pages (ARROW) framework, which is able to identify reviews on Web pages and to annotate them using RDFa attributes. The ARROW framework consists of four steps: hotspot identification, subjectivity analysis, information extraction, and page annotation. We evaluate an implementation of the framework by using various Web sites. Based on the evaluation we conclude that our framework is able to properly identify the majority of reviews, reviewed items, and review dates.
Original languageEnglish
Title of host publicationTwenty-Sixth Symposium on Applied Computing (SAC 2011)
EditorsW. Chu, W.E. Wong, M.J. Palakal, C.-C. Hung
PublisherACM
Pages765-772
Number of pages8
DOIs
Publication statusPublished - 21 Mar 2011

Research programs

  • EUR ESE 32

Fingerprint

Dive into the research topics of 'A Framework for Automatic Annotation of Web Pages Using the Google Rich Snippets Vocabulary'. Together they form a unique fingerprint.

Cite this