
A Data Type-Driven Property Alignment Framework for Product Duplicate Detection on the Web

  • G van Rooij
  • R (Ravi) Sewnarain
  • M Skogholt
  • T van der Zaan
  • Flavius Frasincar*
  • Kim Schouten

*Corresponding author for this work

Research output: Chapter/Conference proceeding, Conference proceeding, Academic, peer-review

4 Citations (Scopus)
5 Downloads (Pure)

Abstract

During the last decade, daily life has morphed into a world of broadband ubiquity, where devices facilitate constant engagement. As a consequence, e-commerce has seen immense growth. Despite the market opportunities for retailers and the ease with which customers can acquire products through webshops, the shift to digital retail has its drawbacks. For example, it leads to cluttered and incomparable information among different webshops, which calls for an automated method to regain homogeneity in product representations. This paper presents a product duplicate detection solution that exploits a data type-driven property alignment framework. Based on the performed experiment, we show a statistically significant improvement of the F1-score from 47.91% to 78.13% compared to an existing state-of-the-art approach.
Original language: English
Title of host publication: 17th International Conference on Web Information Systems Engineering (WISE 2016)
Publisher: Springer-Verlag
Pages: 380-395
Number of pages: 16
Volume: 10041
DOIs
Publication status: Published - 2 Nov 2016

Research programs

  • EUR ESE 32
