SCHEMA – An Algorithm for Automated Product Taxonomy Mapping in E-commerce

S.S. (Steven) Aanen, L.J. (Lennart) Nederstigt, Damir Vandic, Flavius Frasincar

Research output: Contribution to conferencePaperAcademic

8 Citations (Scopus)

Abstract

This paper proposes SCHEMA, an algorithm for automated mapping between heterogeneous product taxonomies in the e-commerce domain. SCHEMA utilises word sense disambiguation techniques, based on the ideas from the algorithm proposed by Lesk, in combination with the semantic lexicon WordNet. For finding candidate map categories and determining the path-similarity we propose a node matching function that is based on the Levenshtein distance. The final mapping quality score is calculated using the Damerau-Levenshtein distance and a node-dissimilarity penalty. The performance of SCHEMA was tested on three real-life datasets and compared with PROMPT and the algorithm proposed by Park & Kim. It is shown that SCHEMA improves considerably on both recall and F1-score, while maintaining similar precision.
Original languageEnglish
Pages300-314
Number of pages15
DOIs
Publication statusPublished - 27 May 2012
EventThe Interface for Dutch ICT-Research 2012 (ICT.OPEN 2012) - Rotterdam, the Netherlands
Duration: 22 Oct 201223 Oct 2012

Conference

ConferenceThe Interface for Dutch ICT-Research 2012 (ICT.OPEN 2012)
CityRotterdam, the Netherlands
Period22/10/1223/10/12

Fingerprint

Dive into the research topics of 'SCHEMA – An Algorithm for Automated Product Taxonomy Mapping in E-commerce'. Together they form a unique fingerprint.

Cite this