Incremental Cosine Computations for Search and Exploration of Tag Spaces

R Vermaas, Damir Vandic, Flavius Frasincar

Research output: Chapter/Conference proceeding › Conference proceeding › Academic › peer-review

2 Citations (Scopus)


Tags are often used to describe user-generated content on the Web. However, the available Web applications do not process new tag information incrementally, which limits their scalability. Since the cosine similarity between tags represented as co-occurrence vectors is an important aspect of these frameworks, we propose two approaches for the incremental computation of cosine similarities. The first approach recalculates the cosine similarity only for new tag pairs and for existing tag pairs whose co-occurrences have changed. The second approach computes the cosine similarity between two tags by reusing, if available, the previous cosine similarity between these tags. Both approaches produce the same cosine values as a complete recalculation of the cosine similarities. Our experiments show that the proposed approaches are between 1.2 and 23 times faster than a complete recalculation, depending on the number of co-occurrence changes and new tags.
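The idea behind the second approach can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes that tags are stored as sparse co-occurrence dictionaries, that the squared norm of each vector is cached, and that only one of the two vectors changes. The function names `cosine` and `update_cosine` are hypothetical. The previous cosine is turned back into a dot product, the contribution of the changed entries is added, and the result is renormalised.

```python
import math


def cosine(a, b):
    """Full recomputation of the cosine between two sparse vectors (dicts)."""
    dot = sum(v * b.get(k, 0) for k, v in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def update_cosine(old_cos, sq_a, sq_b, a, b, delta):
    """Return (new_cos, new_sq_a) after adding the sparse change `delta` to `a`.

    Assumptions (not from the source): `a` is the vector BEFORE the change,
    `sq_a` and `sq_b` are cached squared norms, and `b` is unchanged.
    Only the entries listed in `delta` are visited.
    """
    # Recover the old dot product from the old cosine and the cached norms.
    old_dot = old_cos * math.sqrt(sq_a * sq_b)
    # The dot product changes only on the coordinates that changed in `a`.
    new_dot = old_dot + sum(d * b.get(k, 0) for k, d in delta.items())
    # (a_k + d)^2 - a_k^2 = 2 * a_k * d + d^2 for each changed coordinate.
    new_sq_a = sq_a + sum(2 * a.get(k, 0) * d + d * d for k, d in delta.items())
    if new_sq_a == 0 or sq_b == 0:
        return 0.0, new_sq_a
    return new_dot / math.sqrt(new_sq_a * sq_b), new_sq_a


# Usage: tag `a` gains one co-occurrence with "x" and three with "z".
a = {"x": 1, "y": 2}
b = {"x": 2, "z": 1}
delta = {"x": 1, "z": 3}

old_cos = cosine(a, b)
new_cos, new_sq_a = update_cosine(old_cos, 5, 5, a, b, delta)

# Apply the change and compare against a complete recalculation.
a_after = dict(a)
for k, d in delta.items():
    a_after[k] = a_after.get(k, 0) + d
full_cos = cosine(a_after, b)
```

In this sketch the incremental value agrees with the full recalculation up to floating-point rounding, and the cost of the update depends on the number of changed entries rather than on the length of the vectors.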
Original language: English
Title of host publication: Database and Expert Systems Applications
Number of pages: 12
Publication status: Published - 2012

