Sentiment Analysis with a Multilingual Pipeline

DM Bal, MA Bal, A v. Bunningen, Alexander Hogenboom, Frederik Hogenboom, Flavius Frasincar

Research output: Chapter/Conference proceeding › Conference proceeding › Academic › peer-review

12 Citations (Scopus)

Abstract

Sentiment analysis refers to retrieving an author's sentiment from a text. We analyze the differences that occur in sentiment scoring across languages. We present our experiments for the Dutch and English languages, based on forum, blog, news, and social media texts available on the Web, focusing on differences in language use and on the effect of a language's grammar on sentiment analysis. We propose a multilingual pipeline for evaluating how an author's sentiment is conveyed in different languages. We correctly classify positive and negative texts with an accuracy of approximately 71% for English and 79% for Dutch. The evaluation of the results shows, however, that the use of common expressions, emoticons, slang, irony, sarcasm, cynicism, and acronyms, as well as different ways of negation in English, prevents the underlying sentiment scores from being directly comparable.
Original language: English
Title of host publication: Web Information System Engineering
Editors: A. Bouguettaya, M. Hauswirth, L. Liu
Place of Publication: Sydney, Australia
Publisher: Springer-Verlag
Pages: 129-142
Number of pages: 14
Volume: 6997
Publication status: Published - 13 Oct 2011
