Leveraging hierarchical language models for aspect-based sentiment analysis on financial data

Matteo Lengkeek, Finn van der Knaap*, Flavius Frasincar

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

23 Citations (Scopus)
17 Downloads (Pure)

Abstract

Every day millions of news articles and (micro)blogs that contain financial information are posted online. These documents often include insightful financial aspects with associated sentiments. In this paper, we predict financial aspect classes and their corresponding polarities (sentiment) within sentences. We use data from the Financial Question & Answering (FiQA) challenge, more precisely the aspect-based financial sentiment analysis task. We incorporate the hierarchical structure of the data by using the parent aspect class predictions to improve the child aspect class prediction (two-step model). Furthermore, we incorporate model output from the child aspect class prediction when predicting the polarity. We improve the F1 score by 7.6% using the two-step model for aspect classification over direct aspect classification in the test set. Furthermore, we improve the state-of-the-art test F1 score of the original aspect classification challenge from 0.46 to 0.70. The model that incorporates output from the child aspect classification performs up to par in polarity classification with our plain RoBERTa model. In addition, our plain RoBERTa model outperforms all the state-of-the-art models, lowering the MSE score by at least 28% and 33% for the cross-validation set and the test set, respectively.

Original languageEnglish
Article number103435
JournalInformation Processing and Management
Volume60
Issue number5
DOIs
Publication statusPublished - Sept 2023

Bibliographical note

Publisher Copyright:
© 2023 The Author(s)

Fingerprint

Dive into the research topics of 'Leveraging hierarchical language models for aspect-based sentiment analysis on financial data'. Together they form a unique fingerprint.

Cite this