Anthology of Computers and the Humanities · Volume 3

Continuous sentiment scores for literary and multilingual contexts

Laurits Lyngbaek1 ORCID , Pascale Feldkamp1 ORCID , Yuri Bizzoni1 ORCID , Kristoffer L. Nielbo1 ORCID and Kenneth Enevoldsen1 ORCID

  • 1 Center for Humanities Computing, Aarhus University, Aarhus, Denmark

Permanent Link: https://doi.org/10.63744/nVu1Zq5gRkuD

Published: 21 November 2025

Keywords: sentiment analysis, computational literary studies, historical texts, semantic embeddings

Abstract

Sentiment Analysis is widely used to quantify sentiment in text, but its application to literary texts poses unique challenges due to figurative language, stylistic ambiguity, as well as sentiment evocation strategies. Traditional dictionary-based tools tend to underperform, especially for low-resource languages, and transformer models, while promising, output coarse categorical labels that limit fine-grained analysis. We introduce a novel continuous sentiment scoring method based on concept vector projection, trained on multilingual literary data, which captures nuanced sentiment expressions across genres, languages, and historical periods. Our approach outperforms existing tools on English and Danish texts, producing sentiment scores which distribution matches human ratings, improving sentiment arc modeling and analysis in literature.