Abstract
Serendipity in recommender systems (RSs) has attracted increasing attention
as a concept that enhances user satisfaction by presenting unexpected and
useful items. However, evaluating serendipity performance remains challenging
because the ground truth of serendipity is generally unobservable. Existing
offline metrics often rely on ambiguous definitions or are tailored to specific
datasets and RSs, which limits their generalizability. To address this
issue, we propose a universally applicable evaluation framework that leverages
large language models (LLMs), known for their extensive knowledge and reasoning
capabilities, as evaluators. First, to improve the evaluation performance of
the proposed framework, we assessed the serendipity prediction accuracy of LLMs
using four different prompting strategies on a dataset with user-annotated
ground-truth serendipity labels, and found that the chain-of-thought prompt
achieved the highest accuracy. Next, we re-evaluated the serendipity performance of
both serendipity-oriented and general RSs using the proposed framework on three
commonly used real-world datasets, without ground-truth labels. The results
indicated that no serendipity-oriented RS consistently outperformed the others
across all datasets, and that a general RS sometimes achieved higher serendipity
performance than the serendipity-oriented RSs.
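As a rough illustration of the LLM-as-evaluator idea summarized above (a minimal sketch, not the authors' actual implementation), the Python snippet below shows how a chain-of-thought prompt might ask an LLM to judge whether a recommended item is serendipitous given a user's interaction history. The names build_cot_prompt, call_llm, and judge_serendipity are hypothetical, and call_llm is a placeholder for whichever LLM client is actually used.

```python
from typing import List


def build_cot_prompt(history: List[str], recommended_item: str) -> str:
    """Assemble a chain-of-thought style prompt for a serendipity judgment."""
    history_text = "\n".join(f"- {title}" for title in history)
    return (
        "You are evaluating a recommender system.\n"
        f"The user has interacted with the following items:\n{history_text}\n\n"
        f"The system recommended: {recommended_item}\n\n"
        "Think step by step: (1) Is this item relevant to the user's interests? "
        "(2) Is it unexpected given the interaction history? "
        "Then answer on a final line with 'Serendipitous: yes' or 'Serendipitous: no'."
    )


def call_llm(prompt: str) -> str:
    """Placeholder for an actual LLM API call (e.g., a chat-completion client)."""
    raise NotImplementedError


def judge_serendipity(history: List[str], recommended_item: str) -> bool:
    """Return True if the LLM labels the recommendation as serendipitous."""
    response = call_llm(build_cot_prompt(history, recommended_item))
    return "serendipitous: yes" in response.lower()
```

In a full evaluation, such binary judgments would be aggregated over many user-item pairs to score an RS's serendipity performance without relying on ground-truth labels.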