Abstract Content platforms typically engage with their users through small recommendation sets of items drawn from an extensive catalog. These sets are curated using machine-learned models optimized to present choices most likely to align with user preferences. We present surprising findings about such platforms. Even with complete information on user preferences within sets of up to k items, these models can only predict preferences within a “quadratic horizon” of k2 items and might fail to identify the best items in larger sets. To illustrate, we present striking examples where a platform interacting with users through small item sets, despite knowing that one item is favored by millions of users, cannot identify this item with better than random chance. Through both theoretical analysis and studies across various datasets, we demonstrate that “hidden gems”, items preferred by many users but invisible to platforms, exist in real-world datasets of moderate size, highlighting a significant gap in current recommendation platforms.
Building similarity graph...
Analyzing shared references across papers
Loading...
Flavio Chierichetti
M. Giacchini
R Dorai Kumar
PNAS Nexus
Sapienza University of Rome
Google (United States)
Building similarity graph...
Analyzing shared references across papers
Loading...
Chierichetti et al. (Thu,) studied this question.
www.synapsesocial.com/papers/69e320cc40886becb653fee6 — DOI: https://doi.org/10.1093/pnasnexus/pgag122