September 26, 2022

複数の方策を用いた深層強化学習に基づく推薦システム

Key Points

Key points are not available for this paper at this time.

Abstract

深層強化学習（DRL）に基づく推薦システムは、ユーザーのコールドスタート問題に適しており、段階的にユーザーの嗜好を捉えることができます。しかし、多くの既存のDRLベースの推薦システムは、異なるユーザーの動的特性に対応するために同一の方策を使用しているため、最適とは言えません。我々は推薦を多タスクマルコフ決定過程として再定式化し、各タスクは類似したユーザー群を表します。類似ユーザーはより近い動的特性を持つため、単一の普遍的方策よりもタスク固有の方策が効果的です。コールドスタートユーザーへの推薦では、まずデフォルト方策を用いて初期のやり取りを収集し、ユーザーのタスクを特定した後にタスク固有の方策を適用します。我々のフレームワークの最適化にはQ学習を用い、タスクに関する相互情報量によりタスク不確実性を考慮します。提案フレームワークの有効性を確認するために、三つの実世界データセットで実験を行いました。

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Mingsheng Fu

Liwei Huang

Ananya Rao

Journals

IEEE Transactions on Industrial Informatics

Actions

Institutions

Nanyang Technological University

University of Electronic Science and Technology of China

University of Macau

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

複数の方策を用いた深層強化学習に基づく推薦システム

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider