March 3, 2026Open Access

Fuzzy Clustering-Induced Switching Reinforcement Learning with Deterministic Annealing

Key Points

The proposed model achieves more efficient Q-learning by leveraging fuzzy clustering techniques with multiple agents.
Fuzzy c-Means is utilized to compute memberships, improving Q-value updates for parallel learning in distinct environments.
By incorporating deterministic annealing, the model enhances robustness and maximizes the gains of agents during learning.
This approach may enable better adaptability of agents to varying environmental complexities, optimizing reinforcement learning outcomes.

Abstract

強化学習（Q-learning）の効率的な学習法として，複数のエージェントが同時並列に試行しながら協調的にQテーブルを更新するアプローチが提案されている．本研究では，複数のエージェントがいくつかの異なる環境下で問題を解いている状態で，個々のエージェントがどの環境で問題を解いているか分からないとして，エージェントのクラスタリングとクラスターごとのQ-learningを同時分析することで，スイッチング強化学習モデルを提案する．クラスターごとの方策に基づく獲得利得をクラスタリング基準としてFuzzy c-Means（FCM）法に倣ったファジィメンバシップを算出し，メンバシップの重み付きでQ値の更新を行うことで，環境ごとのQテーブルを並列的に学習する．また，分割のファジィ度の決定論的アニーリングを導入することで，ロバストなモデル推定と獲得利得の最大化を合わせて実現する．

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Katsuhiro Honda

Taimu Yaotome

Seiki Ubukata

Journals

Journal of Japan Society for Fuzzy Theory and Intelligent Informatics

Actions

Institutions

Osaka Metropolitan University

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Fuzzy Clustering-Induced Switching Reinforcement Learning with Deterministic Annealing

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study