March 19, 2024Open Access

Estimating Objective Weights of Pareto-Optimal Policies for Multi-Objective Sequential Decision-Making

Key Points

Key points are not available for this paper at this time.

Abstract

Sequential decision-making under multiple objective functions includes the problem of exhaustively searching for a Pareto-optimal policy and the problem of selecting a policy from the resulting set of Pareto-optimal policies based on the decision maker’s preferences. This paper focuses on the latter problem. In order to select a policy that reflects the decision maker’s preferences, it is necessary to order these policies, which is problematic because the decision-maker’s preferences are generally tacit knowledge. Furthermore, it is difficult to order them quantitatively. For this reason, conventional methods have mainly been used to elicit preferences through dialogue with decision-makers and through one-to-one comparisons. In contrast, this paper proposes a method based on inverse reinforcement learning to estimate the weight of each objective from the decision-making sequence. The estimated weights can be used to quantitatively evaluate the Pareto-optimal policies from the viewpoints of the decision-makers preferences. We applied the proposed method to the multi-objective reinforcement learning benchmark problem and verified its effectiveness as an elicitation method of weights for each objective function.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Akiko Ikenaga

Sachiyo Arai

Journals

Journal of Advanced Computational Intelligence and Intelligent Informatics

Actions

Institutions

Chiba University

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Estimating Objective Weights of Pareto-Optimal Policies for Multi-Objective Sequential Decision-Making

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study