Meta-Strategy Based on Multi-Armed Bandit Approach for Multi-Time Negotiation

KAWATA Ryohei, FUJITA Katsuhide

doi:10.1587/transinf.2020sap0003

<p>Multi-time negotiation which repeats negotiations many times under the same conditions is an important class of automated negotiation. We propose a meta-strategy that selects an agent's individual negotiation strategy for multi-time negotiation. Because the performance of the negotiating agents depends on situational parameters, such as the negotiation domains and the opponents, a suitable and effective individual strategy should be selected according to the negotiation situation. However, most existing agents negotiate based on only one negotiation policy: one bidding strategy, one acceptance strategy, and one opponent modeling method. Although the existing agents effectively negotiate in most situations, they do not work well in particular situations and their utilities are decreased. The proposed meta-strategy provides an effective negotiation strategy for the situation at the beginning of the negotiation. We model the meta-strategy as a multi-armed bandit problem that regards an individual negotiation strategy as a slot machine and utility of the agent as a reward. We implement the meta-strategy as the negotiating agents that use existing effective agents as the individual strategies. The experimental results demonstrate the effectiveness of our meta-strategy under various negotiation conditions. Additionally, the results indicate that the individual utilities of negotiating agents are influenced by the opponents' strategies, the profiles of the opponent and its own profiles.</p>

Meta-Strategy Based on Multi-Armed Bandit Approach for Multi-Time Negotiation

抄録

収録刊行物

参考文献 (22)*注記

関連プロジェクト

キーワード

詳細情報詳細情報について

書き出し

問題の指摘

Meta-Strategy Based on Multi-Armed Bandit Approach for Multi-Time Negotiation

抄録

収録刊行物

参考文献 (22)*注記

関連プロジェクト

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

参加プロジェクトリスト

詳細情報詳細情報について