研究業績リスト
ジャーナル論文 - rm_published_papers: Scientific Journal
Evaluating the Efficiency of Regulation in Matching Markets with Distributional Disparities
公開済 02/07/2025
Proceedings of the 26th ACM Conference on Economics and Computation, 94 - 94
ジャーナル論文 - rm_published_papers: Scientific Journal
Approximate State Abstraction for Markov Games
公開済 11/04/2025
Proceedings of the AAAI Conference on Artificial Intelligence, 39, 17, 17555 - 17563
This paper introduces state abstraction for two-player zero-sum Markov games (TZMGs) where the payoffs for the two players are determined by the state representing the environment and their respective actions, with state transitions following a Markov decision processes. For example, in games like soccer, the value of actions changes according to the state of play, we should describe them as Markov games. In TZMGs, the more the number of states becomes, the more difficult computing the equilibrium becomes. Therefore, we abstract the states of TZMGs and examine the performance. State abstraction reduces the number of states by treating multiple different states as a single state, and there is a substantial body of research on finding optimal policies for Markov decision processes using state abstraction. This study extends the state abstraction for MDPs to Markov games. In this case, the game with state abstraction may yield different equilibrium solutions from those of the ground game. To evaluate the equilibrium solutions of the game with state abstraction, we derived bounds on duality gap, which represents the distance from the equilibrium solutions of the ground game. Finally, we demonstrate our state abstraction with Markov Soccer, compute equilibrium policies, and examine the results.
ジャーナル論文 - rm_published_papers: Scientific Journal
Boosting Perturbed Gradient Ascent for Last-Iterate Convergence in Games,
公開済 04/2025
The Thirteenth International Conference on Learning Representations
ジャーナル論文 - rm_published_papers: Scientific Journal
Adaptively Perturbed Mirror Descent for Learning in Games
公開済 07/2024
Proceedings of the Forty-first International Conference on Machine Learning
ジャーナル論文 - rm_published_papers: International Conference Proceedings
Learning Fair Division from Bandit Feedback
公開済 05/2024
Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, 238, 3106 - 3114
ジャーナル論文 - rm_published_papers: Scientific Journal
二人零和ゲームにおける突然変異駆動型正則化先導者追従法の終極反復収束
公開済 05/2024
情報処理学会論文誌, 65, 5
ジャーナル論文 - rm_published_papers: Scientific Journal
Last-Iterate Convergence with Full and Noisy Feedback in Two-Player Zero-Sum Games
公開済 04/2023
Proceedings of The 26th International Conference on Artificial Intelligence and Statistics, 206, 7999 - 8028
ジャーナル論文 - rm_published_papers: Symposium
取り違えのある繰り返し囚人のジレンマにおける単独裏切-相互同期戦略
公開済 03/2023
情報処理学会第85回全国大会, 5B-03
ジャーナル論文 - rm_published_papers: Symposium
研修医配属における地域間格差を調整するための制約のモンテカルロ木探索
公開済 03/2023
情報処理学会第85回全国大会, 2T-08
ジャーナル論文 - rm_published_papers: Symposium
公開済 03/2023
情報処理学会第85回全国大会, 7S-07