Reward Shaping for Inference-Time Alignment: A Stackelberg Game Perspective
Jan 31, 2026 · Haichuan Wang, Tao Lin, Lingkai Kong, Ce Li, Hezi Jiang, Milind Tambe
Last updated on Jan 31, 2026
LLM, AI Alignment, Principal-Agent Problem