CSCE 631: Intelligent Agents — Computational Game Solving

Why This Course

Game theory is the mathematical foundation for reasoning about strategic interaction. This course covers the core theory—from Nash equilibria through CFR and game abstraction—and connects it to the design of autonomous agents.

Format

Intensive & Flexible

Five weeks, fully online asynchronous. Pre-recorded lectures, weekly deadlines, and active discussion boards. Designed for graduate students with summer schedules.

Assessment

Reproduce, Don’t Memorize

No exams. Two programming assignments grounded in published work, plus an individual research project worth 50%. Understanding is the bottleneck, not implementation.

Depth

Theory With Applications

Four weeks of classical game theory, one week of modern applications. The final week connects game-theoretic foundations to the design and evaluation of autonomous agents.

Research

Course → Lab

Strong projects can become research collaborations. The instructor’s group works on submodular optimization, game-theoretic monitoring, and action abstraction.

Weekly Schedule

Five weeks, roughly two modules of classical game theory per week, with a final module on autonomous agents and game theory.

Week	Dates	Topics	Content
1	May 26–30	Foundations of Game Theory	Normal-form games, Nash equilibrium, mixed strategies, maxmin, correlated equilibrium, dominated strategies. Notes
2	Jun 2–6	Computing Equilibria & Regret Minimization	Support enumeration, LP for zero-sum, external/internal/swap regret, MWU, RM, RM+, no-regret → CCE convergence. Notes
3	Jun 9–13	Extensive-Form Games & CFR	Game trees, information sets, subgame perfection, sequence form. Vanilla CFR, CFR+, Monte Carlo CFR. Notes
4	Jun 16–20	Game Abstraction & Poker	Information/action abstraction, safe subgame solving, Libratus & Pluribus. Notes
5	Jun 23–29	Advanced Topics & Autonomous Agents New	Deep CFR, agent architectures (ReAct, ToT, LATS), multi-agent debate as a game, red-teaming as zero-sum. Notes

The narrative arc: Define games → Solve games → Learn in games → Handle sequential games at scale → Apply the theory to autonomous agents.

Course Materials

Lecture slides, module notes, and programming assignments are linked below. Video recordings are hosted separately.

General

Syllabus (PDF) — Course policies, grading, university statements, key dates
Welcome Slides — Course overview, grading, project topics, TAMU API setup
Course Schedule (PDF) — Weekly lecture topics, assignment due dates, grade weights

Module Notes

Module 1 — Foundations of Game Theory (Lectures 1–4)
Module 2 — Computing Equilibria & Regret Minimization (Lectures 5–9)
Module 3 — Extensive-Form Games & CFR (Lectures 10–14)
Module 4 — Game Abstraction & Poker Case Studies (Lectures 15–17)
Module 5 — Advanced Topics & Autonomous Agents (Lectures 18, 5.1–5.3)

Programming Assignments

PA1 — Normal-Form Games: Utilities, Best Responses, and Equilibria
PA2 — Multi-Agent Debate with LLM Agents via the TAMU API
Course Project — Topic list, rubric, and submission guidelines

Programming Assignments

Two PAs grounded in specific published work. The implementation is the ticket to entry; the analysis is the assessment.

PA 1 Classical Due: Fri Jun 6 (Week 2)

Normal-Form Games: Utilities, Best Responses, and Equilibria

Support enumeration & Nash equilibrium verification

Implement core algorithms for 2-player normal-form games: expected utility, best response, Nash equilibrium verification, strictly dominated strategy elimination, and correlated equilibrium verification. Write a methods note reflecting on your design choices. Extra credit: compute a CE via linear programming. Download notebook.

PA 2 TAMU API Due: Sat Jun 27 (Week 5)

Multi-Agent Debate with LLMs

Du et al., “Improving Factuality and Reasoning in Language Models through Multiagent Debate” (2023)

Using the TAMU API, implement a multi-agent debate protocol. Run experiments varying agent count, round count, and model choice. Analyze through a game-theoretic lens: does debate converge, what are the failure modes, and how do information sets and equilibrium concepts help interpret behavior? $5/day budget. Download notebook.

Course Project

Apply a game-theoretic concept from this course to a multi-agent or human-agent interaction problem. Individual project, 50% of final grade. See the full project description for the 10 curated topics and rubric.

Example Topics

Build and evaluate a multi-agent debate system using regret minimization.
Implement a monitoring protocol for a coding agent and analyze its game-theoretic properties.
Design an auction mechanism for AI agents and measure equilibrium behavior.
Red-team an AI agent and formalize the attack as an exploitation strategy.
Compare CFR self-play to debate-style self-play on a structured task.

Deliverables

Proposal (1 page)	Jun 6 (Wk 2)
Code + Report (4 pages)	Jun 29 (Wk 5)

Reading List

Core papers and textbook chapters organized by week. Papers are provided as PDFs on Canvas.

Week 1: NF Games + Equilibria

Shoham & Leyton-Brown, Multiagent Systems, Ch. 3–4
Guo et al., “Game Theory Meets Large Language Models: A Systematic Survey” (IJCAI 2025) Framing

Week 2: Regret Minimization

Cesa-Bianchi & Lugosi, Prediction, Learning, and Games, Ch. 4

Week 3: Extensive-Form Games + CFR

Shoham & Leyton-Brown, Ch. 5
Zinkevich et al., “Regret Minimization in Games with Incomplete Information” (NeurIPS 2007)
Lanctot et al., “Monte Carlo Sampling for Regret Minimization” (NeurIPS 2009)

Week 4: Abstraction + Poker

Sandholm, “Abstraction for Solving Large Incomplete-Information Games” (AAAI 2015)
Brown & Sandholm, “Superhuman AI for Multiplayer Poker” (Science, 2019)

Week 5: Autonomous Agents + Game Theory

Yao et al., “ReAct: Synergizing Reasoning and Acting in Language Models” (ICLR 2023) PA2
Du et al., “Multiagent Debate” (2023) PA2
Anthropic, “Building Effective Agents” (blog, Dec 2024)
Greenblatt et al., “AI Control: Improving Safety Despite Intentional Subversion” (ICML 2024)
Shinn et al., “Reflexion” (NeurIPS 2023)

Logistics

Format

Online asynchronous. Pre-recorded lecture videos released at the start of each week. Weekly deadlines for assignments and discussion posts. Office hours via Zoom.

Grading

Component	Weight
PA1 (Normal-Form Games)	20%
PA2 (Multi-Agent Debate)	30%
Course Project	50%

No midterm. No final exam.

Late policy: 3 slip days total, usable in 24-hour increments (max 2 per assignment). Assignments submitted after slip days are exhausted will not be accepted.

All Deadlines

Item	Due
PA1 (Normal-Form Games)	Fri Jun 6
Project Proposal (1 page)	Fri Jun 6
PA2 (Multi-Agent Debate)	Sat Jun 27
Project Report + Code	Mon Jun 29

Prerequisites

CSCE 420 or CSCE 625, or instructor approval. Mathematical maturity (proofs, probability, optimization) expected.

Resources

Syllabus: PDF
Textbook: Shoham & Leyton-Brown, Multiagent Systems (free online)
Supplementary: Cesa-Bianchi & Lugosi, Prediction, Learning, and Games
API: TAMU API ($5/day; setup guide provided with PA2)
Computing: Python 3.x, Jupyter. No GPU required.

AI Tool Policy

You may use AI tools for writing code. All written analysis must be your own work. The analysis is the assessment—outsourcing it defeats the purpose.

Instructor

Alan Kuhnle

Assistant Professor, Computer Science & Engineering

Email: kuhnle at tamu.edu

Office Hours: By appointment via Zoom

Zoom: tamu.zoom.us/my/kuhnle

Research: submodular optimization, algorithmic game theory, autonomous agent design.