Societies of Agents: From Individual Intelligence to Intelligent Societies
My research centers on a fundamental transition now underway: intelligence is no longer confined to isolated models or tools,
but increasingly embodied in agents that act, learn, and influence one another within shared environments.
I frame this direction as Societies of Agents - systems in which multiple learning agents form structured,
adaptive, and value-driven societies rather than merely coexisting as independent components.
A three-layer view of Societies of Agents: infrastructure → agent foundation learning → societal-scale deployment(image generated by nana banana).
This research agenda unfolds across three interconnected layers:
Infrastructure for Agentic Learning
I develop training and execution frameworks that enable long-horizon learning, coordination, and adaptation among agents.
This layer focuses on how agents are systematically organized to support sustained interaction and collective behavior.
Domain-Specific Agent Foundation Learning
Building on this infrastructure, I study how datasets, post-training, reward shaping, and workflow design give rise to stable,
interpretable, and transferable agent behaviors. The central question here is not model capability, but how agents acquire
roles, responsibilities, and norms within a society.
Societal-Scale Multi-Agent Systems in the Real World
At the highest level, I explore how societies of agents operate in industrial and real-world settings, where cooperation and
competition coexist, and where governance, credit assignment, and value flow become central design challenges.
Across reinforcement learning, multi-agent systems, trust modeling, and governance, my work is guided by a single conviction:
the future of intelligence lies not in isolated agents, but in learning societies of agents.
Current Research Projects
My current projects span reinforcement learning, multi-agent systems, post-training, and real-world deployments.
Below is a structured snapshot aligned with the Societies of Agents agenda.
A. Agentic Infrastructure and Orchestration
Symphony: Decentralized multi-agent framework for large-scale coordination.
Workflow Optimization: Dynamic routing, planning, and scheduling for complex tasks.
Memory Evolve: Scalable memory systems enabling long-term learning.