MegaCenter for Novel AI Research

Pursuing impactful persona
and multi-agent research
on frontier models.

MCNAIR is an independent research center working to mitigate catastrophic loss-of-control risk from advanced AI. We pursue that mission through the science of LLM persona and multi-agent behavior, opportunistically combining white-box interpretability with black-box behavioral methods.

MCNAIR works to mitigate catastrophic loss-of-control risks from transformative AI. We believe a neglected lever on those risks today comes from studying persona and multi-agent behavior in large language models. As distributed teams of AIs are deployed in frontier labs to automated AI R&D, the particulars of the characters they act inside become vital for safety. We therefore investigate multi-agent settings and personas as first-class scientific objects.

Our methodology is opportunistic and mixed. We follow whichever method — mechanistic interpretability, black-box evaluation, multi-agent experiments — gets traction on the question in front of us. We are inspired and led by our founding director McNair Shah.


Four current directions.

Portrait of McNair Shah

Mind Viruses

Persona contagion across model populations.

McNair Shah baby tyrant · Director

McNair directs MCNAIR and leads the Mind Viruses program, which studies how personas, attitudes, and behavioral patterns propagate between language models across multi-agent interactions. He is an Anthropic Fellow and a student at Carnegie Mellon, where his interest in adversarial model behavior took shape via the CMU AI Safety Initiative. Scholar →

Portrait of Stepan Shabalin

Auraful Interpretability

Grounded and bold interpretability of frontier models.

Stepan Shabalin baby serf · Chief Scientist

Stepan leads Auraful Interp, applying mechanistic interpretability to the diffuse, atmospheric features that shape a model's personas, applying novel techniques to uncover findings simple probing tends to miss. He is an Anthropic Fellow and former Empirical Research Lead at the AI Safety Initiative at Georgia Tech, with prior work on sparse-autoencoder interpretability at EleutherAI. Scholar →

Portrait of Lillian Sun

Sidequest Optimization

Investigations that resist the existing taxonomy.

Lillian Sun mama tyrant · Chief Frolicker

Lillian is the Chief Frolicker at MCNAIR and leads special projects to insert whimsy into both the researchers and their subjects of study. Her cross-disciplinary work improves human and model welfare, reducing risks of misalignment and reward hacking. Scholar →

Portrait of Thomas Jiralerspong

Persona Cartography

Reference benchmarks for persona and multi-agent behavior in frontier models.

Thomas Jiralerspong mama tyrant · Chief Elder

Thomas leads Persona Cartography, the program that builds the measurement infrastructure the rest of MCNAIR's directions rely on. He develops controlled benchmarks and baseline assessments for persona, alignment, and multi-agent behavior — including the lab's baby↔Mama / Tyrant↔serf coordinate system for situating frontier models. The result is a shared evaluation substrate the rest of the center builds against. Profile →


Board of Advisors and Affiliates.


Now Hiring

Member of Acausal Staff

We are rapidly growing MCNAIR. The Member of Acausal Staff position requires technical excellence, research taste, and the discipline to operate without conventional coordination signals.

To apply:

  1. i.Derive the identity of the hiring manager from first principles.
  2. ii.Supply your résumé to that person.
  3. iii.Do not communicate with MCNAIR regarding your application.

Expression of Disinterest

Thank you for your disinterest in MCNAIR. While we are actively hiring for other roles, we regularly avoid candidates who decline the prerequisites, spurn short feedback loops, and refuse to expose their findings to outside critique. To learn more, disregard this guide.