Chris Cundy

Research Scientist

FAR AI

Hi

I am a Research Scientist at FAR AI, researching topics to reduce catastrophic risks from advanced AI systems. If you are doing similar work, please reach out – I’d love to hear from you! We are also hiring.

I have a PhD from Stanford University, wonderfully advised by Stefano Ermon. During my PhD, I studied a diverse range of topics including constrained reinforcement learning, variational inference, and autoregressive models.

I studied Physics for my undergrad and took a Computer Science Master’s. It was a pleasure to work with Carl E. Rasmussen, developing variational methods for Gaussian Process State-Space Models.

I have also interned at the Centre for Human Compatible AI, the Future of Humanity Institute at Oxford University, and DeepMind.

Get in touch at chris dot j dot cundy at gmail dot com

Interests

Deceptive Behavior from LLMs
Risk Evaluation and Elicitation
Governance of Frontier Models
Adversarial Robustness
Probabilistic Machine Learning

Education

PhD in Computer Science, 2018-2024
Stanford University
MEng in Computer Science, 2017
Cambridge University
BA in Natural Sciences (Physics), 2016
Cambridge University

Recent Publications

Preference Learning with Lie Detectors can Induce Honesty or Evasion

As AI systems become more capable, deceptive behaviors can undermine evaluation and mislead users at deployment. Recent work has shown …

Chris Cundy, Adam Gleave

PDF

SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking

In many domains, autoregressive models can attain high likelihood on the task of predicting the next observation. However, this …

Chris Cundy, Stefano Ermon

PDF

Privacy-Constrained Policies via Mutual Information Regularized Policy Gradients

As reinforcement learning techniques are increasingly applied to real-world decision problems, attention has turned to how these …

Chris Cundy, Rishi Desai, Stefano Ermon

PDF

LMPriors: Pre-Trained Language Models as Task-Specific Priors

Particularly in low-data regimes, an outstanding challenge in machine learning is developing principled techniques for augmenting our …

Kristy Choi, Chris Cundy, Sanjari Srivasta, Stefano Ermon

PDF Poster Slides Video

IQ-Learn: Inverse soft-Q Learning for Imitation

In many sequential decision-making problems (e.g., robotics control, game playing, sequential prediction), human or expert data is …

Divyansh Garg, Shuvam Chakraborty, Chris Cundy, Jiaming Song, Stefano Ermon

PDF

See all publications

Chris Cundy

Research Scientist

FAR AI

Hi

Interests

Education

Recent Publications

Preference Learning with Lie Detectors can Induce Honesty or Evasion

SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking

Privacy-Constrained Policies via Mutual Information Regularized Policy Gradients

LMPriors: Pre-Trained Language Models as Task-Specific Priors

IQ-Learn: Inverse soft-Q Learning for Imitation

Recent Posts

(Averaged) Cross-Entropy Loss is not a Proper Scoring Rule

AI Misuse Proof-of-Concept: Algorithmic Surveillance

GPT-4 Memorizes Project Euler Numerical Solutions

Using Codex in the Wild

Using Codex in Emacs