My research interests include LLM interpretability and alignment, Bayesian Theory of Mind, and, more broadly, human-centered CogSci AI.
Social Cognitive AI (SCAI) Lab