The Artificial General Intelligence Show
AI Alignment Landscape with Thomas Larsen (Center for AI Policy)
Show Notes
In this episode, Soroush explores crucial technical research directions in AI alignment with Thomas Larsen, the Director for Strategy at the Center for AI Policy in Washington, DC. Thomas, who spent approximately 75 hours in 2022 compiling an extensive overview of the technical alignment landscape, gives listeners an updated snapshot of the diverse research directions within the field.
The episode touches on a long list of research areas, including model splintering, out-of-distribution (OOD) detection, low impact measures, threat modeling, scaling laws, brain-like AI safety, adversarial training, and brain-machine interfaces (Neuralink).