Jordan Taylor

JordanTensor

AI & ML interests

Mechanistic interpretability, mechanistic anomaly detection, model-internals techniques, and AI safety techniques more generally.

Organizations

Mechanistic Anomaly Detection