Abdullah

amirali1985

AI & ML interests

Mechanistic interpretability, high-dimensional geometry, persona role-playing.

Recent Activity

updated a dataset about 4 hours ago
stride-influence/qwen-leakage-math-sweep
published a dataset about 4 hours ago
stride-influence/qwen-leakage-math-sweep
updated a model about 4 hours ago
stride-influence/stride-applications-models

Organizations

Thoughtworks, Apart Research, Martian, nlp-and-interpretability, Backdoors research, PhillipsLab, TailsResearch, Flocker AI, stride_influence, curveball-steering