Nikita Kezins
entfane
AI & ML interests
LLM post-training, adversarial training, safety, knowledge transfer
Recent Activity
updated a model about 18 hours ago
entfane/jailbreak-cot-lin-probe published a model about 18 hours ago
entfane/jailbreak-cot-lin-probe updated a model about 18 hours ago
entfane/jailbreak-input-lin-probe