Activation Steering With Mean Response Probes: A Case Study In Suppressing Sycophancy In Language Models During TTC (about 1 month ago)
Exploring Direct Tensor Manipulation in Language Models: A Case Study in Binary-Level Model Enhancement (Nov 7)