The Trojan Knowledge: Bypassing Commercial LLM Guardrails via Harmless Prompt Weaving and Adaptive Tree Search Paper • 2512.01353 • Published Dec 1, 2025 • 2
Measuring Physical-World Privacy Awareness of Large Language Models: An Evaluation Benchmark Paper • 2510.02356 • Published Sep 27, 2025 • 11