Add research attention analysis endpoint with real CodeGen tokenization 8f63685 gary-boon Claude committed on Nov 18
Add research attention analysis endpoints with Q/K/V extraction 37ed739 gary-boon Claude committed on Nov 13
Add Code Llama 7B support with hardware-aware filtering and ICL timeout fixes ed40a9a gary-boon Claude committed on Oct 30
Fix SWE-bench service to gracefully handle dataset loading failures ae9e159 gary-boon Claude committed on Sep 16
Consolidate HuggingFace deployment into security workflow 07be0bf gary-boon Claude committed on Sep 12
Add GitHub Action to deploy to both CPU and GPU HuggingFace Spaces 8b77dd5 gary-boon Claude committed on Sep 12
Fix: Correct layer hook output format for layer_norm compatibility 070f9b8 gary-boon Claude committed on Sep 1
Fix: Downgrade transformers to 4.36.2 for PyTorch 2.1.0 compatibility 97df962 gary-boon committed on Sep 1
Add GitHub Actions workflow for security scanning and automated deployment 0e48dc7 gary-boon committed on Aug 28
feat: Add pipeline analyzer and QKV extractor for transformer visualization 767a3fd gary-boon Claude committed on Aug 27
Add ablation support to model service with comprehensive testing bb8a292 gary-boon Claude committed on Aug 20