Integrate mistral-common for correct Devstral tokenization ed06dcb gary-boon Claude Opus 4.5 committed on 6 days ago
Use mistral_common for proper Devstral prompt formatting 3e80769 gary-boon Claude Opus 4.5 committed on 6 days ago
Add system prompt support for instruction-tuned models 2860768 gary-boon Claude Opus 4.5 committed on 6 days ago
fix: Simpler prompt format and temperature=0 for Devstral 76020ee gary-boon Claude Opus 4.5 committed on 6 days ago
fix: Sanitize JSON response for NaN/Inf float values 99f6209 gary-boon Claude Opus 4.5 committed on 6 days ago
fix: Check chat_template is set before using apply_chat_template 474927d gary-boon Claude Opus 4.5 committed on 6 days ago
fix: Add chat template support for Devstral instruct model 8d85da8 gary-boon Claude Opus 4.5 committed on 6 days ago
fix: Convert bfloat16 to float32 for numpy compatibility cb6f39c gary-boon Claude Opus 4.5 committed on 7 days ago
fix: Use eager attention for output_attentions support 5333b21 gary-boon Claude Opus 4.5 committed on 7 days ago
Add DEVICE env var to force CPU mode on DGX Spark 5f122aa gary-boon Claude Opus 4.5 committed on 7 days ago
Add research attention analysis endpoint with real CodeGen tokenization 8f63685 gary-boon Claude committed on Nov 18
Add research attention analysis endpoints with Q/K/V extraction 37ed739 gary-boon Claude committed on Nov 13
Add Code Llama 7B support with hardware-aware filtering and ICL timeout fixes ed40a9a gary-boon Claude committed on Oct 30
Fix: Correct layer hook output format for layer_norm compatibility 070f9b8 gary-boon Claude committed on Sep 1
feat: Add pipeline analyzer and QKV extractor for transformer visualization 767a3fd gary-boon Claude committed on Aug 27
Add ablation support to model service with comprehensive testing bb8a292 gary-boon Claude committed on Aug 20
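
Commit 99f6209 above mentions sanitizing a JSON response for NaN/Inf float values. The log does not show how the repository does this, so the following is only a minimal sketch of one common approach, assuming non-finite floats are replaced with null before serialization. The helper name `sanitize_floats` and the sample payload are hypothetical and not taken from the repository.

```python
import json
import math


def sanitize_floats(value):
    """Recursively replace NaN and +/-Inf floats with None (hypothetical helper)."""
    if isinstance(value, float):
        # math.isfinite is False for NaN, +Inf and -Inf
        return value if math.isfinite(value) else None
    if isinstance(value, dict):
        return {key: sanitize_floats(item) for key, item in value.items()}
    if isinstance(value, (list, tuple)):
        return [sanitize_floats(item) for item in value]
    return value


# Illustrative payload, not real model output
payload = {"scores": [0.5, float("nan")], "max": float("inf")}

# allow_nan=False makes json.dumps raise ValueError if a non-finite float slips through
encoded = json.dumps(sanitize_floats(payload), allow_nan=False)
print(encoded)
```

Passing `allow_nan=False` is a deliberate choice in this sketch: by default `json.dumps` emits `NaN` and `Infinity` tokens, which are not valid JSON and are rejected by strict parsers.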