inference-optimization/ctest-Qwen3.5-9B-subset-397-speculator.dflash 2B • Updated about 15 hours ago • 16
inference-optimization/ctest-Qwen3.5-9B-subset-397-speculator.dflash 2B • Updated about 15 hours ago • 16
inference-optimization/Qwen3-235B-A22B-Thinking-2507-quantized.w4a16 Text Generation • 32B • Updated about 18 hours ago • 156
inference-optimization/Qwen3-235B-A22B-Thinking-2507-quantized.w8a8 Text Generation • 235B • Updated about 18 hours ago • 150
inference-optimization/Qwen3-235B-A22B-Instruct-2507-quantized.w4a16 Text Generation • 32B • Updated about 18 hours ago • 135
RedHatAI/Qwen3-235B-A22B-Instruct-2507-quantized.w8a8 Text Generation • 235B • Updated about 19 hours ago • 76
inference-optimization/ctest-subset-Qwen3.5-397B-A17B-FP8-dynamic-speculator-dataset Viewer • Updated 2 days ago • 10k • 34
inference-optimization/ctest-subset-Qwen3.5-397B-A17B-FP8-dynamic-speculator-dataset Viewer • Updated 2 days ago • 10k • 34
inference-optimization/final-ctest-Qwen3-8B-speculator-dataset Viewer • Updated 8 days ago • 10k • 38
inference-optimization/final-ctest-Qwen3-8B-speculator-dataset Viewer • Updated 8 days ago • 10k • 38
inference-optimization/updated-ctest-Qwen3-8B-speculator-dataset Viewer • Updated 13 days ago • 10k • 50
inference-optimization/updated-ctest-Qwen3-8B-speculator-dataset Viewer • Updated 13 days ago • 10k • 50