-
Qwen/Qwen3.5-397B-A17B
Image-Text-to-Text • 403B • Updated • 1.76M • • 1.38k -
Qwen/Qwen3.5-397B-A17B-FP8
Image-Text-to-Text • 403B • Updated • 602k • 142 -
Qwen/Qwen3.5-122B-A10B
Image-Text-to-Text • 125B • Updated • 639k • • 454 -
Qwen/Qwen3.5-122B-A10B-FP8
Image-Text-to-Text • 125B • Updated • 772k • 73
Collections
Discover the best community collections!
Collections trending this week
-
nvidia/Nemotron-Cascade-2-30B-A3B
Text Generation • 32B • Updated • 19.7k • 244 -
nvidia/Nemotron-Cascade-2-RL-data
Viewer • Updated • 55.7k • 353 • 25 -
nvidia/Nemotron-Cascade-2-SFT-Data
Viewer • Updated • 15.9M • 3.73k • 25 -
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
Paper • 2603.19220 • Published • 54
-
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16
Text Generation • 124B • Updated • 116k • 290 -
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8
Text Generation • 124B • Updated • 690k • 193 -
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4
Text Generation • 67B • Updated • 869k • 205 -
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-Base-BF16
Text Generation • 124B • Updated • 9.59k • 21
-
unsloth/Qwen3.5-35B-A3B-GGUF
Image-Text-to-Text • 35B • Updated • 2.1M • 723 -
unsloth/Qwen3.5-9B-GGUF
Image-Text-to-Text • 9B • Updated • 1.31M • 403 -
unsloth/Qwen3.5-27B-GGUF
Image-Text-to-Text • 27B • Updated • 944k • 346 -
unsloth/Qwen3.5-122B-A10B-GGUF
Image-Text-to-Text • 122B • Updated • 522k • 211
-
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2
Image-Text-to-Text • 28B • Updated • 4.82k • 26 -
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF
Image-Text-to-Text • 27B • Updated • 33.8k • 90 -
Jackrong/Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled-v2
Image-Text-to-Text • 10B • Updated • 23.1k • 109 -
Jackrong/Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF
Image-Text-to-Text • 9B • Updated • 43.9k • 115
-
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled
Image-Text-to-Text • 28B • Updated • 164k • 1.15k -
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-GGUF
Image-Text-to-Text • 27B • Updated • 461k • 333 -
Jackrong/Qwen3.5-35B-A3B-Claude-4.6-Opus-Reasoning-Distilled
Text Generation • 36B • Updated • 4.22k • 51 -
Jackrong/Qwen3.5-4B-Claude-4.6-Opus-Reasoning-Distilled
Text Generation • 5B • Updated • 6.01k • 19
-
meta-llama/Llama-3.2-1B
Text Generation • 1B • Updated • 1.98M • 2.34k -
meta-llama/Llama-3.2-1B-Instruct
Text Generation • 1B • Updated • 4.2M • • 1.33k -
meta-llama/Llama-3.2-3B-Instruct
Text Generation • 3B • Updated • 6.27M • • 2.06k -
meta-llama/Llama-3.2-3B
Text Generation • 3B • Updated • 1.23M • 707
-
facebook/dinov3-vit7b16-pretrain-lvd1689m
Image Feature Extraction • 7B • Updated • 31.8k • 218 -
facebook/dinov3-vits16-pretrain-lvd1689m
Image Feature Extraction • 21.6M • Updated • 150k • 76 -
facebook/dinov3-convnext-small-pretrain-lvd1689m
Image Feature Extraction • 49.5M • Updated • 24.5k • 22 -
facebook/dinov3-vitb16-pretrain-lvd1689m
Image Feature Extraction • 85.7M • Updated • 904k • 110
-
Qwen/Qwen3.5-397B-A17B
Image-Text-to-Text • 403B • Updated • 1.76M • • 1.38k -
Qwen/Qwen3.5-397B-A17B-FP8
Image-Text-to-Text • 403B • Updated • 602k • 142 -
Qwen/Qwen3.5-122B-A10B
Image-Text-to-Text • 125B • Updated • 639k • • 454 -
Qwen/Qwen3.5-122B-A10B-FP8
Image-Text-to-Text • 125B • Updated • 772k • 73
-
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2
Image-Text-to-Text • 28B • Updated • 4.82k • 26 -
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF
Image-Text-to-Text • 27B • Updated • 33.8k • 90 -
Jackrong/Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled-v2
Image-Text-to-Text • 10B • Updated • 23.1k • 109 -
Jackrong/Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF
Image-Text-to-Text • 9B • Updated • 43.9k • 115
-
nvidia/Nemotron-Cascade-2-30B-A3B
Text Generation • 32B • Updated • 19.7k • 244 -
nvidia/Nemotron-Cascade-2-RL-data
Viewer • Updated • 55.7k • 353 • 25 -
nvidia/Nemotron-Cascade-2-SFT-Data
Viewer • Updated • 15.9M • 3.73k • 25 -
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
Paper • 2603.19220 • Published • 54
-
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled
Image-Text-to-Text • 28B • Updated • 164k • 1.15k -
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-GGUF
Image-Text-to-Text • 27B • Updated • 461k • 333 -
Jackrong/Qwen3.5-35B-A3B-Claude-4.6-Opus-Reasoning-Distilled
Text Generation • 36B • Updated • 4.22k • 51 -
Jackrong/Qwen3.5-4B-Claude-4.6-Opus-Reasoning-Distilled
Text Generation • 5B • Updated • 6.01k • 19
-
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16
Text Generation • 124B • Updated • 116k • 290 -
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8
Text Generation • 124B • Updated • 690k • 193 -
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4
Text Generation • 67B • Updated • 869k • 205 -
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-Base-BF16
Text Generation • 124B • Updated • 9.59k • 21
-
unsloth/Qwen3.5-35B-A3B-GGUF
Image-Text-to-Text • 35B • Updated • 2.1M • 723 -
unsloth/Qwen3.5-9B-GGUF
Image-Text-to-Text • 9B • Updated • 1.31M • 403 -
unsloth/Qwen3.5-27B-GGUF
Image-Text-to-Text • 27B • Updated • 944k • 346 -
unsloth/Qwen3.5-122B-A10B-GGUF
Image-Text-to-Text • 122B • Updated • 522k • 211
-
meta-llama/Llama-3.2-1B
Text Generation • 1B • Updated • 1.98M • 2.34k -
meta-llama/Llama-3.2-1B-Instruct
Text Generation • 1B • Updated • 4.2M • • 1.33k -
meta-llama/Llama-3.2-3B-Instruct
Text Generation • 3B • Updated • 6.27M • • 2.06k -
meta-llama/Llama-3.2-3B
Text Generation • 3B • Updated • 1.23M • 707
-
facebook/dinov3-vit7b16-pretrain-lvd1689m
Image Feature Extraction • 7B • Updated • 31.8k • 218 -
facebook/dinov3-vits16-pretrain-lvd1689m
Image Feature Extraction • 21.6M • Updated • 150k • 76 -
facebook/dinov3-convnext-small-pretrain-lvd1689m
Image Feature Extraction • 49.5M • Updated • 24.5k • 22 -
facebook/dinov3-vitb16-pretrain-lvd1689m
Image Feature Extraction • 85.7M • Updated • 904k • 110