-
Qwen/Qwen3-Reranker-0.6B
Text Ranking • 0.6B • Updated • 521k • 281 -
jinaai/jina-reranker-m0
Text Classification • 2B • Updated • 41.3k • 114 -
jinaai/jina-reranker-v2-base-multilingual
Text Ranking • 0.3B • Updated • 289k • 334 -
jinaai/jina-embeddings-v2-base-en
Feature Extraction • 0.1B • Updated • 181k • 730
Bjorn Melin
BjornMelin
AI & ML interests
Large Language Models, AI Agents, Multi-Agent Orchestrations, Deep Learning, NLP, Local LLM Optimization.
Recent Activity
updated
a collection
about 2 months ago
Rerankers
liked
a model
about 2 months ago
zeroentropy/zerank-2
liked
a model
about 2 months ago
cerebras/MiniMax-M2-REAP-162B-A10B
Organizations
None yet
Datasets
Fine Tuning
-
Running60
GGUF Model VRAM Calculator
📈60Calculate VRAM requirements for LLM models
-
Running on CPU UpgradeFeatured993
Model Memory Utility
🚀993Calculate vRAM needed for model training and inference
-
RunningFeatured1.03k
Can You Run It? LLM version
🚀1.03kDetermine GPU requirements for running large language models
Legendary VL Models
Smol Models
My favorite smaller models under 10B parameters.
-
unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF
Text Generation • 8B • Updated • 37.8k • 350 -
nvidia/Llama-3.1-Nemotron-Nano-8B-v1
Text Generation • 8B • Updated • 7.32k • • 215 -
deepseek-ai/DeepSeek-R1-Distill-Llama-8B
Text Generation • 8B • Updated • 940k • • 835 -
Qwen/Qwen2.5-Coder-7B-Instruct
Text Generation • 8B • Updated • 462k • • 585
Llama
-
MaziyarPanahi/Llama-3.2-3B-Instruct-GGUF
Text Generation • 3B • Updated • 150k • 13 -
meta-llama/Llama-3.2-3B-Instruct
Text Generation • 3B • Updated • 1.98M • • 1.91k -
meta-llama/Llama-3.1-8B-Instruct
Text Generation • 8B • Updated • 10.8M • • 5.22k -
MaziyarPanahi/Meta-Llama-3.1-8B-Instruct-GGUF
Text Generation • 8B • Updated • 150k • 32
LLMs
-
deepseek-ai/DeepSeek-V3
Text Generation • 685B • Updated • 801k • • 4.01k -
sentence-transformers/static-retrieval-mrl-en-v1
Sentence Similarity • Updated • 52 -
internlm/internlm3-8b-instruct
Text Generation • 9B • Updated • 10.1k • 228 -
NovaSky-AI/Sky-T1-32B-Preview
Text Generation • 33B • Updated • 109 • • 550
Embedding Models
Single 4090 Laptop GPU
-
nvidia/OpenReasoning-Nemotron-32B
Text Generation • 33B • Updated • 298 • • 121 -
Qwen/Qwen3-32B-AWQ
Text Generation • 33B • Updated • 76.3k • 119 -
OpenHands/openhands-lm-32b-v0.1
Text Generation • 33B • Updated • 239 • • 393 -
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
Text Generation • 15B • Updated • 151k • • 595
Leaderboards
-
RunningFeatured140
smolagents LLM leaderboard
🏆140A leaderboard for LLMs powering smolagents
-
RunningFeatured430
LLM Performance Leaderboard
🐨430View LLM performance rankings
-
RunningFeatured191
Low-bit Quantized Open LLM Leaderboard
🏆191Track, rank and evaluate open LLMs and chatbots
-
Running1.37k
UGI Leaderboard
📢1.37kUncensored General Intelligence Leaderboard
Coding Models
Google
Qwen
Rerankers
-
Qwen/Qwen3-Reranker-0.6B
Text Ranking • 0.6B • Updated • 521k • 281 -
jinaai/jina-reranker-m0
Text Classification • 2B • Updated • 41.3k • 114 -
jinaai/jina-reranker-v2-base-multilingual
Text Ranking • 0.3B • Updated • 289k • 334 -
jinaai/jina-embeddings-v2-base-en
Feature Extraction • 0.1B • Updated • 181k • 730
Embedding Models
Datasets
Single 4090 Laptop GPU
-
nvidia/OpenReasoning-Nemotron-32B
Text Generation • 33B • Updated • 298 • • 121 -
Qwen/Qwen3-32B-AWQ
Text Generation • 33B • Updated • 76.3k • 119 -
OpenHands/openhands-lm-32b-v0.1
Text Generation • 33B • Updated • 239 • • 393 -
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
Text Generation • 15B • Updated • 151k • • 595
Fine Tuning
-
Running60
GGUF Model VRAM Calculator
📈60Calculate VRAM requirements for LLM models
-
Running on CPU UpgradeFeatured993
Model Memory Utility
🚀993Calculate vRAM needed for model training and inference
-
RunningFeatured1.03k
Can You Run It? LLM version
🚀1.03kDetermine GPU requirements for running large language models
Leaderboards
-
RunningFeatured140
smolagents LLM leaderboard
🏆140A leaderboard for LLMs powering smolagents
-
RunningFeatured430
LLM Performance Leaderboard
🐨430View LLM performance rankings
-
RunningFeatured191
Low-bit Quantized Open LLM Leaderboard
🏆191Track, rank and evaluate open LLMs and chatbots
-
Running1.37k
UGI Leaderboard
📢1.37kUncensored General Intelligence Leaderboard
Legendary VL Models
Coding Models
Smol Models
My favorite smaller models under 10B parameters.
-
unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF
Text Generation • 8B • Updated • 37.8k • 350 -
nvidia/Llama-3.1-Nemotron-Nano-8B-v1
Text Generation • 8B • Updated • 7.32k • • 215 -
deepseek-ai/DeepSeek-R1-Distill-Llama-8B
Text Generation • 8B • Updated • 940k • • 835 -
Qwen/Qwen2.5-Coder-7B-Instruct
Text Generation • 8B • Updated • 462k • • 585
Google
Llama
-
MaziyarPanahi/Llama-3.2-3B-Instruct-GGUF
Text Generation • 3B • Updated • 150k • 13 -
meta-llama/Llama-3.2-3B-Instruct
Text Generation • 3B • Updated • 1.98M • • 1.91k -
meta-llama/Llama-3.1-8B-Instruct
Text Generation • 8B • Updated • 10.8M • • 5.22k -
MaziyarPanahi/Meta-Llama-3.1-8B-Instruct-GGUF
Text Generation • 8B • Updated • 150k • 32
Qwen
LLMs
-
deepseek-ai/DeepSeek-V3
Text Generation • 685B • Updated • 801k • • 4.01k -
sentence-transformers/static-retrieval-mrl-en-v1
Sentence Similarity • Updated • 52 -
internlm/internlm3-8b-instruct
Text Generation • 9B • Updated • 10.1k • 228 -
NovaSky-AI/Sky-T1-32B-Preview
Text Generation • 33B • Updated • 109 • • 550