Instructions to use deepseek-ai/DeepSeek-V3.2-Exp with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use deepseek-ai/DeepSeek-V3.2-Exp with Transformers:
    # Use a pipeline as a high-level helper
    from transformers import pipeline

    pipe = pipeline("text-generation", model="deepseek-ai/DeepSeek-V3.2-Exp")
    messages = [
        {"role": "user", "content": "Who are you?"},
    ]
    pipe(messages)

    # Load model directly
    from transformers import AutoModelForCausalLM

    model = AutoModelForCausalLM.from_pretrained("deepseek-ai/DeepSeek-V3.2-Exp", dtype="auto")
- Inference
- HuggingChat
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use deepseek-ai/DeepSeek-V3.2-Exp with vLLM:
Install from pip and serve model
    # Install vLLM from pip:
    pip install vllm

    # Start the vLLM server:
    vllm serve "deepseek-ai/DeepSeek-V3.2-Exp"

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:8000/v1/chat/completions" \
        -H "Content-Type: application/json" \
        --data '{
            "model": "deepseek-ai/DeepSeek-V3.2-Exp",
            "messages": [
                {
                    "role": "user",
                    "content": "What is the capital of France?"
                }
            ]
        }'
Use Docker
docker model run hf.co/deepseek-ai/DeepSeek-V3.2-Exp
- SGLang
How to use deepseek-ai/DeepSeek-V3.2-Exp with SGLang:
Install from pip and serve model
    # Install SGLang from pip:
    pip install sglang

    # Start the SGLang server:
    python3 -m sglang.launch_server \
        --model-path "deepseek-ai/DeepSeek-V3.2-Exp" \
        --host 0.0.0.0 \
        --port 30000

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:30000/v1/chat/completions" \
        -H "Content-Type: application/json" \
        --data '{
            "model": "deepseek-ai/DeepSeek-V3.2-Exp",
            "messages": [
                {
                    "role": "user",
                    "content": "What is the capital of France?"
                }
            ]
        }'
Use Docker images
    docker run --gpus all \
        --shm-size 32g \
        -p 30000:30000 \
        -v ~/.cache/huggingface:/root/.cache/huggingface \
        --env "HF_TOKEN=<secret>" \
        --ipc=host \
        lmsysorg/sglang:latest \
        python3 -m sglang.launch_server \
            --model-path "deepseek-ai/DeepSeek-V3.2-Exp" \
            --host 0.0.0.0 \
            --port 30000

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:30000/v1/chat/completions" \
        -H "Content-Type: application/json" \
        --data '{
            "model": "deepseek-ai/DeepSeek-V3.2-Exp",
            "messages": [
                {
                    "role": "user",
                    "content": "What is the capital of France?"
                }
            ]
        }'
- Docker Model Runner
How to use deepseek-ai/DeepSeek-V3.2-Exp with Docker Model Runner:
docker model run hf.co/deepseek-ai/DeepSeek-V3.2-Exp
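The curl calls shown for vLLM and SGLang can also be made from Python. The following is a minimal sketch using only the standard library; the helper names `build_chat_request` and `ask` are hypothetical, and the response parsing assumes the server returns the usual OpenAI-style chat completion body (`choices[0].message.content`).

```python
import json
import urllib.request

MODEL_ID = "deepseek-ai/DeepSeek-V3.2-Exp"


def build_chat_request(prompt, base_url="http://localhost:8000"):
    """Build a POST request for the OpenAI-compatible chat endpoint.

    base_url is an assumption: port 8000 matches the vLLM example,
    port 30000 matches the SGLang example.
    """
    payload = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(prompt, base_url="http://localhost:8000", timeout=60):
    """Send the prompt to a running server and return the reply text.

    Assumes an OpenAI-style response body; requires the server to be up.
    """
    request = build_chat_request(prompt, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    return body["choices"][0]["message"]["content"]
```

With a server running, `ask("What is the capital of France?")` targets the vLLM port, and `ask("What is the capital of France?", base_url="http://localhost:30000")` targets the SGLang port.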
Fix chat_template crash when assistant message omits the `content` key
#37 opened 21 days ago by qgallouedec
When will DeepSeek release a few more small models?
#36 opened 3 months ago by mimeng1990
Request: DOI
1
#35 opened 5 months ago by hahty
Added Evaluation Benchmarks to Metadata
#34 opened 6 months ago by mackenzietechdocs
Which tool-call-parser should be used when serving with vLLM?
#33 opened 6 months ago by xiaoheixiaohei
cool model
#31 opened 6 months ago by adamm-hf
Reproducibility inquiry
👍🚀 3
#30 opened 7 months ago by andresnowak
DeepSeek v3.2
3
#29 opened 7 months ago by Diene10
When deploying with SGLang, how can switching between thinking and non-thinking modes be supported?
2
#28 opened 7 months ago by verigle
Upload Cadient Revenue Radar 2026.xlsx
#26 opened 7 months ago by basisakai
Question: Why are the definitions related to max-model-len in config.json and tokenizer_config.json inconsistent?
#25 opened 7 months ago by foyoux
Request: DOI
#24 opened 7 months ago by xtolxy1
Is it possible to run inference on an A100 GPU?
2
#23 opened 8 months ago by Tony664
3.2 Exp 32b or distilled Qwen ?
1
#22 opened 8 months ago by guizpublic
DeepSeek-V3.2: latest comprehensive hands-on evaluation is out (300+ dimensions); you are welcome to join the group to discuss
#17 opened 8 months ago by JEIN
Question about long-context evaluation in DeepSeek-V3.2-Exp
1
#15 opened 8 months ago by fcMpKYz6Avp5QK
National Day deep work
➕🤗 5
#14 opened 8 months ago by fengyujian
Could the old DeepSeek V3.1 API endpoint be kept available permanently?
❤️👍 3
7
#10 opened 8 months ago by lixin4sky
Full Coverage Video of V3.2 - Step by Step
👍 2
#9 opened 8 months ago by fahdmirzac
The whale is back
❤️ 7
1
#8 opened 8 months ago by Nechintosh
How Much VRAM ?
5
#7 opened 8 months ago by Ni3SinghR
Transformers does not recognize this architecture
6
#6 opened 8 months ago by eva20150932-atlascloud
Context length
3
#5 opened 8 months ago by cheflee668
Did this model really have to be updated before National Day??
😔👍 113
31
#1 opened 8 months ago by luckjone