Supported Model List

| Model | Status | Model Download Link |
|-------|--------|---------------------|
| DeepSeek-V3 | Supported | DeepSeek-V3 |
| DeepSeek-R1 | Supported | DeepSeek-R1 |
| DeepSeek-R1 W8A8 | Supported | Deepseek-R1-W8A8 |
| Qwen2.5 | Supported | Qwen2.5-0.5B-Instruct, Qwen2.5-1.5B-Instruct, Qwen2.5-3B-Instruct, Qwen2.5-7B-Instruct, Qwen2.5-14B-Instruct, Qwen2.5-32B-Instruct, Qwen2.5-72B-Instruct |
| Qwen3-32B | Supported | Qwen3-32B |
| Qwen3-235B-A22B | Supported | Qwen3-235B-A22B |
| Qwen3, Qwen3-MOE | Testing | Qwen3-0.6B, Qwen3-1.7B, Qwen3-4B, Qwen3-8B, Qwen3-14B, Qwen3-30B-A3B |
| Qwen2.5-VL | Testing | Qwen2.5-VL-3B-Instruct, Qwen2.5-VL-7B-Instruct, Qwen2.5-VL-32B-Instruct, Qwen2.5-VL-72B-Instruct |
| QwQ-32B | Testing | QwQ-32B |
| Llama3.1 | Testing | Llama-3.1-8B-Instruct, Llama-3.1-70B-Instruct, Llama-3.1-405B-Instruct |
| Llama3.2 | Testing | Llama-3.2-1B-Instruct, Llama-3.2-3B-Instruct |

Note: To select the model backend, set the environment variable vLLM_MODEL_BACKEND. See the Environment Variable List for the accepted values.
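
As a minimal sketch of the note above, the variable can be set from Python before the serving framework is imported, so that it is visible when the backend is resolved. The helper name `select_model_backend` and the value `"MindFormers"` are illustrative assumptions, not taken from this page; check the Environment Variable List for the values your version accepts.

```python
import os

# Variable name as given in the note above.
BACKEND_ENV_VAR = "vLLM_MODEL_BACKEND"


def select_model_backend(backend: str) -> str:
    """Set the backend variable for the current process and return the value set.

    Hypothetical helper for illustration; it only writes to os.environ.
    """
    if not backend:
        raise ValueError("backend name must be a non-empty string")
    os.environ[BACKEND_ENV_VAR] = backend
    return os.environ[BACKEND_ENV_VAR]


# "MindFormers" is a placeholder value (assumption); replace it with a value
# listed in the Environment Variable List.
select_model_backend("MindFormers")
```

Setting the variable in the shell that launches the process has the same effect. In either case it should be in place before the model is loaded.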