How to use from
Lemonade
Pull the model
# Download Lemonade from https://lemonade-server.ai/
lemonade pull Jackrong/Qwen3.5-9B-DeepSeek-V4-Flash-MTP-GGUF:
Run and chat with the model
lemonade run user.Qwen3.5-9B-DeepSeek-V4-Flash-MTP-GGUF-
List all available models
lemonade list
Quick Links

Jackrong/Qwen3.5-9B-DeepSeek-V4-Flash-MTP-GGUF

Source model: Jackrong/Qwen3.5-9B-DeepSeek-V4-Flash MTP source fallback: unsloth/Qwen3.5-9B

Uploaded GGUF variants:

  • Qwen3.5-9B-DeepSeek-V4-Flash-MTP-Q2_K.gguf
  • Qwen3.5-9B-DeepSeek-V4-Flash-MTP-Q3_K_S.gguf
  • Qwen3.5-9B-DeepSeek-V4-Flash-MTP-Q3_K_M.gguf
  • Qwen3.5-9B-DeepSeek-V4-Flash-MTP-Q3_K_L.gguf
  • Qwen3.5-9B-DeepSeek-V4-Flash-MTP-IQ4_XS.gguf
  • Qwen3.5-9B-DeepSeek-V4-Flash-MTP-Q4_K_S.gguf
  • Qwen3.5-9B-DeepSeek-V4-Flash-MTP-Q4_K_M.gguf
  • Qwen3.5-9B-DeepSeek-V4-Flash-MTP-Q5_K_S.gguf
  • Qwen3.5-9B-DeepSeek-V4-Flash-MTP-Q5_K_M.gguf
  • Qwen3.5-9B-DeepSeek-V4-Flash-MTP-Q6_K.gguf
  • Qwen3.5-9B-DeepSeek-V4-Flash-MTP-Q8_0.gguf
  • Qwen3.5-9B-DeepSeek-V4-Flash-MTP-BF16.gguf

The conversion pipeline first verifies whether the source HF model already contains MTP tensors. If not, it extracts the MTP tensors from the matching unsloth base model and injects them into the safetensors index before GGUF conversion.

Downloads last month
15,071
GGUF
Model size
9B params
Architecture
qwen35
Hardware compatibility
Log In to add your hardware

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for Jackrong/Qwen3.5-9B-DeepSeek-V4-Flash-MTP-GGUF

Finetuned
Qwen/Qwen3.5-9B
Quantized
(9)
this model

Collection including Jackrong/Qwen3.5-9B-DeepSeek-V4-Flash-MTP-GGUF