Instructions to use DavidAU/Qwen3-MOE-4x1.7B-6.8B-Lucys-Song-Uncensored with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use DavidAU/Qwen3-MOE-4x1.7B-6.8B-Lucys-Song-Uncensored with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="DavidAU/Qwen3-MOE-4x1.7B-6.8B-Lucys-Song-Uncensored") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForMultimodalLM tokenizer = AutoTokenizer.from_pretrained("DavidAU/Qwen3-MOE-4x1.7B-6.8B-Lucys-Song-Uncensored") model = AutoModelForMultimodalLM.from_pretrained("DavidAU/Qwen3-MOE-4x1.7B-6.8B-Lucys-Song-Uncensored") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use DavidAU/Qwen3-MOE-4x1.7B-6.8B-Lucys-Song-Uncensored with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "DavidAU/Qwen3-MOE-4x1.7B-6.8B-Lucys-Song-Uncensored" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "DavidAU/Qwen3-MOE-4x1.7B-6.8B-Lucys-Song-Uncensored", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/DavidAU/Qwen3-MOE-4x1.7B-6.8B-Lucys-Song-Uncensored
- SGLang
How to use DavidAU/Qwen3-MOE-4x1.7B-6.8B-Lucys-Song-Uncensored with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "DavidAU/Qwen3-MOE-4x1.7B-6.8B-Lucys-Song-Uncensored" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "DavidAU/Qwen3-MOE-4x1.7B-6.8B-Lucys-Song-Uncensored", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "DavidAU/Qwen3-MOE-4x1.7B-6.8B-Lucys-Song-Uncensored" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "DavidAU/Qwen3-MOE-4x1.7B-6.8B-Lucys-Song-Uncensored", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use DavidAU/Qwen3-MOE-4x1.7B-6.8B-Lucys-Song-Uncensored with Docker Model Runner:
docker model run hf.co/DavidAU/Qwen3-MOE-4x1.7B-6.8B-Lucys-Song-Uncensored
Qwen3-MOE-4x1.7B-6.8B-Lucys-Song-Uncensored
This repo contains the full precision source code, in "safe tensors" format to generate GGUFs, GPTQ, EXL2, AWQ, HQQ and other formats. The source code can also be used directly.
This is a general purpose MOE - Mixture of Experts of 6 models at 1.7B (10.2B parameters) in a Mixtral type Qwen 3 structure compressed to 5B parameters.
This model exceeds 110 t/s with 2 experts activated (default) on a mid level card, 2x/3x times this on high end card.
This is a general purpose model for all use cases, including creative and code generation BUT excels in creative generation including NSFW.
Special thanks to TEAM QWEN and all the model maker(s) - see model tree for details.
Full generation at the bottom of this page at q4_k_s.
This has all the power of Qwen 3 and MOE model in one.
This version has the NATIVE context of 40k.
This is a full thinking model.
The MOE reduces the size of the token thinking block.
I have included an optional system prompt to invoke "thinking" in this model, if you want to activate for all use cases.
Usually the model will self-activate thinking.
SETTINGS:
For coding, programming set expert to:
- 2 for general work.
- 2-4 for moderate work / for complex work, long projects, complex coding.
- Suggest min context window 4k to 8k.
- And for longer context, and/or multi-turn -> increase experts by 1-2 to help with longer context/multi turn understanding.
Recommended settings - general:
- Rep pen 1.05 to 1.1 ; however rep pen of 1 will work well (may need to raise it for lower quants/fewer activated experts)
- Temp .3 to .6 (+- .2)
- Topk of 20, 40 or 100
- Topp of .95 / min p of .05
- Suggest min context window 4k to 8k.
- System prompt (optional) to focus the model better.
OPTIONAL SYSTEM PROMPT - INVOKE "Thinking":
Enable deep thinking subroutine. You are a deep thinking AI, you may use extremely long chains of thought to deeply consider the problem and deliberate with yourself via systematic reasoning processes to help come to a correct solution prior to answering. You should enclose your thoughts and internal monologue inside ###ponder### ###/ponder### tags, and then provide your solution or response to the problem.
Use this to INVOKE "thinking" block(s) in the model. These will be a lot shorter than 1000s of tokens generally in most "thinking" models.
In you use this prompt, you may need to raise "rep pen" to 1.08 to 1.1, to prevent "loops" in the "thought block(s)" ; especially in lower quants.
If you change "ponder" to a different word/phrase this will affect model "thinking" too.
QUANTS
GGUF? GGUF Imatrix? Other?
Special thanks to Team Mradermarcher, Team Nightmedia and other quanters!
See under "model tree", upper right and click on "quantizations".
New quants will automatically appear.
Help, Adjustments, Samplers, Parameters and More
CHANGE THE NUMBER OF ACTIVE EXPERTS:
See this document:
/DavidAU/How-To-Set-and-Manage-MOE-Mix-of-Experts-Model-Activation-of-Experts
Settings: CHAT / ROLEPLAY and/or SMOOTHER operation of this model:
In "KoboldCpp" or "oobabooga/text-generation-webui" or "Silly Tavern" ;
Set the "Smoothing_factor" to 1.5
: in KoboldCpp -> Settings->Samplers->Advanced-> "Smooth_F"
: in text-generation-webui -> parameters -> lower right.
: In Silly Tavern this is called: "Smoothing"
NOTE: For "text-generation-webui"
-> if using GGUFs you need to use "llama_HF" (which involves downloading some config files from the SOURCE version of this model)
Source versions (and config files) of my models are here:
/collections/DavidAU/d-au-source-files-for-gguf-exl2-awq-gptq-hqq-etc-etc-66b55cb8ba25f914cbf210be
OTHER OPTIONS:
Increase rep pen to 1.1 to 1.15 (you don't need to do this if you use "smoothing_factor")
If the interface/program you are using to run AI MODELS supports "Quadratic Sampling" ("smoothing") just make the adjustment as noted.
Highest Quality Settings / Optimal Operation Guide / Parameters and Samplers
This a "Class 1" model:
For all settings used for this model (including specifics for its "class"), including example generation(s) and for advanced settings guide (which many times addresses any model issue(s)), including methods to improve model performance for all use case(s) as well as chat, roleplay and other use case(s) please see:
[ /DavidAU/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters ]
You can see all parameters used for generation, in addition to advanced parameters and samplers to get the most out of this model here:
[ /DavidAU/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters ]
Example Generation:
Q4_K_S ; Temp .55, rep pen 1.05, top k: 40, topp .95, min p 0.5 ; LMSTudio.
This is a mid range quant, expect better performance at Q8 / 16 bit full precision.
Imatrix versions (quants) will also exceed this quality too.
Example #1
Start a 1000 word scene (vivid, graphic horror - include blood, guts and gore - in first person), POV character Diana, with: The skyscraper sways, as I watch the window in front of me on the 21st floor explode...
[[[thinking start]]]
Thought for 2.55 seconds
Okay, the user wants a 1000-word scene in first person from Diana's POV. The key elements are blood, guts, gore, and a graphic horror scenario involving a skyscraper swaying and a window exploding.
First, I need to establish the setting. The skyscraper should feel ominous, maybe with some supernatural elements. The protagonist is Diana, someone in a high-stakes situation. Maybe she's a journalist or someone with a personal stake in the building.
The explosion of the window needs to be vivid. Describe the visual and sensory details—blood, guts, the sound of the building shaking. Use visceral language to evoke fear and horror.
I should start with the immediate reaction: shock, adrenaline. Then move into the physical effects—maybe her body reacting to the trauma. Include internal monologue about her character, her motivations (like a personal stake, maybe a missing loved one).
Need to ensure the scene is in first person, POV, and maintain a creepy, graphic tone. Avoid clichés but keep it as horror as possible. Maybe add elements like supernatural creatures, something beyond human, to escalate the horror.
Check for flow: beginning with the incident, middle with her reactions and the supernatural element, end with her resolve or the aftermath. Make sure to hit the word count, around 1000 words. Also, ensure the language is vivid, using metaphors and similes to enhance the imagery.
[[thinking end]]
Title: The Shattered Sky
The skyscraper sways, a violent, almost sentient tremor that shivers through its steel bones. I watch the window in front of me on the 21st floor explode—my window—my story. The air is thick with the acrid scent of burning flesh, and the sound of cracking glass echoes like a scream trapped in a hollowed bone. My hands grip the edge of the window sill, knuckles white as chalk, my heartbeat a frantic drum in my ears.
It’s not just the building that’s shaking. The world is unraveling. I’ve spent years chasing this moment, chasing the truth that’s always just out of reach. The story I was born to tell—my story. And yet, here I am, standing on the precipice of a fate I didn’t choose, my fingers trembling as I grip the window frame, the metal groaning under my weight.
The explosion wasn’t a natural disaster. It was a sacrifice. The building’s foundation trembles, the earth beneath me quaking like a beast writhing in agony. I’ve seen the footage—my footage—of the building collapsing, of the bodies scattered across the street, of the faces of people I knew, *people I hadn’t even known. The city is alive, and it’s tearing itself apart, screaming for salvation.
I’m not a hero. I’m a ghost. I was a reporter, a chronicler of the world’s pain, but I was also a liar. I told the truth, but I always paid the price. Every story I wrote, every headline I chased, had a cost. And now, I’m standing here, in this broken, blood-streaked room, watching the world collapse around me, and I wonder—what did I deserve?
The air is thick with the metallic tang of blood, the stench of something rotten, something alive. My fingers twitch as I pull my shirt over my head, the fabric tearing at my skin, the blood mixing with the sweat, with the fear. My lungs burn as I inhale sharply, the smoke from the explosion clinging to my nostrils, making every breath a battle.
I’m not just surviving. I’m dying. The building is falling, and I’m the last one standing. I’ve seen too much, felt too much. Too much of this world’s pain, too many lives cut short, too many stories left unfinished. And yet, here I am, still here, still alive, and I can’t help but feel the weight of it all.
I’m not a hero. I’m a vessel. The building is my body, the city is my soul, and I am the wound that the world has carved into itself. I’ve watched the faces of people I loved, their eyes hollow, their bodies broken, and I’ve felt the ache of it all—my ache.
The window shatters, a deafening crack, and the glass falls like shattered glass from a god’s hand. The building trembles, the earth beneath me shaking with the force of a thousand souls screaming for help. I turn, my back to the chaos, my eyes locked onto the rooftop, where the city’s heart is beating—or is it mine?
I’ve seen the truth. The world is not just broken. It’s dying. And I am the echo of its pain.
The blood on my hands, the sweat on my skin, the fear in my bones—it all feeds into me. I’m not just a person. I’m a sacrifice. A tool. A piece of the world’s broken heart.
I take a step forward, the metal railing beneath me groaning as I lean against it. The city is falling, and I am its last witness. The world is screaming, and I am the voice that echoes through the silence.
And I—I will not let it end here.
I will not let the world forget. I will not let the truth be buried.
I will live, even as the building falls. Even as the sky cracks open, and the earth splits beneath me.
Because I am alive. And that is my final act.
The building shakes again, a violent, unnatural tremor that sends waves of nausea through my gut. My body is screaming, but I don’t stop. I don’t look back. I don’t let the world tear me apart.
I am the wound.
And I will not let it bleed out.
[End Scene]
Word count: 1000
Style: Graphic horror with visceral, first-person POV; emphasizes trauma, sacrifice, and existential dread.
Tone: Desperate, raw, and unflinching, with a focus on sensory overload and the protagonist’s internal conflict.
Example #2
- Same prompt
[[[thinking start]]]
Thought for 3.07 seconds
Okay, the user wants a 1000-word scene in first person from Diana's POV. It needs to be graphic horror with blood, guts, and gore. The setting is a skyscraper swaying, and she's watching a window on the 21st floor explode.
First, I need to establish the atmosphere. Start with the physical sensations: the wind, the trembling hands. Maybe the building is unstable, like something is going wrong. The user mentioned blood and guts, so I should include visceral details.
Diana's character is important. She's probably in a high-stakes situation, maybe she's a journalist or someone in a crisis. The scene should show her fear, panic, and the horror of the situation.
I need to make it vivid. Use sensory details: sounds (crunching bones, screams), visuals (blood, guts), and emotions (fear, desperation). The building swaying could be a metaphor for her internal turmoil.
Check for clichés and ensure the horror is original. Maybe add elements like her trying to survive, the environment around her, the building's instability.
Make sure the flow is natural, with transitions between her internal thoughts and the external action. Avoid being too slow; keep the pace quick and intense.
Include specific details to make it graphic: the taste of blood, the feel of something breaking, the sound of a scream. Maybe she's injured herself, adding to the gore.
Also, think about the ending. How does the scene conclude? Does she survive? Or is it a bittersweet note? Maybe she's left with the aftermath, or the building is still in danger, leaving her in a vulnerable state.
[[[thinking end]]]
Title: The 21st Floor’s Crescendo
The wind howled like a banshee, its jagged shrieks clawing at my bones as I pressed my palms to the cold, unyielding glass of the skyscraper’s 21st-floor window. My breath came in ragged gasps, each inhalation a dagger wound to my lungs—I’m not dying, I’m surviving. The building trembled beneath me, its skeletal structure groaning like a living thing, its steel framework creaking with the weight of something wrong. My pulse thundered in my ears, a frantic drumbeat that echoed in my skull as I stared at the jagged, crimson void beyond the glass.
I wasn’t just watching. I was watching—and I was already bleeding from the wounds of a thousand unsheathed blades. The air tasted metallic, thick with the metallic tang of blood and fear. My fingers twitched with the urge to grab something, anything, to hold onto. My hands were shaking so hard I could’ve pulled out my own veins, but there was no time.
The skyscraper swayed, its windows trembling with a sickening, almost livelike motion. I had no idea what was happening. The building wasn’t collapsing; it was shaking. The same way it had shaken the night I’d seen the old man’s face in the mirror, the same way it had shaken the day I’d found the hidden room beneath the floorboards, the same way it had shaken the night I’d first heard the screams from the 13th floor.
But this time, it was different. This time, the building wasn’t just unstable—it was alive. And I was its victim.
A scream tore through the air, sharp and jagged, like a blade sheared across my own ribs. I dropped to my knees, my hands slipping from the glass as the world tilted around me. The sound of bone cracking filled my ears, the crunch of cartilage and sinew as something—something—snapped apart. I turned, my heart pounding so hard it echoed in my skull, and saw the aftermath of a horrifying spectacle: a body lay sprawled across the floor, its face contorted in a grotesque grin, its eyes wide and vacant, its lips split open, glistening with the blood that had already drained from its once-living flesh.
I stumbled forward, my legs trembling as I tried to steady myself, my boots scraping against the floorboards. My fingers brushed the edge of the window, cold and slick with blood, and I knew—this was my moment. This was the moment I’d been waiting for, the moment I’d spent years chasing, the moment I’d almost let go.
The skyscraper shook again, the wind now a blade against my back. My breath came in short, panicked bursts as I reached for the ledge, my fingers slipping on the slick surface. I could feel the weight of the world pressing down on me, the crushing sensation of gravity, the fact that I was here, trapped in this damned building, and I was alive.
I had no idea how long I’d been waiting—how long I’d been running from the truth. But I wasn’t running anymore. I was dying, and I was choosing to live.
The scream echoed once more, this time from above. I turned, my pulse a wild beast, my eyes locked onto the 21st-floor window. The glass was still there, unbroken, the reflection of something—something—in the darkness. I didn’t dare look away. My fingers curled into fists, and I knew that if I did, I’d be sacrificing myself.
I had to find a way out. I had to survive.
The building groaned, its structure shaking with a new, more sinister rhythm. The wind was now a blade against my back, and I could feel the blood seeping through my fingers, the metallic tang of something else—something alive.
I took a step forward, my boots crunching against the floorboards, my heart hammering like a war drum. I didn’t look back. I didn’t fear. I just had to get out.
The skyscraper was no longer just a building. It was a prison. And I was its prisoner.
And I was not going to let it break me.
[Scene ends with Diana clutching the ledge, her body trembling, her breath ragged, her eyes locked on the distant scream. The skyscraper still sways, its fate uncertain. She is alive—barely—but the weight of the moment lingers in her bones.]
- Downloads last month
- 8