Gemma 3 4b It MAX NEO Imatrix GGUF

Developed by DavidAU
An extreme-quantization build of Google's Gemma-3 model, enhanced with NEO Imatrix technology, supporting a 128k context length and suited to a broad range of tasks
Downloads 2,558
Released: 3/14/2025

Model Overview

This is an instruction-tuned large language model calibrated with NEO Imatrix data; even at aggressive quantization levels it maintains strong generation quality and instruction-following capability.

Model Features

NEO Imatrix technology
Uses a proprietary imatrix calibration dataset to improve model performance, instruction-following, and output quality
Extreme quantization
Offers aggressively quantized variants while keeping the embeddings and output tensors at BF16 full precision, preserving generation quality at small file sizes
128k long context
Supports very long context windows, making it suitable for complex tasks and long-document generation
Multiple quantization versions
Provides quantization levels from IQ1 through Q8 to suit different hardware and usage scenarios
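To help choose among the quant levels, the rough file size of each variant can be estimated from its bits-per-weight. The figures below are approximations for common llama.cpp quant types (actual GGUF sizes vary with the quant mix and metadata), so treat this only as a sketch for gauging what fits your hardware:

```python
# Rough GGUF file-size estimates for a ~4B-parameter model at several
# quantization levels. Bits-per-weight values are approximate.

PARAMS = 4e9  # Gemma-3 4B parameter count (approximate)

# Approximate bits per weight for common llama.cpp quant types.
BITS_PER_WEIGHT = {
    "IQ1_S": 1.6,
    "Q4_K_M": 4.8,
    "Q6_K": 6.6,
    "Q8_0": 8.5,
}

def estimated_size_gb(quant: str) -> float:
    """Estimated model file size in GB for a given quant level."""
    return PARAMS * BITS_PER_WEIGHT[quant] / 8 / 1e9

for quant in BITS_PER_WEIGHT:
    print(f"{quant:>7}: ~{estimated_size_gb(quant):.1f} GB")
```

Lower quant levels (IQ1, IQ2) trade noticeable quality for size, while Q6/Q8 stay close to the full-precision model; the BF16 embeddings and output tensors are kept at full precision regardless of the quant level chosen.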

Model Capabilities

Text generation
Instruction following
Creative writing
Technical document generation
Dialogue systems

Use Cases

Creative writing
Horror scene generation
Generates vivid, detailed horror scene descriptions rich in tension and atmosphere, as shown in the model's published examples
Technical applications
Long document processing
Processes and analyzes long documents; the 128k context window allows an entire long document to fit in a single prompt
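As a sketch of how the long context might be exercised with llama.cpp's CLI, one could pass a document file and raise the context size; the model and input filenames here are hypothetical placeholders:

```shell
# Hypothetical filenames; -c sets the context size in tokens,
# -f prepends a file to the prompt.
./llama-cli \
  -m gemma-3-4b-it-max-neo-imatrix-Q4_K_M.gguf \
  -c 131072 \
  -f long_report.txt \
  -p "Summarize the key findings of the document above."
```

Note that a full 131072-token context requires substantially more memory for the KV cache than the model weights alone; smaller `-c` values are a reasonable starting point on constrained hardware.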