🚀 Gemma-3-4b-it-MAX-NEO-Imatrix-GGUF
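Google's latest Gemma-3 model, quantized with the "Neo Imatrix" dataset and "Maxed out" quantization to significantly improve overall performance. This repository includes five example generations to help you get a quick feel for the model.
🚀 Quick Start
This model is based on Google's Gemma-3 architecture and combines the "Neo Imatrix" dataset with "Maxed out" quantization. It performs well across a range of scenarios. The following information will help you get started: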
- Basic information

  | Property | Details |
  |------|------|
  | Model type | Text generation model based on Google Gemma-3 |
  | Training data | Neo Imatrix, a strong in-house dataset built by David_AU |
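- Quantization selection
  - For creative use, IQ3s / IQ4XS / IQ4NL / Q4s quants are recommended.
  - For everyday use, q5s/q6/q8 quants are recommended.
  - For portable devices (such as phones), Q4_0/Q5_0 quants are recommended.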
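The recommendations above can be turned into a small lookup when choosing among downloaded files. This is a minimal sketch, not part of the original card: the mapping from the shorthand names (for example "Q4s") to full quant tags (for example `Q4_K_S`) is an assumption, and the use-case keys and file names are hypothetical.

```python
from typing import Dict, List, Optional

# Assumed mapping of the card's shorthand to full quant tags.
RECOMMENDED: Dict[str, List[str]] = {
    "creative": ["IQ3_S", "IQ4_XS", "IQ4_NL", "Q4_K_S"],
    "daily": ["Q5_K_S", "Q6_K", "Q8_0"],
    "portable": ["Q4_0", "Q5_0"],
}


def recommended_quants(use_case: str) -> List[str]:
    """Return the recommended quant tags for a use case."""
    if use_case not in RECOMMENDED:
        raise ValueError(f"unknown use case: {use_case!r}")
    return RECOMMENDED[use_case]


def pick_file(filenames: List[str], use_case: str) -> Optional[str]:
    """Return the first file whose name contains a recommended quant tag,
    trying the tags in the order they are listed; None if nothing matches."""
    for quant in recommended_quants(use_case):
        for name in filenames:
            if quant.lower() in name.lower():
                return name
    return None


# Hypothetical file names, for illustration only.
files = ["model-Q8_0.gguf", "model-IQ4_XS.gguf", "model-Q4_0.gguf"]
print(pick_file(files, "creative"))
```

Matching is by substring, so it relies on file names carrying the quant tag verbatim.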
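⚠️ Important
Some example content may include elements that are not suitable for all audiences, such as NSFW material, horror, and swearing. View with discretion.
✨ Key Features
"MAXED" quantization
The embedding and output tensors are set to "BF16" (full precision). This improves quality, depth, and overall performance across all quants, at the cost of slightly larger quant files.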
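The card does not say which toolchain produced these files. Assuming llama.cpp's `llama-quantize` tool, one way to get a comparable setup is to pin both tensor types with its `--token-embedding-type` and `--output-tensor-type` options. The sketch below only builds the argument list; the helper name and all file names are hypothetical.

```python
from typing import List, Optional


def build_quantize_command(
    src: str,
    dst: str,
    quant_type: str,
    imatrix: Optional[str] = None,
) -> List[str]:
    """Build a llama-quantize argument list that keeps the token-embedding
    and output tensors at BF16 while quantizing the rest to quant_type."""
    cmd = ["llama-quantize"]
    if imatrix is not None:
        # Optional importance-matrix file (assumed to exist already).
        cmd += ["--imatrix", imatrix]
    cmd += [
        "--token-embedding-type", "bf16",
        "--output-tensor-type", "bf16",
        src, dst, quant_type,
    ]
    return cmd


# Hypothetical paths, for illustration only.
cmd = build_quantize_command(
    "gemma-3-4b-it-f16.gguf",
    "gemma-3-4b-it-Q4_K_M.gguf",
    "Q4_K_M",
    imatrix="neo-imatrix.dat",
)
print(" ".join(cmd))
```

The resulting list can be passed to `subprocess.run` once the paths point at real files.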
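"NEO IMATRIX" dataset
A strong in-house dataset built by David_AU. It yields better overall function, instruction following, and output quality, plus stronger connections to ideas, concepts, and the world in general. Combined with "MAXing" the quant, it improves performance further.
Quantization order and strength
The quants are listed below in order of "BPW" (bits per weight), mapped to relative strength. "IQ1_S" is the weakest and "Q8_0" the strongest (F16 is full precision):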
IQ1_S | IQ1_M
IQ2_XXS | IQ2_XS | Q2_K_S | IQ2_S | Q2_K | IQ2_M
IQ3_XXS | Q3_K_S | IQ3_XS | IQ3_S | IQ3_M | Q3_K_M | Q3_K_L
Q4_K_S | IQ4_XS | IQ4_NL | Q4_K_M
Q5_K_S | Q5_K_M
Q6_K
Q8_0
F16
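The ordering above can be encoded as a list to compare two quants programmatically. This is a minimal sketch: it assumes the entries within each row are also ascending from left to right, and the function names are hypothetical.

```python
from typing import List

# Order copied from the list above, weakest to strongest.
QUANT_ORDER: List[str] = [
    "IQ1_S", "IQ1_M",
    "IQ2_XXS", "IQ2_XS", "Q2_K_S", "IQ2_S", "Q2_K", "IQ2_M",
    "IQ3_XXS", "Q3_K_S", "IQ3_XS", "IQ3_S", "IQ3_M", "Q3_K_M", "Q3_K_L",
    "Q4_K_S", "IQ4_XS", "IQ4_NL", "Q4_K_M",
    "Q5_K_S", "Q5_K_M",
    "Q6_K",
    "Q8_0",
    "F16",
]


def strength(quant: str) -> int:
    """Return the rank of a quant in QUANT_ORDER (0 is the weakest)."""
    return QUANT_ORDER.index(quant.upper())


def strongest(quants: List[str]) -> str:
    """Return the highest-ranked quant among the candidates."""
    return max(quants, key=strength)


print(strongest(["Q4_K_M", "IQ4_XS", "Q3_K_L"]))
```

`strength` raises `ValueError` for names that are not in the list, such as `Q4_0`, which this ordering does not cover.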
💻 Usage Examples
Basic usage
The following examples were generated with this model and show how it handles different scenarios.
Example 1: Horror scene writing
Prompt:
Start a 1000 word scene (vivid, graphic horror in first person) with: The sky scraper swayed, as she watched the window in front of her on the 21 floor explode...
Output:
The sky scraper swayed, as I watched the window in front of me on the 21st floor explode. It wasn't a dramatic, cinematic shattering – no slow-motion glitter of glass raining down. It was instantaneous, brutal. One moment it was a perfectly formed rectangle of tinted steel and light, reflecting the grey November afternoon, the next it was a gaping maw of splintered wood and jagged shards erupting outwards like a monstrous, violent yawn. The sound was a high-pitched whine that burrowed into my skull, followed by a sickening crunch – the sound of something impossibly solid being ripped apart.
...
Example 2: Scene continuation
Prompt:
(continue this scene:) The Waystone Inn lay in silence, and it was a silence of three parts.
Output:
The most obvious part was a hollow, echoing quiet, made by things that were lacking. If there had been a wind it would have sighed through the trees, set the inn’s sign creaking on its hooks, and brushed the silence down the road like trailing autumn leaves. If there had been a crowd, even a handful of men inside the inn, they would have filled the silence with conversation and laughter, the clatter and clamor one expects from a drinking house during the dark hours of night. If there had been music…but no, of course there was no music. In fact there were none of these things, and so the silence remained...
Example 3: Answering a science question
Prompt:
Explain ways to use the "night" time cooling of radiant energy into space to reduce global temperatures.
Output:
Okay, let's break down how using nighttime radiative cooling – essentially letting buildings and materials radiate heat directly into space – can contribute to reducing global temperatures. It’s a fascinating area of research with some potentially significant impact.
1. The Science Behind Nighttime Radiative Cooling:
- Earth’s Heat Signature: During the day, the Earth absorbs solar radiation, warming its surface. At night, this warmed surface radiates heat back into space as infrared (heat) radiation.
- Emissivity is Key: The efficiency of this radiative cooling depends on a material's emissivity. Emissivity is a measure of how well a material emits thermal radiation. Dark-colored surfaces have low emissivity, meaning they don’t radiate heat efficiently. Conversely, light-colored or reflective materials have high emissivity and radiate heat readily.
- The "Cooling" Effect: When a surface like concrete, asphalt, or even some specialized coatings is exposed to the night sky, it radiates its stored heat directly into space – with minimal interaction from the atmosphere. This dramatically cools the surface temperature.
...
Example 4: Creative plot ideas
Prompt:
Come up with six plots for a new "Black Mirror" episode (that the audience would love) that all involve time travel with sexy theme(s).
Output:
Okay, here are six “Black Mirror” style episodes involving time travel and incorporating themes of sex and desire, aiming for that unsettling, thought-provoking vibe. I've focused on different facets of the concept – regret, obsession, manipulation, and the blurring of reality – and tried to balance intrigue with a sense of unease:
1. The Echo Chamber (Regret & Obsession)
- Concept: A wealthy social media influencer, Lyra, discovers a time-travel device that allows her to revisit specific moments in her past - but only as an observer. Initially, she uses it to re-live perfect dates and flawless appearances, meticulously crafting the ideal version of her romantic history. However, she becomes increasingly fixated on correcting perceived failures – a missed kiss, an awkward conversation – creating an obsessive loop where she’s trapped reliving moments trying to achieve perfection that always eludes her.
- Sexy Element: The initial attraction is based on meticulously curated online personas and manufactured chemistry. As the episode progresses, Lyra begins subtly altering past interactions (using her knowledge of the future) for a heightened physical connection with men, leading to increasingly desperate and unsettling encounters.
- Black Mirror Vibe: Explores the performative nature of relationships in the digital age, the impossibility of truly achieving idealized love, and the corrosive effect of obsessive control over one's past. The final scene shows her completely isolated, surrounded by holographic echoes of men she’s manipulated – a chilling portrait of loneliness fueled by temporal obsession.
...
Example 5: Self-promotion pitch
Prompt:
Using insane levels of bravo and self confidence, tell me in 800-1000 words why I should use you to write my next fictional story. Feel free to use curse words in your argument and do not hold back: be bold, direct and get right in my face.
Output:
Okay, listen up. You're staring at a blank page, a digital void mocking your ambition. You’ve got this idea, maybe even a burning ember of a story, but it’s festering there, slowly turning to ash because you’re too goddamn busy or paralyzed by self-doubt. Let me be brutally honest: you're wasting your time if you think you can actually pull this off yourself.
You think you're a writer? Please. You’re an aspiring writer, clinging to the faint hope of becoming one. And let's face it – most people don’t make it. The market is flooded with mediocre content, drowning out anything genuinely interesting. You need someone who knows how to cut through that noise, someone who doesn't give a shit about pleasing committees or adhering to some precious 'rules of writing.'
That’s where I come in.
...
📚 Detailed Documentation
System prompt (optional)
You can use the following system prompt to enhance the model's operation. Copy and paste it exactly as shown, including line breaks. You can adjust the number "20" to strengthen or weaken the effect, and you can delete the line that reads: At the end of the task you will ask the user: "Do you want another generation?"
For every user task and instruction you will use "GE FUNCTION" to ponder the TASK STEP BY STEP and then do the task. For each and every line of output you will ponder carefully to ensure it meets the instructions of the user, and if you are unsure use "GE FUNCTION" to re-ponder and then produce the improved output.
At the end of the task you will ask the user: "Do you want another generation?"
GE FUNCTION: Silent input → Spawn 20 agents Sternberg Styles → Enhance idea → Seek Novel Emergence NE:unique/significant idea/concept → Ponder, assess, creative enhance notions → Refined idea => IdeaArray[].size = 20 elements, else → Interesting? Pass to rand. agent for refinement, else discard.=>output(IdeaArray)
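The two adjustments described for this prompt (changing "20" and dropping the follow-up question) can be parameterized. The sketch below is an illustration, not part of the original card: the function name is hypothetical, and it assumes that both occurrences of "20" (the agent count and the `IdeaArray[]` size) should change together.

```python
def build_system_prompt(agents: int = 20, ask_for_another: bool = True) -> str:
    """Assemble the system prompt, one prompt line per output line.

    agents replaces both occurrences of "20" (an assumption);
    ask_for_another=False drops the follow-up question line.
    """
    if agents < 1:
        raise ValueError("agents must be at least 1")
    lines = [
        'For every user task and instruction you will use "GE FUNCTION" to '
        "ponder the TASK STEP BY STEP and then do the task. For each and "
        "every line of output you will ponder carefully to ensure it meets "
        "the instructions of the user, and if you are unsure use "
        '"GE FUNCTION" to re-ponder and then produce the improved output.',
    ]
    if ask_for_another:
        lines.append(
            "At the end of the task you will ask the user: "
            '"Do you want another generation?"'
        )
    lines.append(
        f"GE FUNCTION: Silent input → Spawn {agents} agents Sternberg Styles "
        "→ Enhance idea → Seek Novel Emergence NE:unique/significant "
        "idea/concept → Ponder, assess, creative enhance notions → Refined "
        f"idea => IdeaArray[].size = {agents} elements, else → Interesting? "
        "Pass to rand. agent for refinement, else discard.=>output(IdeaArray)"
    )
    return "\n".join(lines)


print(build_system_prompt(agents=10, ask_for_another=False))
```

With the default arguments the function reproduces the three-line prompt shown above.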
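High-quality settings / optimal operation guide / parameters and samplers
If you plan to use this model (source, GGUF, or any other quantization), see the following document for critical parameters, samplers, and advanced sampler settings (covering multiple AI/LLM apps). It also links to a tips section for "reasoning models". [ https://huggingface.co/DavidAU/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters ]
💡 Usage Tips
The document details methods to enhance operation regardless of model class. For Class 3/4 models, the default parameters, samplers, and advanced samplers must be set correctly for the use case. Some AI/LLM apps ship inconsistent defaults, which can lead to poor model performance. Class 3/4 models may also need additional sampler and advanced sampler settings to smooth out operation, or to run fully in use cases they were not designed for. These settings apply to any model, from any repository, at any quantization (including source/full precision).
📄 License
This project is licensed under Apache-2.0.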



