MN Slush
Slush is a two-stage model trained with high LoRA dropout rate, focusing on enhancing creativity and role-playing capabilities
Downloads 59
Release Time : 11/20/2024
Model Overview
Slush is a specially trained large language model that adopts a two-stage training approach. The first stage continues the pre-training of the base model to improve creativity and writing abilities, while the second stage involves fine-tuning to enhance role-playing capabilities. The model is suitable for non-all-age users and is particularly ideal for role-playing scenarios.
Model Features
Two-stage Training
Adopts pre-training continuation and fine-tuning stages to enhance creativity and role-playing capabilities respectively
High LoRA Dropout Rate
Uses a LoRA dropout rate of 0.5 during training, optimizing model performance based on the latest research findings
Long Context Support
Supports long context processing of up to 16384 tokens
LoRA+ Technology
Employs LoRA+ training method with a learning rate ratio of 15 to optimize training effectiveness
Model Capabilities
Creative writing
Role-playing dialogue
Long text generation
Instruction following
Use Cases
Entertainment
Role-playing Games
Engages in immersive interactions as in-game characters with users
Delivers high-quality role-playing experiences
Creative Writing Assistance
Helps users generate creative stories or novel content
Produces creative text outputs
Dialogue Systems
Personalized Chatbots
Builds chatbots with specific personality traits
Provides personalized and consistent dialogue experiences
Featured Recommended AI Models
Š 2025AIbase