L3 8B Stheno V3.3 32K

Developed by Sao10K
A 32K long-context model based on Llama-3-8B that extends context length through PoSE training and specializes in role-playing and creative writing
Downloads 541
Release Time: 6/22/2024

Model Overview

This model is an optimized version of Llama-3-8B that uses PoSE training to extend the context window from 8K to 32K tokens. It adds stronger role-playing and creative writing capabilities while retaining general language understanding.
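To make the PoSE idea concrete, here is a minimal sketch of how position ids for one training sample could be generated. It is not the author's training code: the function name `pose_position_ids`, the two-chunk default, and the choice to leave the first chunk at offset 0 are assumptions made for illustration. The point is only that a sample of 8K tokens can be given position ids that reach into a 32K range, so the model sees large relative distances without training on 32K-token sequences.

```python
import random


def pose_position_ids(train_len, target_len, num_chunks=2, rng=None):
    """Return train_len position ids spread over [0, target_len).

    Hypothetical helper: the training window is cut into contiguous
    chunks, and each chunk is shifted by a non-decreasing skip offset.
    """
    rng = rng or random.Random()
    if not 0 < num_chunks <= train_len <= target_len:
        raise ValueError("need 0 < num_chunks <= train_len <= target_len")

    # Random cut points split the training window into num_chunks chunks.
    cuts = sorted(rng.sample(range(1, train_len), num_chunks - 1))
    bounds = [0] + cuts + [train_len]

    # Positions in the target range that the training window cannot cover.
    budget = target_len - train_len

    # Assumption: the first chunk keeps offset 0; later offsets never shrink,
    # so chunks cannot overlap and ids stay strictly increasing.
    skips = [0]
    for _ in range(num_chunks - 1):
        skips.append(rng.randint(skips[-1], budget))

    ids = []
    for i in range(num_chunks):
        ids.extend(p + skips[i] for p in range(bounds[i], bounds[i + 1]))
    return ids


ids = pose_position_ids(8192, 32768, rng=random.Random(0))
print(len(ids), ids[0], ids[-1])
```

In a real training loop these ids would be passed to the model in place of the default 0..N-1 positions, and resampled for every sample.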

Model Features

Extended context processing
Extends context length from 8K to 32K through PoSE training; basic tests reportedly show better results than conventional RoPE scaling
High-quality role-playing
Role-playing samples were thoroughly cleaned and manually curated to improve interaction quality
Creative writing enhancement
The number of creative writing training samples was doubled to improve generation quality
Optimized training configuration
Uses a tuned RoPE theta value of 2 million for training stability
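As a rough illustration of what the RoPE theta setting changes, the sketch below computes the rotary wavelengths for two theta values using the standard RoPE formula. The baseline of 500,000 is the stock Llama-3 value and the head dimension of 128 is that of Llama-3-8B; the helper names are hypothetical, and nothing here reproduces the author's tuning procedure.

```python
import math


def rope_wavelengths(theta, head_dim=128):
    """Wavelength, in tokens, of each rotary frequency pair.

    Standard RoPE uses inv_freq_i = theta ** (-2i / head_dim),
    so the wavelength of pair i is 2 * pi * theta ** (2i / head_dim).
    """
    return [2 * math.pi * theta ** (2 * i / head_dim)
            for i in range(head_dim // 2)]


def pairs_completing_cycle(theta, context_len, head_dim=128):
    """Count frequency pairs that complete a full rotation within context_len."""
    return sum(w <= context_len for w in rope_wavelengths(theta, head_dim))


# 500_000 is the Llama-3 default; 2_000_000 is the value stated above.
for theta in (500_000.0, 2_000_000.0):
    longest = rope_wavelengths(theta)[-1]
    full = pairs_completing_cycle(theta, 32_768)
    print(f"theta={theta:>11,.0f}  longest wavelength={longest:,.0f} tokens  "
          f"pairs with a full cycle in 32K: {full}")
```

A larger theta stretches the slow-rotating dimensions, so more of them stay within a single rotation across the 32K window.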

Model Capabilities

Long text generation
Role-playing dialogue
Creative content generation
Instruction following
Context understanding

Use Cases

Entertainment & creation
Interactive role-playing
Immersive role-playing dialogues with AI
Subjective user reports describe the interaction quality as excellent
Creative writing assistance
Generating creative texts such as novels and poetry
Training data shows a 2x increase in creative writing samples
Long document processing
Long document summarization
Summarizing documents that fit within the 32K context window
Basic tests show better results than conventional RoPE scaling
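When summarizing long documents, the prompt and the generated summary share the same context window. The following sketch shows one way to check that budget and to split an over-long token sequence. It assumes "32K" means 32,768 tokens and that the document has already been tokenized; the function names and the `overhead` parameter (for prompt template tokens) are hypothetical.

```python
def fits_in_context(prompt_tokens, max_new_tokens, context_len=32_768):
    """True if the prompt plus the planned output fits in the window."""
    return prompt_tokens + max_new_tokens <= context_len


def split_token_ids(token_ids, max_new_tokens, context_len=32_768, overhead=0):
    """Split token ids into chunks that leave room for the output.

    overhead reserves space for instruction or template tokens
    (an assumption; set it to match the prompt format in use).
    """
    window = context_len - max_new_tokens - overhead
    if window <= 0:
        raise ValueError("no room left for document tokens")
    return [token_ids[i:i + window] for i in range(0, len(token_ids), window)]


# Stand-in for a tokenized 70,000-token document.
document = list(range(70_000))
chunks = split_token_ids(document, max_new_tokens=1_024)
print([len(c) for c in chunks])
```

Each chunk can then be summarized separately and the partial summaries combined in a final pass.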