Sarashina2 7b
A large Japanese/English bilingual language model trained by SB Intuitions, based on the Llama2 architecture
Downloads 1,561
Release Time: 6/7/2024
Model Overview
Sarashina2 is a large language model based on the Llama2 architecture that supports both Japanese and English. It is trained on cleaned Japanese web text from Common Crawl and on the English SlimPajama corpus, and is intended for text generation tasks.
Model Features
Bilingual Support
Supports text generation in both Japanese and English
High-Quality Training Data
Uses a rigorously cleaned Japanese corpus extracted from Common Crawl and an English corpus drawn from SlimPajama
Optimized Tokenizer
Uses a SentencePiece tokenizer built on a unigram language model with byte fallback, so raw text can be passed to the tokenizer directly without a separate pre-tokenization step
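To make the unigram-plus-byte-fallback idea concrete, here is a toy sketch in Python. It is not the SentencePiece implementation and does not use the model's real vocabulary: the function names, the tiny vocabulary, its scores, and the fallback penalty are all made up for illustration, and SentencePiece's whitespace marker is ignored. It picks the highest-scoring segmentation with a Viterbi search and, for any character missing from the vocabulary, emits one token per UTF-8 byte instead of an unknown token.

```python
import math


def byte_pieces(ch):
    """Spell one character as byte tokens such as <0xE2>, one per UTF-8 byte."""
    return [f"<0x{b:02X}>" for b in ch.encode("utf-8")]


def unigram_tokenize(text, log_probs, fallback_penalty=-20.0):
    """Viterbi segmentation under a unigram model, with byte fallback.

    log_probs maps each known piece to a log-probability (hypothetical values).
    A character that is not in log_probs is still allowed, at a heavy cost per
    byte, and is emitted as byte tokens.
    """
    n = len(text)
    max_len = max(len(p) for p in log_probs)  # assumes a non-empty vocabulary
    best = [-math.inf] * (n + 1)  # best[i]: best score for text[:i]
    back = [0] * (n + 1)          # back[i]: start index of the last piece
    best[0] = 0.0
    for end in range(1, n + 1):
        for start in range(max(0, end - max_len), end):
            piece = text[start:end]
            if piece in log_probs:
                score = best[start] + log_probs[piece]
            elif end - start == 1:
                # Byte fallback: pay the penalty once per UTF-8 byte.
                score = best[start] + fallback_penalty * len(piece.encode("utf-8"))
            else:
                continue
            if score > best[end]:
                best[end], back[end] = score, start

    # Walk the back-pointers from the end of the text to recover the pieces.
    groups = []
    end = n
    while end > 0:
        start = back[end]
        piece = text[start:end]
        groups.append([piece] if piece in log_probs else byte_pieces(piece))
        end = start
    return [tok for group in reversed(groups) for tok in group]


# Made-up vocabulary: the umbrella symbol is deliberately missing.
toy_vocab = {"今日": -2.0, "の": -1.5, "天気": -2.5, "今": -4.0, "日": -4.0}
tokens = unigram_tokenize("今日の天気☔", toy_vocab)
print(tokens)
```

Because every character can always be spelled out as bytes, no input is ever mapped to an unknown token, which is why raw text can be handed to such a tokenizer without prior word segmentation.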
Model Capabilities
Japanese Text Generation
English Text Generation
Context Understanding
Use Cases
Content Creation
Weather Report Generation
Generates weather-related descriptions from a short prompt
Given the same prompt, the model can produce several distinct weather descriptions
Daily Dialogue Simulation
Simulates everyday conversation scenarios
The model can generate contextually appropriate dialogue content
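A minimal sketch of how such prompt-based generation might be run with the Hugging Face `transformers` library. Several things here are assumptions rather than details stated on this page: the Hub identifier `sbintuitions/sarashina2-7b`, the helper names `generation_kwargs` and `generate`, the sampling values, and the sample prompt. Loading with `device_map="auto"` also assumes the `accelerate` package is installed and that enough GPU or CPU memory is available for a 7B model.

```python
# Assumed Hugging Face Hub identifier; this page does not state it.
MODEL_ID = "sbintuitions/sarashina2-7b"


def generation_kwargs(num_samples=3, max_new_tokens=30):
    """Sampling settings for open-ended continuation (illustrative values)."""
    return {
        "max_new_tokens": max_new_tokens,
        "do_sample": True,                    # sample instead of greedy decoding
        "num_return_sequences": num_samples,  # several continuations per prompt
    }


def generate(prompt, **overrides):
    """Return a list of sampled continuations of `prompt`."""
    # Heavy imports are deferred so the settings helper works without them.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, torch_dtype=torch.bfloat16, device_map="auto"
    )
    generator = pipeline("text-generation", model=model, tokenizer=tokenizer)
    kwargs = {**generation_kwargs(), **overrides}
    outputs = generator(prompt, pad_token_id=tokenizer.pad_token_id, **kwargs)
    return [out["generated_text"] for out in outputs]


# Example call (downloads the model weights on first use):
# for text in generate("おはようございます、今日の天気は"):
#     print(text)
```

The prompt is passed as plain text and the model simply continues it, so a weather opener or the first lines of a conversation can be used the same way. Sampling is enabled so that repeated calls return different continuations.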