
SIMS Llama3.2 3B

Developed by slprl
This is a speech-language model (SLM) fine-tuned from Llama-3.2-3B. It was built to study how interleaved speech-text SLMs scale, and it supports both speech and text generation tasks.
Downloads 54
Release Date: 4/2/2025

Model Overview

This is a Speech Language Model (SLM) that operates on discrete HuBERT tokens. Given a speech-text prompt, it generates a speech or text continuation.
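
To make the idea of a speech-text prompt concrete, here is a minimal sketch of how text and discrete speech units might be interleaved into one prompt string. The `<unit_N>` token format, the helper names, and the deduplication of repeated units are all illustrative assumptions, not the released model's actual vocabulary or preprocessing.

```python
from itertools import groupby

# Hypothetical token naming; the real model's unit vocabulary may differ.
def unit_token(unit_id: int) -> str:
    return f"<unit_{unit_id}>"

def dedup_units(units):
    """Collapse runs of identical consecutive unit IDs (assumed preprocessing step)."""
    return [unit for unit, _ in groupby(units)]

def build_interleaved_prompt(segments):
    """Join ("text", str) and ("speech", list[int]) segments into one prompt string."""
    pieces = []
    for modality, content in segments:
        if modality == "text":
            pieces.append(content)
        elif modality == "speech":
            pieces.append("".join(unit_token(u) for u in dedup_units(content)))
        else:
            raise ValueError(f"unknown modality: {modality}")
    return " ".join(pieces)

prompt = build_interleaved_prompt([
    ("text", "The weather today is"),
    ("speech", [12, 12, 12, 87, 87, 5]),  # made-up unit IDs for illustration
])
print(prompt)  # The weather today is <unit_12><unit_87><unit_5>
```

In practice the unit IDs would come from a HuBERT-based speech tokenizer applied to audio; the IDs above are placeholders.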

Model Features

Efficient Scalability
Through interleaved speech-text initialization, the model scales significantly more efficiently with compute than pure speech SLMs.
Knowledge Transfer
Initialized from a pre-trained Text Language Model (TextLM), enabling knowledge transfer and enhancing model performance.
Multimodal Support
Supports both speech and text generation, including cross-modal tasks such as generating text continuations from speech prompts.

Model Capabilities

Speech Generation
Text Generation
Cross-Modal Task Handling

Use Cases

Speech Generation
Speech Segment Continuation
Generate continuations of speech segments based on given speech prompts.
Cross-Modal Tasks
Speech-to-Text Generation
Generate text continuations based on speech prompts.
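
For cross-modal use, the generated continuation has to be separated by modality before it can be consumed. The following is a rough sketch of that post-processing step, assuming (hypothetically) that speech units appear in the decoded output as `<unit_N>` tokens; the actual token format of the released model may differ.

```python
import re

# Hypothetical token format for discrete speech units.
UNIT_RUN = re.compile(r"(?:<unit_\d+>)+")
UNIT_ID = re.compile(r"<unit_(\d+)>")

def split_by_modality(generated: str):
    """Split a decoded string into ("text", str) and ("speech", list[int]) segments."""
    segments = []
    pos = 0
    for match in UNIT_RUN.finditer(generated):
        text = generated[pos:match.start()].strip()
        if text:
            segments.append(("text", text))
        units = [int(u) for u in UNIT_ID.findall(match.group())]
        segments.append(("speech", units))
        pos = match.end()
    tail = generated[pos:].strip()
    if tail:
        segments.append(("text", tail))
    return segments

print(split_by_modality("<unit_12><unit_87> and it will rain"))
# [('speech', [12, 87]), ('text', 'and it will rain')]
```

Text segments can be used directly, while speech segments would typically be passed to a unit-based vocoder to synthesize audio.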