S

SIMS 7B

Developed by slprl
A speech-language model based on Qwen2.5-7B extension, supporting speech-text interleaved training and cross-modal generation
Downloads 51
Release Time : 3/31/2025

Model Overview

This model is fine-tuned by extending the vocabulary of Qwen2.5-7B, adding 500 speech tokens, focusing on the scalability research of interleaved speech-text SLM. It can be used for generating speech segment continuations or cross-modal generation.

Model Features

Efficient Scalability
Compared to pure speech SLMs, it achieves higher computational resource utilization efficiency, with fundamentally different scaling dynamics.
Cross-modal Generation
Supports generating text continuations from speech prompts or generating speech continuations from speech-text prompts.
Knowledge Transfer
Achieves knowledge transfer through speech-text interleaved training initialized from pre-trained text language models.

Model Capabilities

Speech segment continuation generation
Speech-to-text cross-modal generation
Speech-text interleaved processing

Use Cases

Speech Generation
Speech Continuation Generation
Generates natural speech continuations based on input speech segments.
Performs comparably to mainstream models in speech semantic metrics.
Cross-modal Applications
Speech-to-Text Generation
Generates relevant text content based on speech prompts.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase