Qwen1.5-moe-tiny-random Open-Source Text Generation Model - Free for All Kinds of Text Creation Tasks

Qwen1.5 Moe Tiny Random

Developed by yujiepan

This is a small randomly initialized model based on the Qwen1.5-MoE architecture, using float16 precision, suitable for text generation tasks.

Large Language Model

Transformers

#Small-scale MoE architecture #float16 precision #Sliding window attention

Downloads 30

Release Time : 3/30/2024

Model Overview

This model is a smaller-scale randomly initialized version based on the Qwen/Qwen1.5-MoE-A2.7B-Chat configuration, primarily designed for text generation tasks.

Model Features

Compact design

Based on the Qwen1.5-MoE architecture but with a smaller scale, suitable for resource-limited environments.

float16 precision

Uses float16 precision to balance computational efficiency and model performance.

Sliding window support

Configured with sliding window (max_window_layers=1) to optimize long text processing.

Model Capabilities

Text generation

Dialogue systems

Use Cases

Dialogue systems

Simple dialogue

Can be used to build simple chatbots

Text generation

Short text generation

Generates short text content

🚀 Transformers

This library provides a randomly initialized model using the config from Qwen/Qwen1.5-MoE-A2.7B-Chat but with a smaller size. It's designed for text generation and the model is in float16.

🚀 Quick Start

This model is randomly initialized, leveraging the configuration from Qwen/Qwen1.5-MoE-A2.7B-Chat, yet with a reduced scale. Note that the model operates in float16 precision.

💻 Usage Examples

Basic Usage

import transformers
import torch
import os
from huggingface_hub import create_repo, upload_folder

source_model_id = 'Qwen/Qwen1.5-MoE-A2.7B-Chat'
save_path = '/tmp/yujiepan/qwen1.5-moe-tiny-random'
repo_id = 'yujiepan/qwen1.5-moe-tiny-random'

config = transformers.AutoConfig.from_pretrained(
    source_model_id, trust_remote_code=True)
config.hidden_size = 4
config.intermediate_size = 2
config.num_attention_heads = 4
config.num_hidden_layers = 2
config.num_key_value_heads = 2
config.moe_intermediate_size = 2
config.shared_expert_intermediate_size = 2
config.max_window_layers = 1
config.use_sliding_window = True
config.torch_dtype = torch.float16

model = transformers.AutoModelForCausalLM.from_config(
    config, trust_remote_code=True, torch_dtype=torch.float16)
model = model.half()

tokenizer = transformers.AutoTokenizer.from_pretrained(
    source_model_id, trust_remote_code=True)

result = transformers.pipelines.pipeline(
    'text-generation',
    model=model, tokenizer=tokenizer,
    device=0,
    max_new_tokens=16,
)('Hello World!')
print(result)

model.save_pretrained(save_path)
tokenizer.save_pretrained(save_path)

os.system(f'ls -alh {save_path}')
create_repo(repo_id, exist_ok=True)
upload_folder(repo_id=repo_id, folder_path=save_path)

Advanced Usage

There isn't a specific advanced usage scenario described in the original content. If you need to modify the model configuration further or use it in a more complex application, you can adjust the parameters in the above code according to your needs.

# You can modify the code above to adapt to different scenarios, such as changing the model configuration parameters or using different input texts.

Property	Details
Library Name	transformers
Pipeline Tag	text-generation
Inference	true

⚠️ Important Note

The model is randomly initialized and uses the config from Qwen/Qwen1.5-MoE-A2.7B-Chat but with a smaller size. Also, note that the model is in float16.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご