C

Cogview4 6B

Developed by THUDM
CogView4-6B is a text-to-image model based on the GLM-4-9B foundation model, supporting both Chinese and English, capable of generating high-quality images.
Downloads 333.85k
Release Time : 3/3/2025

Model Overview

CogView4-6B is a high-performance text-to-image model that can generate high-quality images based on text prompts, supporting various resolutions and complex text descriptions.

Model Features

High-resolution support
Supports multiple resolutions with width and height between 512px and 2048px, with total pixels not exceeding 2^21.
Low VRAM optimization
Significantly reduces VRAM usage through techniques like model CPU offloading and 4bit quantization of the text encoder.
High accuracy for Chinese text
Achieves an F1 score of 0.6168 in Chinese text accuracy evaluation, significantly outperforming other models.

Model Capabilities

Text-to-image generation
High-resolution image generation
Multilingual support (Chinese, English)

Use Cases

Creative design
Sports car design
Generate high-quality sports car images based on detailed text descriptions.
The generated images feature high levels of detail and realism, accurately reflecting the attributes described in the text.
Advertising and marketing
Product showcase
Generate attractive product display images based on product descriptions.
The generated images highlight the key features and selling points of the product.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase