B

Bytedance BAGEL 7B MoT INT8

Developed by Gapeleon
BAGEL is an open-source 7B active parameter multimodal foundation model supporting multimodal understanding and generation tasks
Downloads 190
Release Time : 5/21/2025

Model Overview

A multimodal model based on Mixture of Experts Transformer architecture, excelling in vision understanding, text generation, and image editing tasks

Model Features

Unified Multimodal Architecture
Simultaneously supports vision understanding and generation tasks, processing multiple modalities through a single model
Advanced Editing Capabilities
Supports free-form visual editing, multi-view synthesis, and world navigation tasks
Quantization Optimization
Provides INT8 quantized version for optimized inference efficiency
Emergent Properties
Demonstrates phased capability emergence with increased training data

Model Capabilities

Multimodal Understanding
Text-to-Image Generation
Image Editing
Multi-view Synthesis
World Navigation
Sequential Reasoning

Use Cases

Vision Understanding
Multimodal Q&A
Question answering system based on image content
Scored 85.0 on MMBench benchmark
Content Generation
Text-to-Image Generation
Generates high-quality images from text descriptions
Achieved composite score of 0.88 on GenEval benchmark
Image Editing
Intelligent Editing
Image editing based on natural language instructions
Scored 7.36 on GEdit-Bench-EN benchmark
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase