S

Seed Coder Triton 8b V1

Developed by winglian
A large language model fine-tuned on a specific dataset based on the ByteDance-Seed/Seed-Coder-8B-Base model, supporting long sequence input and efficient training strategies.
Downloads 1,388
Release Time : 5/13/2025

Model Overview

This model is the result of fine-tuning Seed-Coder-8B-Base on the axolotl-ai-internal/gpumode-py2triton-reasoning-v2 dataset, suitable for specific domain task requirements.

Model Features

Long sequence support
Supports sequence input up to 16384, suitable for processing long texts or complex code
Efficient training strategy
Adopts sample packing and padding strategies, combined with various optimization plugins, to improve training efficiency
Optimized architecture
Uses optimization techniques such as LigerPlugin to improve the model architecture and enhance performance

Model Capabilities

Code generation
Logical reasoning
Long text processing

Use Cases

Code-related
Code generation
Generate code with specific functions according to requirements
The loss value on the evaluation set is 0.2177
Code reasoning
Understand and analyze the logic of existing code
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase