Moonshot V1 8k

An 8K-context-window language model from Moonshot AI, aimed at text processing and code generation, with support for a single-round dialogue priority mechanism. Measured generation speed is 200 tokens/s, and API call latency is 30% lower than the industry average.

Intelligence (Weak)

Speed (Relatively Fast)

Input Supported Modalities

Is Reasoning Model

Context Window: 8,000 tokens

Maximum Output Tokens: 8,000 tokens
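The two limits above interact when sizing a request. The following is a minimal sketch, assuming the prompt and the completion share the same 8,000-token context window (common for chat APIs, but not stated on this page). The function name `completion_budget` is hypothetical and is not part of any Moonshot SDK.

```python
# Limits as listed on this page.
CONTEXT_WINDOW = 8_000      # tokens
MAX_OUTPUT_TOKENS = 8_000   # tokens


def completion_budget(prompt_tokens: int) -> int:
    """Return the largest completion length that still fits.

    Assumption: prompt tokens and completion tokens are counted against
    one shared context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # Never exceed the output cap, never go below zero.
    return max(0, min(MAX_OUTPUT_TOKENS, remaining))


for prompt in (500, 6_000, 9_000):
    print(f"prompt={prompt} tokens -> up to {completion_budget(prompt)} completion tokens")
```

Under that assumption, a 6,000-token prompt leaves room for at most 2,000 completion tokens, and a prompt longer than the window leaves none.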

Knowledge Cutoff

Pricing

Input: ￥0.5 /M tokens

Output: - /M tokens

Blended Price: ￥0.5 /M tokens
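As a rough illustration of the per-million-token pricing, the sketch below estimates input cost only, using the ￥0.5 /M tokens input price listed above. The output price is not given on this page, so it is deliberately left out. The function name `estimate_input_cost` is hypothetical.

```python
from decimal import Decimal

# Input price as listed on this page: ￥0.5 per million tokens.
INPUT_PRICE_PER_MILLION = Decimal("0.5")


def estimate_input_cost(input_tokens: int,
                        price_per_million: Decimal = INPUT_PRICE_PER_MILLION) -> Decimal:
    """Estimate the input-side cost in ￥ for a given number of input tokens."""
    if input_tokens < 0:
        raise ValueError("input_tokens must be non-negative")
    return price_per_million * Decimal(input_tokens) / Decimal(1_000_000)


# A prompt that fills the whole 8,000-token context window.
print(f"8,000 input tokens: ￥{estimate_input_cost(8_000)}")
# One million input tokens, for comparison with the listed price.
print(f"1,000,000 input tokens: ￥{estimate_input_cost(1_000_000)}")
```

`Decimal` is used here to avoid floating-point rounding in currency arithmetic; actual billing rules (rounding, minimum charges, output tokens) may differ.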

Quick Simple Comparison

Input

Output

kimi-latest

moonshot-v1-32k

￥0.14

moonshot-v1-8k

￥0.07

Basic Parameters

moonshot-v1-8k Technical Parameters

Parameter Count: Not Announced

Context Length: 8,000 tokens

Training Data Cutoff

Open Source Category: Proprietary

Multimodal Support: Text Only

Throughput

Release Date: 2025-04-01

Response Speed: 200 tokens/s
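To put the listed response speed in perspective, here is a minimal sketch that converts an output length into an approximate generation time. It assumes a constant 200 tokens/s and ignores network latency and time to first token, so it is a lower-bound estimate rather than a measured figure. The function name `estimate_generation_seconds` is hypothetical.

```python
# Response speed as listed on this page.
TOKENS_PER_SECOND = 200.0


def estimate_generation_seconds(output_tokens: int,
                                tokens_per_second: float = TOKENS_PER_SECOND) -> float:
    """Approximate time to stream `output_tokens` at a constant rate."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return output_tokens / tokens_per_second


for n in (1_000, 8_000):
    print(f"{n} output tokens: about {estimate_generation_seconds(n):.1f} s")
```

At this rate, a 1,000-token answer would take about 5 seconds to generate, and a full 8,000-token output about 40 seconds.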

Benchmark Scores

Below is the performance of moonshot-v1-8k in various standard benchmark tests. These tests evaluate the model's capabilities in different tasks and domains.

Intelligence Index

Indicator of the large language model's overall intelligence level

Coding Index

Indicator of AI model performance on coding tasks

Math Index

Indicator of the model's ability to solve mathematical problems and carry out mathematical reasoning

MMLU Pro

Massive Multitask Language Understanding (Pro) - A harder, more reasoning-focused version of MMLU, testing text-based multiple-choice questions across many subjects

GPQA

Graduate-Level Google-Proof Q&A - Testing expert-level knowledge in biology, physics, and chemistry, using the Diamond subset of questions

HLE

Humanity's Last Exam - A benchmark of expert-level questions across many academic subjects, designed to sit at the frontier of human knowledge