
Caplattessdolxaboros Yi 34B 200K DARE Ties HighDensity

Developed by brucethemoose
This is a high-density merge built on the Yi-34B-200K foundation model. It combines several fine-tunes of that same base using the DARE Ties method and keeps the base model's 200K-token long-context capability.
Downloads 94
Release Time: 12/9/2023

Model Overview

The model merges several fine-tunes of the same base, including Dolphin-2.2-yi-34b-200k, Nous-Capybara-34B, and Tess-M-v1.4, using mergekit's DARE Ties method. It retains Yi-34B-200K's long-context capability; its benchmark scores are listed under Use Cases.
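To make the merge method more concrete, here is a minimal sketch of the DARE Ties idea on flat lists of numbers. It is a simplified illustration, not mergekit's implementation: the function names `dare_prune` and `dare_ties_merge`, the `weights` and `density` parameters, and the toy values are all assumptions made for this example, and no weight normalization is applied.

```python
import random


def dare_prune(delta, density, rng):
    """DARE step: keep each delta entry with probability `density`,
    then rescale the survivors by 1/density so the expected value is unchanged."""
    return [d / density if rng.random() < density else 0.0 for d in delta]


def dare_ties_merge(base, finetuned, weights, density, seed=0):
    """Toy DARE Ties merge (hypothetical helper, not the mergekit API).

    base      -- list of floats standing in for the base model's parameters
    finetuned -- list of parameter lists, one per fine-tuned model
    weights   -- one merge weight per fine-tuned model
    density   -- fraction of each task vector that survives pruning
    """
    rng = random.Random(seed)
    # Task vectors: what each fine-tune changed relative to the base.
    deltas = [[f - b for f, b in zip(model, base)] for model in finetuned]
    pruned = [dare_prune(d, density, rng) for d in deltas]

    merged = []
    for i, b in enumerate(base):
        contribs = [w * p[i] for w, p in zip(weights, pruned)]
        # TIES step: elect a sign per parameter from the weighted sum,
        # then keep only the contributions that agree with it.
        sign = 1.0 if sum(contribs) >= 0 else -1.0
        agreeing = sum(c for c in contribs if c * sign > 0)
        merged.append(b + agreeing)
    return merged


if __name__ == "__main__":
    base = [0.0, 0.0]
    models = [[1.0, 1.0], [1.0, -3.0]]
    print(dare_ties_merge(base, models, weights=[0.5, 0.5], density=1.0))
```

In this sketch a higher `density` means fewer delta entries are dropped before the sign election, which is the sense in which a merge can be called "high density".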

Model Features

Long-context processing
Supports a context window of up to 200K tokens, suitable for lengthy documents and complex reasoning tasks.
High-density merging
Merges several fine-tunes of the same base with the DARE Ties method at a higher density than is usually recommended, with the aim of improving performance.
Multi-model advantage fusion
Integrates the strengths of multiple models like Dolphin, Capybara, and Tess, providing diverse capabilities.
Efficient inference
Runs on a 24GB GPU with exllamav2 at context lengths of roughly 45K-75K tokens.
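The context range quoted for a 24GB GPU depends largely on how much memory the key/value cache takes. The following is a rough back-of-the-envelope sketch, not a measurement: the function name `kv_cache_gib` is hypothetical, and the default layer count, KV-head count, and head dimension are assumptions about the Yi-34B architecture that should be checked against the model's config.json. It ignores the quantized weights and any runtime overhead.

```python
def kv_cache_gib(context_tokens, num_layers=60, num_kv_heads=8,
                 head_dim=128, bytes_per_value=2):
    """Estimate key/value cache size in GiB for a given context length.

    The defaults are assumed values for Yi-34B (verify against config.json).
    bytes_per_value=2 models an FP16 cache; use 1 for an 8-bit cache.
    """
    # Factor 2: one key tensor plus one value tensor per layer.
    bytes_per_token = 2 * num_layers * num_kv_heads * head_dim * bytes_per_value
    return bytes_per_token * context_tokens / 2**30


if __name__ == "__main__":
    for tokens in (45_000, 75_000):
        fp16 = kv_cache_gib(tokens)
        int8 = kv_cache_gib(tokens, bytes_per_value=1)
        print(f"{tokens} tokens: {fp16:.1f} GiB (FP16), {int8:.1f} GiB (8-bit)")
```

Under these assumptions the cache grows linearly with context length, so the usable context on a fixed-size GPU depends on how aggressively the weights and the cache are quantized.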

Model Capabilities

Text generation
Long-text understanding
Complex reasoning
Q&A systems
Knowledge-based Q&A

Use Cases

Knowledge-based Q&A
AI2 Reasoning Challenge
Performance on few-shot samples from the AI2 Reasoning Challenge (ARC)
Normalized accuracy 67.41
Commonsense reasoning
HellaSwag test
Commonsense reasoning capability on the HellaSwag dataset
Normalized accuracy 85.77
Mathematical reasoning
GSM8k math problems
Ability to solve elementary school math word problems
Accuracy 61.33