B

Buddhi 128k Chat 7b

Developed by aiplanet
Buddhi-128k-Chat is a pioneering general-purpose chat model with a 128K context window, finely tuned based on Mistral 7B Instruct and optimized through innovative YaRN technology to handle extended context lengths of up to 128,000 tokens.
Downloads 196
Release Time : 4/2/2024

Model Overview

Buddhi-128k-Chat is the first general-purpose chat model featuring a 128K context window. It is finely tuned based on Mistral 7B Instruct and optimized using the innovative YaRN (Yet another Rope Extension) technology, enabling it to process extended context lengths of up to 128,000 tokens. This enhancement allows Buddhi to maintain deep contextual understanding in long documents or conversations, excelling particularly in tasks requiring extensive context retention, such as comprehensive document summarization, detailed narrative generation, and complex Q&A.

Model Features

128K Context Window
Expands the context window to 128K via YaRN technology, capable of handling ultra-long texts and complex dialogues
Fine-tuned on Mistral-7B Instruct
Inherits the superior reasoning capabilities of Mistral-7B Instruct while being optimized for long-context tasks
Dynamic YaRN Technology
Employs NTK-aware dynamic adjustment technology to effectively extend positional embedding capabilities

Model Capabilities

Long-text comprehension
Complex dialogue processing
Document summarization
Narrative generation
Q&A systems

Use Cases

Document Processing
Long Document Summarization
Generates comprehensive summaries for ultra-long documents
Maintains accurate understanding of document content within the 128K context window
Full Book Analysis
Analyzes and answers questions about entire book contents
Capable of processing book contents up to 75,000 tokens in length
Dialogue Systems
Complex Dialogue
Handles complex dialogues with extensive context
Maintains contextual consistency in long conversations
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase