Chonky Modernbert Large 1

Developed by mirth
Chonky is a Transformer model capable of intelligently splitting text into meaningful semantic chunks, suitable for RAG systems.
Downloads 54
Release Time: 4/26/2025

Model Overview

This model divides text into semantically coherent segments. The segments can be passed to embedding-based retrieval systems or to language models as part of a RAG workflow.
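As a rough illustration of what chunking produces, the sketch below cuts a string at a list of boundary offsets. The function name split_at_boundaries and the idea that boundaries arrive as character offsets are assumptions made for this example; this page does not describe the model's actual output format, and the offset used here is hand-picked rather than predicted.

```python
from typing import List


def split_at_boundaries(text: str, boundary_ends: List[int]) -> List[str]:
    """Cut `text` after each character offset in `boundary_ends`.

    `boundary_ends` is a hypothetical stand-in for model output: the
    character offsets at which a segment is assumed to end.
    """
    chunks = []
    start = 0
    for end in sorted(set(boundary_ends)):
        # Skip offsets outside the text or at/before the previous cut.
        if end <= start or end >= len(text):
            continue
        chunk = text[start:end].strip()
        if chunk:
            chunks.append(chunk)
        start = end
    # Whatever follows the last boundary forms the final chunk.
    tail = text[start:].strip()
    if tail:
        chunks.append(tail)
    return chunks


text = "Cats sleep a lot. They nap all day. Rust is a systems language."
# Hand-picked boundary for illustration, not a real model prediction.
boundary = text.index(" Rust")
chunks = split_at_boundaries(text, [boundary])
for chunk in chunks:
    print(chunk)
```

In a real pipeline the boundary list would come from the model rather than from a hard-coded position.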

Model Features

Intelligent Semantic Chunking
Capable of splitting text into meaningful semantic chunks while maintaining content coherence.
RAG System Optimization
Designed specifically for Retrieval-Augmented Generation (RAG) systems, optimizing chunk quality.
Long Sequence Support
Fine-tuned on sequences of 1024 tokens (the base model supports sequences of up to 8192 tokens).
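Because fine-tuning used sequences of length 1024, a document longer than that may need to be processed in pieces. The sketch below computes overlapping window spans over a token sequence. It is only one possible approach: the helper name window_spans and the overlap of 128 are assumptions for illustration, and this page does not state how the model or its tooling handles long inputs.

```python
from typing import List, Tuple


def window_spans(
    n_tokens: int, max_len: int = 1024, overlap: int = 128
) -> List[Tuple[int, int]]:
    """Return (start, end) token index pairs covering `n_tokens` tokens.

    `max_len` mirrors the fine-tuning length quoted above; `overlap` is an
    arbitrary illustrative value, not a documented setting.
    """
    if max_len <= 0 or not 0 <= overlap < max_len:
        raise ValueError("need max_len > 0 and 0 <= overlap < max_len")
    spans = []
    start = 0
    step = max_len - overlap  # how far each new window advances
    while start < n_tokens:
        end = min(start + max_len, n_tokens)
        spans.append((start, end))
        if end == n_tokens:
            break  # the last window already reaches the end
        start += step
    return spans


spans = window_spans(2500)
print(spans)
```

Each window shares its first 128 positions with the end of the previous one, so a boundary that falls near a window edge is seen with context on both sides in at least one window.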

Model Capabilities

Text Semantic Chunking
Paragraph Segmentation
RAG System Preprocessing

Use Cases

Information Retrieval
RAG System Preprocessing
Preparing semantically coherent text chunks for retrieval-augmented generation systems
Improves retrieval system accuracy and relevance
Text Processing
Document Segmentation
Splitting long documents into meaningful paragraphs
Facilitates subsequent processing and analysis
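To make the retrieval use case concrete, here is a toy sketch of how chunks might be ranked against a query. It uses plain word overlap (Jaccard similarity) as a stand-in for an embedding model; the helper names tokenize and rank_chunks and the sample chunks are hypothetical and are not part of Chonky.

```python
import re
from typing import List, Set


def tokenize(text: str) -> Set[str]:
    """Lowercase the text and return its set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def rank_chunks(query: str, chunks: List[str]) -> List[str]:
    """Order chunks by Jaccard word overlap with the query, best first."""
    query_words = tokenize(query)

    def score(chunk: str) -> float:
        chunk_words = tokenize(chunk)
        union = query_words | chunk_words
        return len(query_words & chunk_words) / len(union) if union else 0.0

    return sorted(chunks, key=score, reverse=True)


# Sample chunks written by hand; a real system would take them from the chunker.
chunks = [
    "Cats sleep for most of the day.",
    "Rust is a systems programming language.",
]
best = rank_chunks("Which language is Rust?", chunks)[0]
print(best)
```

A production RAG system would typically replace the overlap score with vector similarity over embeddings, but the flow is the same: chunk first, then score each chunk against the query.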