Jellyfish 13B

Developed by NECOUDBFM
Jellyfish-13B is a 13-billion-parameter large language model tailored for data preprocessing tasks, including error detection, data imputation, schema matching, and entity matching.
Downloads 102
Release Time: 10/16/2023

Model Overview

Jellyfish-13B is fine-tuned from the Open-Orca/OpenOrca-Platypus2-13B model with a focus on data preprocessing tasks. On these tasks its performance is reported to be comparable to GPT-3.5 and GPT-4. It can run locally at low cost, so the data being processed never has to leave the user's environment.

Model Features

Data Preprocessing Expert
Optimized specifically for data cleaning and preprocessing tasks
Efficient Local Operation
The 13B-scale model can be deployed locally, balancing performance and resource consumption
Dual-version Design
Offers standard and interpreter versions, suitable for system integration and end-user use respectively
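
As a rough illustration of the resource side of local deployment, the sketch below estimates the memory needed just to hold 13 billion weights at different precisions. The precisions shown (16-, 8-, and 4-bit) are assumptions chosen for illustration, not options stated for this model, and the estimate ignores activations, the KV cache, and framework overhead.

```python
def weight_memory_gib(num_params: float, bits_per_param: int) -> float:
    """Memory needed to store the weights alone, in GiB (2**30 bytes)."""
    total_bytes = num_params * bits_per_param / 8
    return total_bytes / 2**30


NUM_PARAMS = 13e9  # 13 billion parameters, as stated for Jellyfish-13B

# Hypothetical precisions; the source does not say which ones are supported.
for bits in (16, 8, 4):
    print(f"{bits}-bit weights: {weight_memory_gib(NUM_PARAMS, bits):.1f} GiB")
```

Under these assumptions the weights alone take about 24 GiB at 16-bit and about 6 GiB at 4-bit, which is why a 13B model is a plausible fit for a single workstation GPU.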

Model Capabilities

Error Detection
Data Imputation
Schema Matching
Entity Matching
Column Type Annotation
Attribute Value Extraction

Use Cases

Data Quality Management
Dataset Error Detection
Identify erroneous and outlier values in datasets
Achieved 95.59% F1 score on the Hospital dataset
Missing Value Imputation
Automatically fill missing values in datasets
Achieved 100% accuracy on the Buy dataset
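
To make the two metrics above concrete, here is a minimal sketch of how an F1 score (for error detection) and an accuracy (for imputation) could be computed. It assumes error detection is scored as a binary decision per cell, with 1 meaning "erroneous", and that imputed values are compared after trimming and lowercasing. The toy labels are invented for illustration and are not taken from the Hospital or Buy datasets, and the exact evaluation protocol behind the reported numbers is not given here.

```python
def precision_recall_f1(y_true, y_pred):
    """Binary precision, recall, and F1, where 1 marks an erroneous cell."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # errors correctly flagged
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # clean cells wrongly flagged
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # errors that were missed
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def accuracy(expected, predicted):
    """Share of imputed values equal to the ground truth (assumed normalization)."""
    if not expected:
        return 0.0
    hits = sum(
        1 for e, p in zip(expected, predicted)
        if e.strip().lower() == p.strip().lower()
    )
    return hits / len(expected)


# Toy data, invented for illustration only.
precision, recall, f1 = precision_recall_f1([1, 0, 1, 1, 0, 0], [1, 0, 0, 1, 1, 0])
print(f"precision={precision:.3f} recall={recall:.3f} f1={f1:.3f}")
print(f"accuracy={accuracy(['Sony', 'Apple'], ['sony ', 'Dell']):.2f}")
```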
Data Integration
Entity Matching
Identify records from different data sources that refer to the same entity
Achieved 98.51% F1 score on the DBLP-GoogleScholar dataset
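
For system integration, an entity-matching call typically means serializing two records into a prompt and parsing a short answer. The sketch below shows one way this could look. The prompt wording, the helper names (`serialize_record`, `build_entity_matching_prompt`, `parse_yes_no`), and the Yes/No answer format are all hypothetical and are not the model's official prompt template. The call to the model itself is left out.

```python
import re


def serialize_record(record: dict) -> str:
    """Flatten a record into 'attribute: value' pairs (assumed serialization)."""
    return ", ".join(f"{key}: {value}" for key, value in record.items())


def build_entity_matching_prompt(record_a: dict, record_b: dict) -> str:
    """Build a hypothetical entity-matching prompt; not the official template."""
    return (
        "You are given two records from different data sources.\n"
        "Decide whether they refer to the same real-world entity.\n"
        f"Record A: [{serialize_record(record_a)}]\n"
        f"Record B: [{serialize_record(record_b)}]\n"
        'Answer with only "Yes" or "No".'
    )


def parse_yes_no(response: str):
    """Return True for Yes, False for No, None if the answer is unclear."""
    match = re.match(r"\s*(yes|no)\b", response, flags=re.IGNORECASE)
    if match is None:
        return None
    return match.group(1).lower() == "yes"


# Invented example records, for illustration only.
a = {"title": "Efficient Query Processing", "year": 2001}
b = {"title": "Efficient query processing.", "year": 2001}
print(build_entity_matching_prompt(a, b))
print(parse_yes_no(" Yes."))
```

Returning `None` for anything other than a clear Yes or No lets the calling system route ambiguous answers to a retry or to manual review instead of silently counting them as non-matches.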